Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Setup buildbot for OpenBLAS #785

Closed
xianyi opened this issue Mar 1, 2016 · 18 comments
Closed

Setup buildbot for OpenBLAS #785

xianyi opened this issue Mar 1, 2016 · 18 comments

Comments

@xianyi
Copy link
Collaborator

xianyi commented Mar 1, 2016

Hi all,

I plan to setup buildbot for OpenBLAS. Dose anybody have some experiences?
Just go through the buildbot manual. It looks like I need to buy a VPS to run build-master. Then, I setup the build-slaver on OpenBLAS dev machines.

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 1, 2016

http://build.openblas.net/

Setup the following builder

  • core2-linux
  • nehalem-linux
  • sandybridge-linux
  • haswell-linux
  • barcelona-linux
  • bulldozer-linux
  • piledriver-linux

@matthew-brett
Copy link
Contributor

Sorry to be slow to get back to you. I have a lot of buildbot experience; let me know if I can help.

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 8, 2016

@matthew-brett , Do you know the reason for the following error? unstable network?

[Failure instance: Traceback (failure with no frames): <class 'twisted.internet.error.ConnectionLost'>: Connection to the other side was lost in a non-clean fashion.

@matthew-brett
Copy link
Contributor

I don't think I've seen these, at least, not as consistently as you have. Are the the slave machines on the same network as the master?

For my setup, I run no builds on the master machine - do you have the same?

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 8, 2016

@matthew-brett , the master and slave are not at same network.

@matthew-brett
Copy link
Contributor

@stefanv @yarikoptic - have you seen this error on your setups?

@xianyi - I assume there was no reason the buildbot slaves could have restarted during the build?

What versions of masters / slaves do you have?

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 8, 2016

The slaves are in China. The master is a VPS in US. I don't think the slave has restarted during the build.

@matthew-brett
Copy link
Contributor

Could this be relevant? http://trac.buildbot.net/ticket/2147

Is it possible the master has got overloaded?

Which buildslave / buildmaster versions are you running?

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 9, 2016

build master

  buildbot --version
Buildbot version: 0.8.12
Twisted version: 13.2.0

build slave

buildslave --version
Buildslave version: 0.8.8
Twisted version: 13.2.0

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 9, 2016

I just login to the slave. Although the twisted connection is closed, the test scripts (docker) are still running.

@matthew-brett
Copy link
Contributor

Maybe worth upgrading the buildslaves to latest?

@brada4
Copy link
Contributor

brada4 commented Mar 30, 2016

@xianyi would it be possible to set up clang scan-build (easy to set up with ubuntu/debian and "official" clang-3.7 clang-3.8). it is quite reveling regarding uninitialized variables

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 30, 2016

@brada4 , OK.
I will try to add clang

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 30, 2016

@brada4 , I added a clang build step at quick-xxx-linux builder. You can check the output http://build.openblas.net/builders/quick-core2-linux/builds/43/steps/shell_1/logs/stdio

Meanwhile, on OS X, the default compiler is clang.

@brada4
Copy link
Contributor

brada4 commented Mar 30, 2016

I did not mean alias gcc=clang, i meant alias make='scan-build-3.8 make'

e.g. within limits of my understanding

Index: kernel/generic/trsm_ltcopy_8.c
===================================================================
--- kernel/generic/trsm_ltcopy_8.c      (revision 6263)
+++ kernel/generic/trsm_ltcopy_8.c      (working copy)
@@ -798,7 +798,7 @@
        *(b +  1) = data02;
        *(b +  2) = data03;
        *(b +  3) = data04;
-       *(b +  4) = data05;
+       // *(b +  4) = data05;
       }

       if (ii < jj) {
Index: kernel/generic/zgemm_tcopy_4.c
===================================================================
--- kernel/generic/zgemm_tcopy_4.c      (revision 6263)
+++ kernel/generic/zgemm_tcopy_4.c      (working copy)
@@ -293,8 +293,8 @@

        aoffset1 += 8;
        aoffset2 += 8;
-       aoffset3 += 8;
-       aoffset4 += 8;
+       // aoffset3 += 8;
+       // aoffset4 += 8;

        boffset1 += m * 8;
        i --;
Index: kernel/generic/zneg_tcopy_4.c
===================================================================
--- kernel/generic/zneg_tcopy_4.c       (revision 6263)
+++ kernel/generic/zneg_tcopy_4.c       (working copy)
@@ -293,8 +293,8 @@

        aoffset1 += 8;
        aoffset2 += 8;
-       aoffset3 += 8;
-       aoffset4 += 8;
+       // aoffset3 += 8;
+       // aoffset4 += 8;

        boffset1 += m * 8;
        i --;
Index: .
===================================================================
--- .   (revision 6263)
+++ .   (working copy)

@xianyi
Copy link
Collaborator Author

xianyi commented Mar 30, 2016

Oh, I see.

However, when I ran scan-build-3.8 make, I met the following error.

make[1]: /usr/share/clang/scan-ar: Command not found
make[1]: *** [libs] Error 127
make[1]: Leaving directory `/home/traits/project/OpenBLAS_tmp/interface'
make: *** [libs] Error 1
scan-build: 48 bugs found.
scan-build: Run 'scan-view /tmp/scan-build-2016-03-31-034631-41225-1' to examine bug reports.

@brada4
Copy link
Contributor

brada4 commented Mar 30, 2016

ln -s `which ar` /usr/share/clang/scan-ar
ln -s `which ranlib` /usr/share/clang/scan-ranlib

You need some browser like w3m or firefox to see the result...

@martin-frbg
Copy link
Collaborator

@xianyi is buildbot setup still an open issue ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants