-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
python3-casacore cannot find dysco #234
Comments
Hi @phara92 thanks for reporting this. Not sure why I get a core dump trying to make my own:
|
@phara92 is there a specific reason you are using |
@Athanaseus I do not know what happens there in your case. It looks like the MS is corrupted. But I really do not know what is going on. @Athanaseus I do not know if I can share the MS I am trying to use with you. First I would need to ask the people who gave it to me. @gijzelaerr Since I am a tool developer, I thought |
I remeber now, try installing the |
|
About that dysco error I'm also not sure what is going on, since both casacore and dysco are the latest versions in KERN-6. Maybe ask upstream at the dysco repo what could be wrong. |
To me it seems to be a build issue. But I will ask upstream anyways. |
André says:
And apparently does not feel responsible for this. Do you think we can figure this out on the KERN side? |
Yeah this isn't very helpful and Andre doesn't care about proper management of SO numbers. Doing proper SO version managed is supposed to address this and would make it easier to pinpoint the issue. @Athanaseus can you have a look at this? Maybe just do a rebuild of the latest casacore and dysco package and see if we can reproduce the error, maybe indeed somewhere a version got mixed up meanwhile, it would help if we have a little MS that uses dysco. |
We're doing our best to make the software work for everyone, but it takes a lot of effort to keep everyone happy. Casacore has a feature to find storage managers at runtime, in this case As for the little test-MS, here is one. |
That's not a very collaborative remark, @gijzelaerr . And also just irrelevant: this doesn't have to do with proper SO versions of Dysco. The Dysco library ABI has never changed. It's the environment Dysco lives in, which is what Kern should fix. Of other packages, like aoflagger or wsclean, the ABI changes every release and even subreleases. Ole has a good solution for that, namely always depend the version of an executable on the exact same version of the lib. That works fine in Debian. If your system depends on me to do something with an SO version every time you release something, then indeed that's not going to work. However, again, the system that Ole uses works much better. But also, for things like SO versions or certain cmake Debian rules that Ole knows a lot better than me, Ole keeps patches in the Debian repository. If you want proper SO versions, go ahead and do that for Kern. |
just closing an issue is not very collaborative either. Most of the issues people have is with casacore and related libs having compatibility issues. for years I've tried to improve the situation by checking this and working together with upstream trying to improve the situation. Since I'm currently not paid to work in radio astronomy anymore, I'm doing this now in my spare time and I've lost the motivation to track this kind of issues. |
Frankly I didn't just close it, I gave an answer pointing as best as I could at what was the problem. I agree it would have been nicer if I had said "I'm closing this ticket here because I think it's not a Dysco issue I can fix", but it was late and thought closing implied this, so my apologies for that (the "comment and close" button is also just too easy to click ;) ). But throwing random accusations at me is not going to improve my willingness to help you. |
Yeah, I also could and should have brought the message differently, but I'm of the impression that SO versioning really is not seen as useful for managing dependencies at ASTRON, while Debian packages heavily relies on this. The Debian and KERN packages are almost the same, so maybe something is going wrong in the build procedure. Hopefully, @Athanaseus can find the time to look at this. |
I really do appreciate the effort by all of you :) :) :) |
I think this is a different discussion. I can't really speak for ASTRON as a whole, but I'm open to discussions about how to help packaging of software, but I'm not aware of any issue with the setup of SO versions in other packages I maintain. Ole seems to have a system for it which works well. Having a second version number going in parallel to the regular version nrs is a bit of a pain, certainly if it's required to increase whenever you do a release. We change ABIs all the time for some of the packages, and we can't keep up with that. Maybe a system that could work partly if is the SO version is set to the package version (this is done more often on Linux, but don't know if it is according to Debian rules). We always increase the package version for stable releases. That solves this therefore partly, i.e. one may install multiple libraries next to each other of different releases. It still wouldn't solve when distribution-specific patch releases are made in between our releases (which at least for wsclean/aoflagger has happened). |
As far as I remember, Ole or I have not introduced any ABI changes with
distro-specific patch releases. I understand the trouble, and it isn't
easy. In a couple of weeks, I'll have a holiday coming up, and hopefully, I
have a bit more time to look into this/these issues.
Op do 26 nov. 2020 om 13:27 schreef André Offringa <notifications@github.com
…:
I think this is a different discussion. I can't really speak for ASTRON as
a whole, but I'm open to discussions about how to help packaging of
software, but I'm not aware of any issue with the setup of SO versions in
other packages I maintain. Ole seems to have a system for it which works
well.
Having a second version number going in parallel to the regular version
nrs is a bit of a pain, certainly if it's required to increase whenever you
do a release. We change ABIs all the time for some of the packages, and we
can't keep up with that. Maybe a system that could work partly if is the SO
version is set to the package version (this is done more often on Linux,
but don't know if it is according to Debian rules). We always increase the
package version for stable releases. That solves this therefore partly,
i.e. one may install multiple libraries next to each other of different
releases. It still wouldn't solve when distribution-specific patch releases
are made in between our releases (which at least for wsclean/aoflagger has
happened).
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#234 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AACPVJETO4QNPQ6WHJLKBO3SRY3QDANCNFSM4T464YCQ>
.
--
Gijs Molenaar
http://pythonic.nl
|
Thanks @tammojan, it looks like the zipped file is empty though. |
@phiadaarr can you see if this is solved with KERN-7? Thanks! |
No, it is still not working. I use the following docker file now:
Same errors. |
@phiadaarr i'm surprised this still happens, since KERN-7 is a full rebuild of all packages with the latest versions. can you please give the exact commands you are running, the error and if possible the file you are trying to process, or preferably a small example file that triggers your error. |
Hi @phiadaarr, can you try:
|
With this
I get
I described that above. The measurement set I try to process is 5GBs big, so this is kind of sharable size-wise. I have to ask the person who shared the data with me, if I am allowed to share it with you for debugging purposes. I will let you know as soon as I have the answer. |
This is what I get with the ms @tammojan sent:
Dockerfile:
Docker run commands: Edit: MS link |
I cannot download this ms since it is on Slack and I do not have an account there. Could you share it in a form that I can access it? |
Thank you. I get the same error:
And if I use
My machine has the following CPU flags:
Maybe you assume during compilation something that is not present on that machine? I will try the same thing on a newer machine as well. |
I get the same error with this dockerfile and the MS from the google drive:
But when do a
Dysco has been uploaded 4 days after casacore, so it should be linked to the casacore version KERN-7. I think the segmentation fault is caused by an issue with dysco and your specific MS, since there are no errors about missing symbols. Maybe an idea to manually compile dysco using debug flags or install the debugging symbols package (search for debug here https://kernsuite.info/faq/) and get a proper core dump to examine the situation. |
about the machine CPU flags, we are quite conservative with our optimisation flags, so I don't expect this to be an issue. |
On a newer machine, everything works now. Thanks a lot! Shall we dig into the issue with the older machine (it has an |
Interesting, thanks for figuring that out. My guess is that the default dysco optimisation flags are then assuming a modern system. As far as I know, we don't touch the optimisation flags to avoid this kind of issue. I think for now it Is just good enough that we triangulated the issue, no further investigation required. The optimisation is a bit of an issue with the Debian packages, since we don't want to deliver slow packages but also keep support for older architectures, (but not too old). The 6376 seems to come from 2012, so i guess we can label that 'too old' :) |
It is unclear to me if 2012 is already too old. But fortunately I do have an alternative machine. The original issue |
When I do:
where
name.ms
is a dysco-compressed measurement set, I get the error:My docker file looks like this:
Do you have an idea what is going on? I found this issue: https://gitlab.rrz.uni-hamburg.de/hpc/lofar-build/-/issues/5 with this fix: https://gitlab.rrz.uni-hamburg.de/hpc/lofar-build/-/commit/0a38007638f9b311261e9e9b3289e97f2f8c77a3
Can this be integrated into KERN?
The text was updated successfully, but these errors were encountered: