-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Crash with NullPointerException while validating an MRC file #356
Comments
Dear @pabloab, thanks for give QA catalogue a try. Which version of the software do you use, is it a release or did you build it from the source code? (I guess it a released one). Is this file downloadable from somewhere, or could you upload some records? If you do not want to make it available in the issue, you can send me in email: kirunews x gmail. So far I did not worked with records having Greek characters as subfield code. I guess the problem is cased by this line: FileUtils.writeStringToFile(file, content, Charset.defaultCharset(), true) Do you know what is the default character set on your machine? I think we should use UTF-8 instead. And out of curiosity; does UBA stands for Universidad de Buenos Aires? |
…e explicit UTF-8 instead of the default charset when writing to file.
I'm using v0.6.0, using the I exported a new mrc with just one record, and get the same error:
Yes, stands for Universidad de Buenos Aires. Glad you know about us 😄 |
@pabloab Thanks! I tested it. It really throws an exepction with 0.6.0 release, but it was fixed in 0.7.0, and also works well with the current developing version. So my suggestion is to use 0.7.0, or - if you would like to keep update with the latest features the current source code. My knowledge about Universidad de Buenos Aires is quite limited, but I know that one of my favorite authors, Jorge Luis Borges was a professor of English at your university before he was appointed as a director of the national library. The teaching activities (such as a seminar about the Saxon language) and teaching subjects (the thoughts of his favorite English writers) appeared in his writings here and there. But it is a good time to learn more about the university itself! |
I first tried to install v0.7.0, changing the I tried with v0.7.0 and indeed it doesn't crash. I had other issues that maybe I could file aside:
I also really like Borges (I recently revisited an interview). I was lucky enough to be a professor for a some years at that same campus, Puán, which now has its own film (from what I see in the trailer it captures the academic interns quite well). We also have a copy of H. P. Lovecraft's Necronomicon. Of course, I made sure it has its MARC record 😉 |
These are a number of different things:
Here the problem is that in order to process the control fields (mainly 008) we should figure out the type of the record from Leader/06 (Type of record) and Leader/07 (Bibliographic level). There are some possible valid combinations of these two characters, "a " in this case is not among them. You can add an extra flag to all analyses:
Borges: many thanks! I was not aware of that interview. I like a lot another one from the same time: https://www.youtube.com/watch?v=bNxzQSheCkc, this was done in Eglish for a US TV show. Borges said interesting things, like that Latin America did not produce literature which would be interesting for the rest of the world - it was some years before Marquez' Nobel prize, and the big success of other Latin American writers (Llosa, Cortasar etc.). Does Borges have a sculpture or some other memorial at Puán? The film seems to be interesting - the situation is quite typical in academic world. |
AFAIK there is no Borges statue on Puan. No one doubts his talent as a writer, but his politics opinions (which he himself says shouldn't be taken into account) are at the opposite extreme from a vast majority, especially there. |
Recently found this project after searching for regex patterns for each MARC 21 subfield. A little overwhelmed about all its features. I start trying to get a report of a set of 102964 records of a MARC file from a Koha (v22.05).
It stays processing for a couple of seconds and then starts sending all the records contents to stdout. Then it crashes with a NullPointerException.
It seems it doesn't consider a subfield code could be some Unicode char like a Greek letter (alpha, beta, gamma...):
The text was updated successfully, but these errors were encountered: