test harness and api changes #31
…nd into the main part of the program. tests not passing yet
…the first match, and the latter which returns a generator, and will walk around a zip looking for contained files
Hi Ed,
Will do. If you want to chat on the phone some time about this, I would be open to it. There might even be some other parties who are interested. For example, we might want to consider pulling these changes into a new 'testing' branch or something instead of into 'main'?
… setup.py with addition of test_suite and minor formatting changes.
Just merged in the latest changes, and fixed a minor conflict in the new setup.py
Hi Ed, I will contact you by e-mail to discuss the changes and further development.
Well these comments go to my inbox. I would prefer to keep the electronic
… to prevent digging into container formats
Ok, good point. I've brought -zip back, and call Fido.identify_files or Fido.identify_file in main.
By the way, I wrote up some meandering thoughts about this work at http://inkdroid.org/journal/2012/09/02/fido-test-suite/
Ed, really nice to see your blog on this. Have you seen David Tarrant's thoughts on govdoc at http://www.openplanetsfoundation.org/blogs/2012-07-26-1-million-21000-reducing-govdocs-significantly ? Cheers, Adam.
@adamfarquhar no I hadn't seen that -- thanks!
Thanks Ed, still busy reviewing/testing your code. Must say the generator is quite cleverly done... Read your blog about the FIDO test suite. In fact the PUIDs are easily resolvable at PRONOM, e.g.:
Just for anyone else following along, here is a private email I received from Maurice on October 17th:

Hi Ed,

I have reviewed and tested your commit but unfortunately there are a number of issues:

Note that these issues only appear when invoked from CLI. Did not test your API functionality. Due to this I also have not been able to test the rest of the functionality.

I also do have some remarks about your test suite:

Example: a file that matches on "extension" in one signature-version could very well match on a "signature" if the PUID is updated. It can also be the other way around, a file that has been matched before with "signature" might be matching on "extension" in a later version.

Your work has not been in vain, I really like your solution with the generator functions.

Kind regards,
I responded to the above email with: Rather than pulling these changes into master, do you want to pull them in as a branch and add issue tickets for the problems you noticed?
Pulled the changes into a separate branch and will issue tickets for the problems. Closing this pull request.
Would you be ok with renaming the branch to something a bit more meaningful, like testsuite-refactor?
I don't know if these changes will be too substantial for you, but I've refactored the Fido class and module to support the development of a test suite, which I have also begun.
The majority of the changes involved making it easy to instantiate a Fido class without having to also do all the stuff that was going on in main. I also removed the printing from the Fido class and put it into main.
Another significant change I made was introducing a new method, identify_files, which is a generator that lazily yields all matches, walking inside zip file containers. identify_file now just returns a single identity.
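To illustrate the split between the two methods, here is a minimal standalone sketch. It is not the Fido code: the `MAGIC` table, `identify_bytes`, `_walk`, and the `outer!member` naming are all hypothetical, and a tiny magic-byte lookup stands in for real PRONOM signature matching. It only aims to show the shape of the idea: one function returns a single identity for a file, the other is a generator that also descends into zip containers.

```python
import io
import zipfile
from typing import Iterator, Optional, Tuple

# Hypothetical stand-in for signature matching: magic-byte prefix -> label.
MAGIC = {b"%PDF": "pdf", b"PK\x03\x04": "zip", b"\x89PNG": "png"}


def identify_bytes(data: bytes) -> Optional[str]:
    """Return the first label whose magic prefix matches, or None."""
    for magic, label in MAGIC.items():
        if data.startswith(magic):
            return label
    return None


def identify_file(path: str) -> Optional[str]:
    """Return a single identity for the file itself; never opens containers."""
    with open(path, "rb") as handle:
        return identify_bytes(handle.read(8))


def _walk(name: str, data: bytes) -> Iterator[Tuple[str, Optional[str]]]:
    """Yield (name, label) for this item, then recurse into it if it is a zip."""
    label = identify_bytes(data)
    yield name, label
    if label == "zip":
        with zipfile.ZipFile(io.BytesIO(data)) as archive:
            for member in archive.namelist():
                if member.endswith("/"):
                    continue  # skip directory entries
                # Simplification: each member is read fully into memory.
                yield from _walk(f"{name}!{member}", archive.read(member))


def identify_files(path: str) -> Iterator[Tuple[str, Optional[str]]]:
    """Lazily yield identities for the file and everything nested inside it."""
    with open(path, "rb") as handle:
        yield from _walk(path, handle.read())
```

Because `identify_files` is a generator, a caller such as main can print each result as it arrives, or stop early, without the identification code doing any printing itself.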
The big thing with this pull request IMHO is the tests. I plan on improving them, so maybe it's worth taking a look?
PS. I turned the README.txt into a README.md for pretty display on GitHub :-)