Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RPi dockerfile updates #23

Merged
merged 2 commits into from
Sep 16, 2018

Conversation

brannondorsey
Copy link
Contributor

Add required dependencies (namely python3 and pip3) in Dockerfile.raspberry.

Upon building an running Dockerfile.raspberry on an RPi Model 2 B, I noticed that most of the current warrior projects were not working correctly. Out of the box, I believe only URLTeam2 and WikiTeam were working... I've since added several dependencies and tested each project. 5 projects are now working that were not before, but several are still showing errors.

Working Now

  • URLTeam
  • WikiTeam
  • Yahoo Answers (Fixed with this PR)
  • Livejournal Discovery (Fixed with this PR)
  • Quizlet (Fixed with this PR)
  • Wikispaces (Fixed with this PR)
  • Zetaboards (Fixed with this PR)

Still Not Working

  • Flickr
  • Newsgrabber
  • GitHub discovery [Errno 8] Exec format error likely ARM is currently an unsupported architecture
  • Tindeck Traceback (most recent call last): File "<string>", line 21, in <module> ImportError: No module named 'warc' During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/warrior/warrior-code2/src/seesaw/seesaw/warrior.py", line 737, in start_selected_project pipeline_path, {"downloader": self.downloader}) File "/home/warrior/warrior-code2/src/seesaw/seesaw/warrior.py", line 686, in load_pipeline exec(pipeline_str, local_context, global_context) File "<string>", line 23, in <module> Exception: Please install warc with 'pip install warc --upgrade'.
  • FTPDiscovery Traceback (most recent call last): File "/home/warrior/warrior-code2/src/seesaw/seesaw/warrior.py", line 737, in start_selected_project pipeline_path, {"downloader": self.downloader}) File "/home/warrior/warrior-code2/src/seesaw/seesaw/warrior.py", line 686, in load_pipeline exec(pipeline_str, local_context, global_context) File "<string>", line 144, in <module> File "<string>", line 138, in get_hash FileNotFoundError: [Errno 2] No such file or directory: '/data/data/projects/ftpdisco-1156c27/CheetoFTP/scanner.py'
  • FTP-GOV Traceback (most recent call last): File "/home/warrior/warrior-code2/src/seesaw/seesaw/warrior.py", line 737, in start_selected_project pipeline_path, {"downloader": self.downloader}) File "/home/warrior/warrior-code2/src/seesaw/seesaw/warrior.py", line 686, in load_pipeline exec(pipeline_str, local_context, global_context) File "", line 55, in Exception: No usable Wpull found.`
  • Livejournal WIP (git clone error, tried to clone https://github.com/ArchiveTeam/ instead of the actual repo)

I may have some bandwidth to dig into some of these errors in the next few days and weeks, but either way this merge should be good to go until then as it fixes several projects. If anyone else notices any quick fixes given some of those ^ error messages that would be rad too!

@brannondorsey
Copy link
Contributor Author

Pinging this 🐱, hoping we can get it merged. I've been running this on a Pi for 2+ weeks and things are looking good!

@chfoo chfoo merged commit 697a0d5 into ArchiveTeam:master Sep 16, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants