Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

montreal2017: review and add additional notes #5

Merged
merged 23 commits into from
Oct 20, 2017
Merged

Conversation

bassosimone
Copy link
Contributor

@bassosimone bassosimone commented Oct 13, 2017

Mostly hotfixes. I've added comments in places where I changed the meaning of sentences.

(We did not reach consensus on whether a censorship event includes also the network inside the country.)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I remember that we did not agree on that.

@@ -12,13 +12,13 @@ Sanitization procedure -- > Raw data is stored forever, but we sanitize it befor

There are some sanitization checks that don't pass due to formatting reasons.

Eventually the data ends up in postgress database. This is not the best database, but it's good enough. As the raw data storage increases, we will need to consider an alternative.
Eventually the metadata ends up in postgress database. This is not the best database, but it's good enough. As the raw data storage increases, we will need to consider an alternative.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My understanding is that the new pipeline's database doesn't store data.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Depends on the definition of data I guess. Some data does end up in the DB, I am ok with either though.


How to run this:

The code is basically stored in a docker image, so you can use whatever software you like. Airflow is a bunch of python libraries with their own version and you don't want to have conflicts and resolve them. So you have to build a docker image to run the software.

How do you add the docker image to a chain (like described below)? You upload the docker image to docker hub, and then make the pipeline run this image through operations with elevated privileges (Airflow does this through a wraper script, run with sudo). At the end of these basic checks and setups, docker runs something and that's it.
How do you add the docker image to a chain (like described below)? You upload the docker image to docker hub, and then make the pipeline run this image through operations with elevated privileges (we do this through a wrapper script, run with sudo). At the end of these basic checks and setups, docker runs something and that's it.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My understanding is that @darkk wrote a custom wrapper for sudo.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yup

@@ -1,6 +1,6 @@
# Mobile apps

IMPORTANT: Automated testing (possibly implemented in node).
IMPORTANT: continuous-integration testing (possibly implemented in node).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would call this end-to-end testing rather than CI.

@hellais
Copy link
Member

hellais commented Oct 20, 2017

minor nit-pick. merging.

@hellais hellais merged commit 5e618ba into master Oct 20, 2017
@hellais hellais deleted the feature/sbs branch October 20, 2017 09:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants