montreal2017: review and add additional notes #5
(We did not reach consensus on whether a censorship event includes also the network inside the country.)
I remember that we did not agree on that.
@@ -12,13 +12,13 @@ Sanitization procedure -- > Raw data is stored forever, but we sanitize it befor

 There are some sanitization checks that don't pass due to formatting reasons.

-Eventually the data ends up in postgress database. This is not the best database, but it's good enough. As the raw data storage increases, we will need to consider an alternative.
+Eventually the metadata ends up in postgress database. This is not the best database, but it's good enough. As the raw data storage increases, we will need to consider an alternative.
My understanding is that the new pipeline's database doesn't store data.
Depends on the definition of data, I guess. Some data does end up in the DB. I am OK with either, though.

 How to run this:

 The code is basically stored in a docker image, so you can use whatever software you like. Airflow is a bunch of python libraries with their own version and you don't want to have conflicts and resolve them. So you have to build a docker image to run the software.

-How do you add the docker image to a chain (like described below)? You upload the docker image to docker hub, and then make the pipeline run this image through operations with elevated privileges (Airflow does this through a wraper script, run with sudo). At the end of these basic checks and setups, docker runs something and that's it.
+How do you add the docker image to a chain (like described below)? You upload the docker image to docker hub, and then make the pipeline run this image through operations with elevated privileges (we do this through a wrapper script, run with sudo). At the end of these basic checks and setups, docker runs something and that's it.
My understanding is that @darkk wrote a custom wrapper for sudo.
yup
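For readers who have not seen the setup, the flow described in this hunk (an image pulled from docker hub, then run through a wrapper with sudo) could look roughly like the sketch below. This is only an illustration and not the actual wrapper: the function name `build_docker_command`, the `ALLOWED_IMAGES` allow-list, and the image name are all hypothetical. The sketch only builds the command line and does not execute anything.

```python
import shlex

# Hypothetical allow-list of images the pipeline may run; the name is made up.
ALLOWED_IMAGES = {"example/pipeline-task:latest"}


def build_docker_command(image, task_args, use_sudo=True):
    """Return the argv list for a one-off `docker run` of an allowed image.

    Assumption: the real wrapper performs its own checks; the allow-list
    here only stands in for those "basic checks and setups".
    """
    if image not in ALLOWED_IMAGES:
        raise ValueError(f"image not allowed: {image}")
    # --rm removes the container once the task exits.
    argv = ["docker", "run", "--rm", image, *task_args]
    if use_sudo:
        argv = ["sudo", *argv]
    return argv


if __name__ == "__main__":
    cmd = build_docker_command(
        "example/pipeline-task:latest", ["--date", "2017-01-01"]
    )
    # Print the command instead of running it, so the sketch has no side effects.
    print(shlex.join(cmd))
```

A scheduler task would then hand the resulting list to something like `subprocess.run(cmd, check=True)`. Keeping the command construction separate from execution makes the privileged part easy to review.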
@@ -1,6 +1,6 @@
 # Mobile apps

-IMPORTANT: Automated testing (possibly implemented in node).
+IMPORTANT: continuous-integration testing (possibly implemented in node).
I would call this end-to-end testing rather than CI.
minor nit-pick. merging.
Mostly hotfixes. I've added comments in places where I changed the meaning of sentences.