-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a script which tests key functions #16
Conversation
Stopped at first cell in the Loading Data section:
Replacing One potential solution is to replace it with a placeholder variable whose value is |
Thanks for catching this! I just updated it to use Hive's |
Indeed:
Similar problem with
@elukey: Do you have a recommendation for fixing our permission problems? We want anyone with analytics client access to be able to run the cells in this test notebook to verify that the package works. It appears that using the It may be that anyone who wishes to test the package with the notebook will need to run it as the @ottomata Is there a way to use Anaconda as the
and got
(even after I think |
From my DM to @bearloga:
|
@neilpquinn Do you get this error when you run the When I restart the kernel and skip those cells, I can import wmfdata no problem. Maybe we shouldn't use
Everything else runs without problems. |
Note to self: the Presto test can currently fail because the order of the output differs from the input. I need to add an |
f702c70
to
9b1e4fb
Compare
Okay, I have a new version of this I'm happy with! I've solved the permissions issues by simply having the user pass the name of the Hive database to use when they invoke the script. That way, each user should be using their own database. If one of the tests fails, the script stops with an error. If all the tests pass, the output will look like this:
There's a lot of annoying logspam, but I think it still gets the point across. |
Is the title of this PR incorrect? a notebook? :) |
Sorry, it was a mutation due to the excessive radiation coming off the Data Lake! Also, I decided a script would be cleaner and more maintainable than a notebook 😊 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems nice to me, I'll keep it in mind as we think about testing airflow pipelines
3add46d
to
a6adba9
Compare
For the record I tested the tests and they test great. I approve these changes but we can't merge until all reviewers agree I think. |
Thank you! I don't think it was the reviewers; I think you just didn't have merge permissions. I've given @wikimedia/data-engineering those rights, so next time you should be able to. |
It won't be very helpful to look at the raw files here; instead, this can be reviewed by checking out the branch on one of the analytics clients and opening the notebook there.