more fixes #107

Merged
merged 2 commits into from Aug 5, 2019
11 changes: 6 additions & 5 deletions 02_bag.ipynb
@@ -85,7 +85,8 @@
"source": [
"# Requires `s3fs` library\n",
"# each partition is a remote CSV text file\n",
-"b = db.read_text('s3://dask-data/nyc-taxi/2015/yellow_tripdata_2015-01.csv')\n",
+"b = db.read_text('s3://dask-data/nyc-taxi/2015/yellow_tripdata_2015-01.csv',\n",
+" storage_options={'anon': True})\n",
"b.take(1)"
]
},
@@ -622,9 +623,9 @@
"metadata": {
"anaconda-cloud": {},
"kernelspec": {
-"display_name": "Python 3",
+"display_name": "Python [conda env:dask-dev] *",
"language": "python",
-"name": "python3"
+"name": "conda-env-dask-dev-py"
},
"language_info": {
"codemirror_mode": {
@@ -636,9 +637,9 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
-"version": "3.7.3"
+"version": "3.7.2"
}
},
"nbformat": 4,
-"nbformat_minor": 1
+"nbformat_minor": 2
}
11 changes: 5 additions & 6 deletions 07_dataframe_storage.ipynb
@@ -302,9 +302,7 @@
},
{
"cell_type": "markdown",
-"metadata": {
-"collapsed": true
-},
+"metadata": {},
"source": [
"Rerun the sum computation above for this version of the data, and time how long it takes. You may want to try this more than once - it is common for many libraries to do various setup work when called for the first time."
]
@@ -361,7 +359,8 @@
"The way to set up dataframes (and other collections) remains very similar to before. Note that the data here is available anonymously, but in general an extra parameter `storage_options=` can be passed with further details about how to interact with the remote storage.\n",
"\n",
"```python\n",
-"taxi = dd.read_csv('s3://nyc-tlc/trip data/yellow_tripdata_2015-*.csv')\n",
+"taxi = dd.read_csv('s3://nyc-tlc/trip data/yellow_tripdata_2015-*.csv',\n",
+" storage_options={'anon': True})\n",
"```"
]
},
@@ -390,9 +389,9 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
-"version": "3.7.3"
+"version": "3.7.2"
}
},
"nbformat": 4,
-"nbformat_minor": 1
+"nbformat_minor": 2
}
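
For reference, the `storage_options={'anon': True}` argument added in both notebooks is forwarded to `s3fs`, where `anon=True` asks for an anonymous connection (public buckets only). The sketch below wraps the two calls from this diff. The names `S3_ANON`, `open_taxi_bag`, and `open_taxi_frame` are hypothetical and not part of the tutorial, and calling either helper assumes `dask`, `s3fs`, and network access to the buckets.

```python
# Sketch only: S3_ANON and the two helpers are illustrative names,
# not part of the dask-tutorial repository.

# Passed through to s3fs; anon=True means connect without credentials.
S3_ANON = {"anon": True}


def open_taxi_bag(
    url="s3://dask-data/nyc-taxi/2015/yellow_tripdata_2015-01.csv",
):
    """Build a dask bag of text lines from a public S3 object."""
    import dask.bag as db  # imported here so the module loads without dask

    return db.read_text(url, storage_options=S3_ANON)


def open_taxi_frame(
    url="s3://nyc-tlc/trip data/yellow_tripdata_2015-*.csv",
):
    """Build a dask dataframe from public CSV files matching a glob."""
    import dask.dataframe as dd

    return dd.read_csv(url, storage_options=S3_ANON)
```

Both helpers may contact S3 as soon as they are called (to resolve paths or sample the data), so they are defined here but not invoked.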
2 changes: 1 addition & 1 deletion Dockerfile
@@ -7,5 +7,5 @@ RUN apt-get update && apt-get install -y graphviz
USER jovyan

RUN git clone https://github.com/dask/dask-tutorial.git ./dask-tutorial
-RUN cd dask-tutorial && conda env update -f binder/environment.yml && python prep.py && cd ..
+RUN cd dask-tutorial && conda env update -f binder/environment.yml && cd ..
RUN rm dask-tutorial/github_deploy_key_dask_dask_tutorial.enc