Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/Import data from cloud #213



Copy link

c-w commented May 21, 2019

In #141 it was discussed to enable the user to upload data directly from cloud storage. This feature speeds up the annotation workflow by no longer requiring the user to download all data files to their machine and also enables data import into doccano in higher compliance environments where users may not be allowed to download data files to their machine.

This pull request implements the data import from cloud by integrating:

  • libcloud storage to have a single SDK in the import API to interact with a wide range of cloud storage systems such as AWS S3, Google Cloud Storage, Azure Blob Storage, etc.

  • django-cloud-browser to enable the user to navigate the cloud storage interactively and select which file to upload. Note that the cloud browser uses libcloud under the hood so it supports the exact same set of cloud storages as the import API.

The animation below shows the cloud upload experience:

Animation showing data import from cloud storage

The cloud upload feature is functionally complete but the default UI of django-cloud-browser is fairly basic. I'd be very happy to take guidance on how to improve the CSS to make it more aligned with doccano, e.g. is there a UI style-guide for doccano?

Resolves #141

@c-w c-w force-pushed the CatalystCode:feature/import-data-from-cloud branch 2 times, most recently from a818f76 to 509de45 May 21, 2019
@c-w c-w force-pushed the CatalystCode:feature/import-data-from-cloud branch 2 times, most recently from 41ef106 to efb2f45 May 24, 2019
@c-w c-w force-pushed the CatalystCode:feature/import-data-from-cloud branch from efb2f45 to d520ff7 May 24, 2019
@Hironsan Hironsan merged commit 175a299 into doccano:master Jun 7, 2019
3 checks passed
3 checks passed
Codacy/PR Quality Review Up to standards. A positive pull request.
Travis CI - Pull Request Build Passed
continuous-integration/travis-ci/pr The Travis CI build passed
@c-w c-w deleted the CatalystCode:feature/import-data-from-cloud branch Jun 7, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
None yet
2 participants
You can’t perform that action at this time.