Create a data connector for Google Drive #244

Closed
cragwolfe opened this issue Feb 21, 2023 · 4 comments
Labels
enhancement New feature or request python Pull requests that update Python code

Comments

@cragwolfe
Contributor

cragwolfe commented Feb 21, 2023

Create a data connector that pulls documents from Google Drive, stores them locally (at least temporarily for processing), and runs them through unstructured.partition.auto.

See Adding Data Connectors for details on how to get started. Make sure to include a link to this issue when submitting a PR.

Definition of Done

  • The checklist has been completed.
  • The connector is able to process a single document.
  • The connector is able to process all documents in a Google Drive folder, recursively.
  • For now, it is OK to process only doc types that unstructured.partition.auto is capable of processing. Google Drive documents should be converted to PDF or Word Doc for processing (unless there is a better way).
  • Bonus points: the ability to filter by document type.
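
A rough sketch of the recursive traversal, export decision, and type filter from the list above; it is not a prescribed implementation. It stays off the network by taking a hypothetical `list_children` callable, which a real connector might back with the Drive v3 `files().list` call. The names `iter_drive_files`, `EXPORT_MIME`, and `allowed_mime_types` are illustrative assumptions, as is the choice of Word for Google Docs and PDF for Sheets and Slides.

```python
from typing import Callable, Dict, Iterable, Iterator, Optional, Set, Tuple

DriveFile = Dict[str, str]  # assumed shape: {"id": ..., "name": ..., "mimeType": ...}

FOLDER_MIME = "application/vnd.google-apps.folder"
GOOGLE_NATIVE_PREFIX = "application/vnd.google-apps."
DOCX_MIME = "application/vnd.openxmlformats-officedocument.wordprocessingml.document"
PDF_MIME = "application/pdf"

# Google-native files have no downloadable bytes and must be exported.
# This mapping is an assumption for illustration, not a project decision.
EXPORT_MIME = {
    "application/vnd.google-apps.document": DOCX_MIME,
    "application/vnd.google-apps.spreadsheet": PDF_MIME,
    "application/vnd.google-apps.presentation": PDF_MIME,
}


def iter_drive_files(
    root_id: str,
    list_children: Callable[[str], Iterable[DriveFile]],
    allowed_mime_types: Optional[Set[str]] = None,
) -> Iterator[Tuple[DriveFile, Optional[str]]]:
    """Walk a folder tree and yield (file, export_mime) pairs.

    export_mime is None when the file can be downloaded as-is.
    allowed_mime_types filters on the type the file will have once it
    is on local disk (that is, after any export).
    """
    stack = [root_id]
    seen: Set[str] = set()  # guards against visiting a folder twice
    while stack:
        folder_id = stack.pop()
        if folder_id in seen:
            continue
        seen.add(folder_id)
        for item in list_children(folder_id):
            mime = item["mimeType"]
            if mime == FOLDER_MIME:
                stack.append(item["id"])  # descend later
                continue
            export_mime = EXPORT_MIME.get(mime)
            if export_mime is None and mime.startswith(GOOGLE_NATIVE_PREFIX):
                continue  # native type with no export mapping: skip it
            local_mime = export_mime or mime
            if allowed_mime_types is not None and local_mime not in allowed_mime_types:
                continue
            yield item, export_mime
```

Each yielded file would then be downloaded (or exported when `export_mime` is set) to a temporary path and passed to `partition(filename=...)` from `unstructured.partition.auto`. Processing a single document is the degenerate case where the traversal is skipped entirely.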
@cragwolfe cragwolfe added enhancement New feature or request python Pull requests that update Python code labels Feb 21, 2023
@cragwolfe cragwolfe changed the title Create a data connector for Google Drive [DRAFT] Create a data connector for Google Drive Feb 23, 2023
@HAKSOAT
Contributor

HAKSOAT commented Feb 26, 2023

Hi @cragwolfe I'd like to give this a shot.

@cragwolfe
Contributor Author

@HAKSOAT, sounds good, please go ahead! Thanks for mentioning it. :)

@alvarobartt
Contributor

Hi @cragwolfe, what about creating a separate issue for a Google Cloud Storage (GCS) connector too? If so, I'd like to contribute and tackle that 👍🏻

@cragwolfe
Contributor Author

Good idea, added #301.

cragwolfe pushed a commit that referenced this issue Mar 7, 2023