Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Load specific file types from Google Drive (issue #4878) #4926

Merged
merged 7 commits into from May 18, 2023

Conversation

eyurtsev
Copy link
Collaborator

Load specific file types from Google Drive (issue #4878)

Add the possibility to define what file types you want to load from Google Drive.

 loader = GoogleDriveLoader(
    folder_id="1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5",
    file_types=["document", "pdf"]
    recursive=False
)

Fixes ##4878

Who can review?

Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested:
DataLoaders

Twitter: @UmerHAdil | Discord: RicChilligerDude#7589

@eyurtsev
Copy link
Collaborator Author

cc @UmerHA merging in -- made a few minor changes to function signature. looks good!

@eyurtsev eyurtsev merged commit c06a47a into master May 18, 2023
13 checks passed
@eyurtsev eyurtsev deleted the eugene/gdrive-filter2 branch May 18, 2023 13:27
@danielchalef danielchalef mentioned this pull request Jun 5, 2023
hwchase17 pushed a commit that referenced this pull request Jun 19, 2023
# Iterate through filtered file types instead of all listed files

Fixes #6257

#4926 originally added the
functionality to filter by file type, storing the filtered files in
`_files`

#5220 removed the
functionality when adding code to filter trashed files by using the
`files` variables instead of the `_files` variable.

This PR simply adds the functionality back by using `_files` again.

#### Who can review?

@hwchase17 - project lead
@eyurtsev
This was referenced Jun 25, 2023
kacperlukawski pushed a commit to kacperlukawski/langchain that referenced this pull request Jun 29, 2023
…chain-ai#6258)

# Iterate through filtered file types instead of all listed files

Fixes langchain-ai#6257

langchain-ai#4926 originally added the
functionality to filter by file type, storing the filtered files in
`_files`

langchain-ai#5220 removed the
functionality when adding code to filter trashed files by using the
`files` variables instead of the `_files` variable.

This PR simply adds the functionality back by using `_files` again.

#### Who can review?

@hwchase17 - project lead
@eyurtsev
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants