-
Notifications
You must be signed in to change notification settings - Fork 0
Datasets Sharing
Gen 3: This is a legacy Gen 2 article. For current GT AI OS 3.0 guidance, see gen3/datasets/managing.
This guide explains how to share datasets with team members, across your organization, and how to export/import datasets.
| Visibility | Who Can Access | Best For |
|---|---|---|
| Individual | Only you | Personal knowledge bases |
| Team | Selected team members | Team-specific resources |
| Organization | All users in your organization | Company-wide knowledge |
Note: Only Tenant Admins can set visibility to Organization. All users can access Organization-level datasets.
- Open the dataset details page
- Click Edit or access settings
- Change the visibility setting
- Select teams (if Team visibility)
- Save changes
When you share with a team:
- Team members see the dataset in their list
- They can use it with their agents
- They can view document contents
- Set visibility to Team
- Select which teams should have access
- Save changes
- Edit the dataset
- Deselect teams or change visibility to Individual
- Save changes
- Edit the dataset
- Set visibility to Organization
- Save changes
All organization members can now access the dataset.
Before sharing organization-wide:
- Ensure content is appropriate for all users
- Verify data sensitivity is acceptable
- Document what the dataset contains
- Consider maintenance responsibilities
Export datasets for backup or transfer to another GT AI OS instance:
- Go to the Datasets page
- Select one or more datasets using checkboxes
- Click Export
- A ZIP file downloads
Export contents:
- metadata.json: Dataset name, description, category, tags, and settings
- documents.json: List of uploaded documents with their metadata
- chunks.jsonl: Processed text chunks used for search
Note: Maximum export size is 100MB.
Import previously exported datasets:
- Go to the Datasets page
- Click Import
- Select the ZIP file
- Review the configuration and any warnings
- Choose whether to re-process chunks
- Click Import
Re-processing chunks: When moving datasets between different GT AI OS instances or versions, it's recommended to re-process the chunks. This ensures optimal search quality with the current embedding model. If you're restoring a backup to the same instance, you can skip re-processing.
Export and delete actions work for both individual datasets and bulk selections:
Single dataset:
- Click on a dataset, then use the Export or Delete button
Multiple datasets:
- Select datasets using checkboxes on dataset cards
- Click Export to download all selected as a ZIP
- Click Delete to remove all selected (with confirmation)
- Review content: Ensure accuracy and appropriateness
- Document purpose: Write clear descriptions
- Set proper visibility: Don't over-share
- Add relevant tags: Help others find the dataset
- Communicate changes: Let users know about updates
- Version carefully: Major changes may affect agents using the dataset
- Monitor usage: Check if dataset is being used effectively
- Delete when outdated: Remove stale content
- Don't share sensitive information inappropriately
- Review content before organization-wide sharing
- Consider access controls for sensitive data
- Follow your organization's data policies
- Verify visibility settings are correct
- Confirm users are in the selected team(s)
- Check that the dataset is active
- Review your sharing settings
- Ensure you have permission to share
- Contact your Tenant Admin if needed
If you see warnings during import:
- Missing embedding model: Re-process chunks recommended
- Large file size: May take longer to process
- Duplicate names: Consider renaming before import