GitHub

Introduction
Installation
Getting Started
- Obtaining API Keys and Client ID
- Example
Creating a New Project
Uploading Pre-annotations
- Example Usage (Synchronous)
- Example Usage (Asynchronous)
Local Export
- Acceptable Values
- Example Usage
Retrieving All Projects for a Client
- Example Usage
Retrieving All Datasets
- Example Usage
Error Handling
Automatic Logging and Error Handling
Support

Introduction

The Labellerr SDK is a Python library designed to make interacting with the Labellerr platform simple and efficient. With this SDK, you can manage data annotations, projects, and exports seamlessly in your applications.

This documentation will guide you through installing the SDK, understanding its core functionalities, and handling common errors.

Installation

To install the Labellerr SDK, use the following command:

pip install https://github.com/tensormatics/SDKPython/releases/download/prod/labellerr_sdk-1.0.0.tar.gz

Getting Started

Obtaining `api_keys` and `client_id`:

To obtain your api_keys and client_id, Pro and Enterprise plan users can contact Labellerr support. If you are on a free plan, you can request them by emailing support@tensormatics.com.

Once installed, you can start by importing and initializing the LabellerrClient and LabellerrError. This client will handle all communication with the Labellerr platform.

Example Client initiation:

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError

# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')

Replace 'your_api_key' and 'your_api_secret' with your actual API credentials provided by Labellerr.

Creating a New Project

A project in Labellerr organizes your datasets and their annotations. Use the following method to create a project:

To know more about what is a project in Labellerr, click here

Limitations:

Maximum of 2,500 files per folder.
Total folder size should not exceed 2.5 GB.

Acceptable value:

option_type
- 'input', 'radio', 'boolean', 'select', 'dropdown', 'stt', 'imc', 'BoundingBox', 'polygon', 'dot', 'audio'

Data Types and Supported Formats

The Labellerr SDK supports various data types and file formats for your annotation projects:

Supported Data Types and Extensions

Data Type	Description	Supported Extensions
`image`	Image files for visual annotation	`.jpg`, `.jpeg`, `.png`, `.bmp`, `.tiff`
`video`	Video content for temporal annotation	`.mp4`
`audio`	Audio files for sound annotation	`.mp3`, `.wav`
`document`	Document files for text analysis	`.pdf`
`text`	Plain text files for text annotation	`.txt`

When using the SDK methods, specify the data type as one of: 'image', 'video', 'audio', 'document', or 'text'.

Example Usage:

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError


# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')


project_payload = {
    'client_id': '12345',
    'dataset_name': 'Sample Dataset',
    'dataset_description': 'A sample dataset for image classification',
    'data_type': 'image',
    'created_by': 'user@example.com',
    'project_name': 'Image Classification Project',
    'annotation_guide': [
        {
            "question_number": 1,
            "question": "What is the main object in the image?",
            "question_id": "533bb0c8-fb2b-4394-a8e1-5042a944802f",
            "option_type": "dropdown",
            "required": True,
            "options": [
                {"option_name": "Car"},
                {"option_name": "Building"},
                {"option_name": "Person"}
            ]
        }
    ],
    'rotation_config': {
        'annotation_rotation_count': 0,
        'review_rotation_count': 1,
        'client_review_rotation_count': 0
    },
    'autolabel': False,
    'folder_to_upload': '/path/to/image/folder'
}

try:
    result = client.initiate_create_project(project_payload)
    print(f"Project created successfully. Project ID: {result['project_id']}")
except LabellerrError as e:
    print(f"Project creation failed: {e}")

Uploading Pre-annotations

Pre-annotations are labels which can be pre-loaded to a project which can speed up manual annotation

Example Usage (Synchronous):

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError


# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')

project_id = 'project_123'
client_id = '12345'
annotation_format = 'coco'
annotation_file = '/path/to/annotations.json'

try:
    # Upload and wait for processing to complete
    result = client.upload_preannotation_by_project_id(project_id, client_id, annotation_format, annotation_file)

    # Check the final status
    if result['response']['status'] == 'completed':
        print("Pre-annotations processed successfully")
        # Access additional metadata if needed
        metadata = result['response'].get('metadata', {})
        print("metadata",metadata)
except LabellerrError as e:
    print(f"Pre-annotation upload failed: {e}")

Example Usage (Asynchronous):

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError


# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')


project_id = 'project_123'
client_id = '12345'
annotation_format = 'coco'
annotation_file = '/path/to/annotations.json'

try:
    # Start the async upload - returns immediately
    future = client.upload_preannotation_by_project_id_async(project_id, client_id, annotation_format, annotation_file)

    print("Upload started, you can do other work here...")

    # When you need the result, wait for completion
    try:
        result = future.result(timeout=300)  # 5 minutes timeout
        if result['response']['status'] == 'completed':
            print("Pre-annotations processed successfully")
            metadata = result['response'].get('metadata', {})
            print("metadata",metadata)
    except TimeoutError:
        print("Processing took too long")
    except Exception as e:
        print(f"Error in processing: {e}")
except LabellerrError as e:
    print(f"Failed to start upload: {e}")

Choosing Between Sync and Async

Synchronous Method:
- Simpler to use - just call and wait for result
- Blocks until processing is complete
- Good for scripts and sequential processing
Asynchronous Method:
- Returns immediately with a Future object
- Allows you to do other work while processing
- Can set timeouts and handle long-running uploads
- Better for applications that need to stay responsive

Both methods will:

Upload your annotation file
Monitor the processing status
Return the final result with complete status information

Note: The processing time depends on the size of your annotation file and the number of annotations. For the sync method, this means waiting time. For the async method, you can do other work during this time.

Local Export

The SDK provides functionality to export project data locally. This feature allows you to export annotations in various formats for further analysis or backup purposes.

The export created will be available in the exports section of Labellerr dashboard. To know more where you can find the export, click here

Acceptable values:

statuses:
- 'review', 'r_assigned','client_review', 'cr_assigned','accepted'
export_format:
- 'json', 'coco_json', 'csv', 'png'

Example Usage:

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError


# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')


project_id = 'project_123'
client_id = '12345'
export_config = {
    "export_name": "Weekly Export",
    "export_description": "Export of all accepted annotations",
    "export_format": "json",
    "statuses": ["accepted"]
}

try:
    result = client.create_local_export(project_id, client_id, export_config)
    print(f"Local export created successfully. Export ID: {result['export_id']}")
except LabellerrError as e:
    print(f"Local export creation failed: {e}")

Note: The export process creates a local copy of your project's annotations based on the specified status filters. This is useful for backup purposes or when you need to process the annotations offline.

Retrieving All Projects for a Client

You can retrieve all projects associated with a specific client ID using the following method:

Example Usage:

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError


# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')

client_id = '12345'

try:
    result = client.get_all_project_per_client_id(client_id)

    # Check if projects were retrieved successfully
    if result and 'response' in result:
        projects = result['response']
        print(f"Found {len(projects)} projects:")
        for project in projects:
            print(f"- Project ID: {project.get('project_id')}")
            print(f"  Name: {project.get('project_name')}")
            print(f"  Type: {project.get('data_type')}")
except LabellerrError as e:
    print(f"Failed to retrieve projects: {e}")

This method is useful when you need to:

List all projects for a client
Find specific project IDs
Check project statuses
Get an overview of client's work

The response includes detailed information about each project, including its ID, name, data type, and other relevant metadata.

Retrieving All Datasets

You can retrieve both linked and unlinked datasets associated with a client using the following method:

Example Usage:

from labellerr.client import LabellerrClient
from labellerr.exceptions import LabellerrError

# Initialize the client with your API credentials
client = LabellerrClient('your_api_key', 'your_api_secret')

client_id = '12345'
data_type = 'image'

try:
    result = client.get_all_datasets(client_id, data_type)

    # Process linked datasets
    linked_datasets = result['linked']
    print(f"Found {len(linked_datasets)} linked datasets:")
    for dataset in linked_datasets:
        print(f"- Dataset ID: {dataset.get('dataset_id')}")
        print(f"  Name: {dataset.get('dataset_name')}")
        print(f"  Description: {dataset.get('dataset_description')}")

    # Process unlinked datasets
    unlinked_datasets = result['unlinked']
    print(f"\nFound {len(unlinked_datasets)} unlinked datasets:")
    for dataset in unlinked_datasets:
        print(f"- Dataset ID: {dataset.get('dataset_id')}")
        print(f"  Name: {dataset.get('dataset_name')}")
        print(f"  Description: {dataset.get('dataset_description')}")

except LabellerrError as e:
    print(f"Failed to retrieve datasets: {e}")

This method is useful when you need to:

Get an overview of all available datasets
Find datasets that are linked to projects
Identify unlinked datasets that can be associated with new projects
Manage dataset organization and project associations

The response includes two lists:

linked: Datasets that are already associated with projects
unlinked: Datasets that are not yet associated with any project

Each dataset object contains detailed information including its ID, name, description, and other metadata.

Error Handling

The Labellerr SDK uses a custom exception class, LabellerrError, to indicate issues during API interactions. Always wrap your function calls in try-except blocks to gracefully handle errors.

Example:

from labellerr.exceptions import LabellerrError

try:
    result = client.initiate_create_project(payload)
except LabellerrError as e:
    print(f"An error occurred: {e}")

Automatic Logging and Error Handling

The Labellerr SDK uses class-level decorators to automatically apply logging and error handling to all public methods in both LabellerrClient and AsyncLabellerrClient. This means every method call is automatically:

Logged when the method is called
Logged when the method completes successfully
Logged with error details if the method fails
Wrapped with standardized error handling

Benefits

✓ No Boilerplate: You don't need to add logging or error handling code in every method ✓ Consistency: All methods follow the same logging pattern ✓ Maintainability: Changes to logging or error handling are centralized ✓ Debugging: Comprehensive logs help troubleshoot issues quickly

How It Works

The SDK uses two decorators:

@auto_log_and_handle_errors for synchronous methods
@auto_log_and_handle_errors_async for asynchronous methods

These decorators are applied at the class level, so all public methods (methods not starting with _) automatically inherit them.

Example Log Output

When you call a method, you'll see debug logs like:

DEBUG - Calling create_gcs_connection
DEBUG - create_gcs_connection completed successfully

Or if an error occurs:

DEBUG - Calling create_gcs_connection
ERROR - create_gcs_connection failed: Connection refused

Enabling Debug Logging

To see the automatic logging in action, configure Python's logging:

import logging

# Enable debug logging
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(levelname)s - %(message)s'
)

from labellerr.client import LabellerrClient

client = LabellerrClient('your_api_key', 'your_api_secret')

# Now all method calls will be automatically logged
client.create_dataset(dataset_config, files_to_upload=['file1.jpg'])

Excluded Methods

Some methods are excluded from automatic decoration:

Private methods (starting with _)
Utility methods like close(), validate_rotation_config()
Session management methods

Custom Implementation

If you're building your own client or extending the SDK, you can use the same decorators:

from labellerr.validators import auto_log_and_handle_errors

@auto_log_and_handle_errors(
    include_params=False,  # Don't log sensitive parameters
    exclude_methods=['close', 'cleanup']  # Skip these methods
)
class MyCustomClient:
    def my_method(self):
        # This method automatically gets logging and error handling
        pass

    def close(self):
        # This method is excluded from decoration
        pass

Support

If you encounter issues or have questions, feel free to contact the Labellerr support team:

Email: support@labellerr.com
Documentation: Labellerr Documentation

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
.github/workflows		.github/workflows
labellerr		labellerr
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.env.example		.env.example
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.md		README.md
labellerr_integration_case_tests.py		labellerr_integration_case_tests.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

yashsuman15/SDKPython

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Introduction

Installation

Getting Started

Obtaining api_keys and client_id:

Example Client initiation:

Creating a New Project

Limitations:

Acceptable value:

Data Types and Supported Formats

Supported Data Types and Extensions

Example Usage:

Uploading Pre-annotations

Example Usage (Synchronous):

Example Usage (Asynchronous):

Choosing Between Sync and Async

Local Export

Acceptable values:

Example Usage:

Retrieving All Projects for a Client

Example Usage:

Retrieving All Datasets

Example Usage:

Error Handling

Example:

Automatic Logging and Error Handling

Benefits

How It Works

Example Log Output

Enabling Debug Logging

Excluded Methods

Custom Implementation

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Obtaining `api_keys` and `client_id`:

Packages