Skip to content

feat(datasets): add 100+ dataset integrations with sample() and export utilities#28

Merged
shahar-brd merged 5 commits intobrightdata:devfrom
karaposu:dev
Feb 16, 2026
Merged

feat(datasets): add 100+ dataset integrations with sample() and export utilities#28
shahar-brd merged 5 commits intobrightdata:devfrom
karaposu:dev

Conversation

@karaposu
Copy link
Contributor

@karaposu karaposu commented Feb 9, 2026

Summary

  • 100+ dataset integrations across e-commerce, business intelligence, real estate, social media, luxury brands, and more
  • sample() method for quick data access without specifying filters
  • Filter API with operators: =, !=, >, <, includes, is_not_null, and more
  • Export utilities: export_json, export_csv, export_jsonl (or auto-detect with export())
  • Dataset metadata discovery via get_metadata()
  • Mass test notebook for 87 datasets

Usage

async with BrightDataClient() as client:
    # Quick sample — no filter needed
    snapshot_id = await client.datasets.imdb_movies.sample(records_limit=5)
    data = await client.datasets.imdb_movies.download(snapshot_id)

    # With filter
    snapshot_id = await client.datasets.walmart_products(
        filter={"name": "title", "operator": "includes", "value": "laptop"},
        records_limit=10
    )
    data = await client.datasets.walmart_products.download(snapshot_id)

    # Export
    from brightdata.datasets import export
    export(data, "results.json")

…dIn (profiles, companies), Amazon, Crunchbase, IMDB, NBA, Goodreads, World Population - Export utilities: export_json, export_csv, export_jsonl - Notebooks: linkedin, amazon, crunchbase demos
…c dataset classes for: - Luxury brands: Loewe, Berluti, Moynat, Hermes, Delvaux, Prada, Montblanc, YSL, Dior, Balenciaga, Bottega Veneta, Celine, Chanel, Fendi - E-commerce: Amazon (Reviews, Sellers), Walmart, Shopee, Lazada, Zalando, Sephora, Zara, Mango, Massimo Dutti, Asos, Shein, Ikea, H&M, Lego, Mouser, Digikey - Social media: Instagram (Profiles, Posts), TikTok, Pinterest (Posts, Profiles), YouTube (Profiles, Videos, Comments), Facebook Pages Posts - Real estate: Zillow, Airbnb, Australia Real Estate, Otodom Poland, Zonaprop Argentina, Metrocuadrado, Infocasas Uruguay, Properati, Toctoc, Inmuebles24 Mexico, Yapo Chile - Business data: Glassdoor (Companies, Reviews, Jobs), Indeed (Companies, Jobs), ZoomInfo, PitchBook, G2, Trustpilot, TrustRadius, Owler, Slintel, Manta, VentureRadar, Companies Enriched, Employees Enriched - Other: World Zipcodes, US Lawyers, Google Maps Reviews, Yelp, Xing Profiles, OLX Brazil, Webmotors Brasil, Chileautos, LinkedIn Jobs Total: 100 datasets now implemented (92 new + 8 existing)
@karaposu karaposu changed the title Add Datasets API with 8 datasets and demo notebooks feat(datasets): add 100+ dataset integrations with sample() and export utilities Feb 16, 2026
@shahar-brd shahar-brd merged commit 1754652 into brightdata:dev Feb 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants