Popular repositories

  1. data-engineering-blueprints Public

    Patterns and concepts for building resilient data pipelines in Python and Scala

    3

  2. kafka-local Public

    Basic single-broker Kafka cluster, run with Docker Compose using the Confluent image

    2 1

  3. bigspark.github.io Public

    JavaScript 1

  4. kafkademo Public

    Java 1 1

  5. streamsets_json_schema_validator_processor Public

    A sample StreamSets DC processor for validating records against a specified JSON schema

    Java 1

  6. docker-intellij Public

    Dockerised IntelliJ, JDK 8, C++

Repositories

Showing 10 of 43 repositories
  • ntu-ktp-data-quality Public

    This repository is part of the Knowledge Transfer Partnership (KTP) between Nottingham Trent University (NTU) and Bigspark. The project aims to address data quality issues in large datasets, specifically in finance, using advanced techniques for error detection, error correction, duplicate detection, and beyond.

    Python 0 MIT 0 0 0 Updated Feb 21, 2025
  • dbt-airflow-dapr-docker
    Python 0 Apache-2.0 1 0 0 Updated Dec 16, 2024
  • docformatter Public Forked from PyCQA/docformatter

    Formats docstrings to follow PEP 257

    Python 0 MIT 86 0 0 Updated Dec 3, 2024
  • data-engineering-blueprints Public

    Patterns and concepts for building resilient data pipelines in Python and Scala

    3 0 0 0 Updated Aug 27, 2024
  • genai-presidio Public

    Repository for the PII Anonymizer code package and a sample FastAPI API that uses it to talk to an LLM

    Jupyter Notebook 0 0 0 0 Updated Jun 21, 2024
  • nuxtjs-template Public template
    JavaScript 0 0 0 0 Updated Jan 21, 2024
  • aws-sso-sync Public

    sso-sync tool to help with the SCIM setup for bigspark.

    Go 0 Apache-2.0 0 0 0 Updated Oct 26, 2023
  • test_glue_ Public Forked from itsbigspark/test_glue

    To test glue job

    Python 0 2 0 0 Updated Aug 1, 2023
  • test_glue Public

    To test glue job

    Python 0 2 0 0 Updated Jul 14, 2023
  • ai-hackathon Public

    General Purpose repo for NW AI Hackathon 2023

    Jupyter Notebook 0 Apache-2.0 2 0 1 Updated Apr 20, 2023
