Skip to content
@chatnoir-eu

ChatNoir

ChatNoir Research Web Search Engine

Pinned Loading

  1. chatnoir2-webclient Public

    ChatNoir Web Frontend

    Java 8 6

  2. chatnoir-resiliparse Public

    A robust web archive analytics toolkit

    Cython 101 15

  3. chatnoir2-indexer Public

    ChatNoir Indexer

    Java 9 2

  4. chatnoir2-mapfile-generator Public

    ChatNoir HDFS Map File Generator

    Java 5 2

  5. chatnoir-copycat Public

    CopyCat is a resource for deduplication in TREC-style experimental setups.

    Arc 7

Repositories

Showing 10 of 16 repositories
  • chatnoir-resiliparse Public

    A robust web archive analytics toolkit

    Cython 101 Apache-2.0 15 0 0 Updated Mar 27, 2025
  • chatnoir-pyterrier Public

    🔍 Use the ChatNoir search engine in PyTerrier.

    Python 3 MIT 0 2 0 Updated Mar 25, 2025
  • chatnoir-api Public

    🔍 Simple, type-safe access to the ChatNoir search API.

    Python 5 MIT 1 1 1 Updated Mar 25, 2025
  • Python 0 0 0 0 Updated Mar 24, 2025
  • web-content-extraction-benchmark Public

    Web Content Extraction Benchmark

    Python 17 Apache-2.0 5 3 0 Updated May 24, 2024
  • chatnoir-chat Public
    Jupyter Notebook 4 0 2 0 Updated Dec 21, 2023
  • chatnoir-warc-dl Public

    This pipeline allows extracting data from WARC files on a CPU cluster and streaming it to a GPU server, where it is processed.

    Python 7 MIT 3 1 0 Updated May 7, 2023
  • chatnoir-warc-indexer Public

    ChatNoir Indexer

    Python 0 0 0 0 Updated Dec 2, 2022
  • chatnoir2-webclient Public

    ChatNoir Web Frontend

    Java 8 MIT 6 0 0 Updated Mar 25, 2022
  • chatnoir2-indexer Public

    ChatNoir Indexer

    Java 9 MIT 2 0 0 Updated Nov 5, 2021

Top languages

Loading…

Most used topics

Loading…