@nla

National Library of Australia

Popular repositories

  1. outbackcdx Public

    Web archive index server based on RocksDB

    Java 31 20

  2. httrack2warc Public

    Converts HTTrack crawls to WARC files

    Java 30 6

  3. chropro Public archive

    Chrome debugging protocol client for Java

    Java 10 2

  4. solrbackup Public

    Python script for backing up a remote Solr 4 core or SolrCloud cluster

    Python 9 6

  5. chronicrawl Public archive

    Experimental continuous web crawler for web archiving

    Java 9

  6. bamboo Public

    Web archive collection manager

    Java 8 4

Repositories

Showing 10 of 75 repositories
  • nla-blacklight Public

    Discovery application for the National Library of Australia's catalogue

    Ruby 0 1 0 9 Updated Sep 16, 2024
  • nla-arclight Public

    Custom implementation of ArcLight for The National Library of Australia.

    Ruby 0 0 0 9 Updated Sep 16, 2024
  • ai-scout-imageSearchComparison Public

    Simple website to capture evaluation of different ways to search images.

    JavaScript 1 0 0 6 Updated Sep 13, 2024
  • ai-scout-pictures Public

    AI pictures proof of concept - crawl blacklight, build SOLR index with CLIP embeddings, exploratory web interface

    HTML 1 1 0 6 Updated Sep 11, 2024
  • heritrix3 Public Forked from internetarchive/heritrix3

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    Java 0 774 0 0 Updated Sep 10, 2024
  • outbackcdx Public

    Web archive index server based on RocksDB

    Java 31 Apache-2.0 20 18 0 Updated Sep 9, 2024
  • nla-blacklight_common Public

    Common functionality for Blacklight and ArcLight applications

    Ruby 0 0 0 5 Updated Sep 9, 2024
  • pandas4 Public

    Web archive workflow system

    Java 3 Apache-2.0 2 16 0 Updated Sep 4, 2024
  • ai-scout-audio2 Public

    AI audio proof of concept #2 - read TEI transcripts, build SOLR index with nomic embeddings, exploratory search and delivery web interface

    JavaScript 0 0 0 6 Updated Aug 30, 2024
  • pywb Public Forked from webrecorder/pywb

    Core Python Web Archiving Toolkit for replay and recording of web archives

    JavaScript 1 GPL-3.0 215 0 3 Updated Aug 16, 2024
