    Repositories list

    • Custom implementation of ArcLight for the National Library of Australia.
      Ruby
      Other
      1001 Updated Jun 12, 2025
    • Discovery application for the National Library of Australia's catalogue
      Ruby
      Other
      10010 Updated Jun 12, 2025
    • pandas4

      Public
      Web archive workflow system
      Java
      Apache License 2.0
      33160 Updated Jun 11, 2025
    • Common functionality for Blacklight and ArcLight applications
      Ruby
      Other
      0006 Updated Jun 5, 2025
    • Ruby
      Other
      1001 Updated May 29, 2025
    • An abstraction/normalization layer for querying and displaying results for external search engines, in Ruby on Rails.
      Ruby
      MIT License
      14000 Updated May 25, 2025
    • Range facet/limit/profile plugin for Blacklight
      Ruby
      Other
      41000 Updated May 22, 2025
    • jvmctl

      Public
      Java app deployment tool
      Python
      MIT License
      11101 Updated May 21, 2025
    • Web archive index server based on RocksDB
      Java
      Apache License 2.0
      2034180 Updated May 8, 2025
    • XSLT
      0100 Updated May 6, 2025
    • nla-pywb

      Public
      pywb config overlay for the Australian Web Archive
      HTML
      0210 Updated Apr 30, 2025
    • pywb

      Public
      Core Python Web Archiving Toolkit for replay and recording of web archives
      JavaScript
      GNU General Public License v3.0
      229103 Updated Apr 17, 2025
    • AI audio proof of concept #2: read TEI transcripts, build a Solr index with nomic embeddings, and provide an exploratory search and delivery web interface
      HTML
      0006 Updated Apr 3, 2025
    • AI newspaper search proof of concept: all 3.1m CT articles 1926-94, a Solr index built with nomic embeddings, and an exploratory web interface with LLM summaries
      JavaScript
      0003 Updated Mar 23, 2025
    • bamboo

      Public
      Web archive collection manager
      Java
      Apache License 2.0
      5890 Updated Mar 14, 2025
    • Callslip / pickslip request viewer
      Java
      0000 Updated Feb 17, 2025
    • Gem for adding authentication to applications and securing services with Keycloak
      Ruby
      MIT License
      57003 Updated Jan 22, 2025
    • heritrix3

      Public
      Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
      Java
      Other
      760000 Updated Dec 1, 2024
    • Simple website to capture evaluation of different ways to search images.
      EJS
      0107 Updated Oct 10, 2024
    • AI pictures proof of concept: crawl Blacklight, build a Solr index with CLIP embeddings, and provide an exploratory web interface
      HTML
      1107 Updated Oct 10, 2024
    • doss-dash

      Public
      A dashboard for doss with pretty graphs
      JavaScript
      0001 Updated Oct 3, 2024
    • Converts HTTrack crawls to WARC files
      Java
      Apache License 2.0
      63220 Updated Aug 6, 2024
    • heimdall

      Public
      A Selenium-based web crawler (and archiver) that attempts to capture all resources of JS-heavy pages by recursively clicking applicable DOM elements and responding to DOM modifications.
      Java
      0000 Updated Jul 12, 2024
    • dnn-cli

      Public
      A command-line interface for training DNN classifiers using deeplearning4j.
      Java
      0100 Updated Jul 11, 2024
    • odin

      Public
      Web archiving domain harvest statistics web application prototype.
      Java
      0000 Updated Jul 4, 2024
    • loki

      Public
      A lightweight framework for running GWT-based applications with DOM-style UI control.
      Java
      0000 Updated Jun 11, 2024
    • thor

      Public
      A simple library for server-side utilities. Provides a mechanism to store and retrieve Java objects on the file system without the use of a database, and a mechanism to run tasks through a thread-safety assurance service.
      Java
      0000 Updated Jun 11, 2024
    • ArchivesSpace plugin for spreadsheet import
      Ruby
      1001 Updated May 30, 2024
    • Prototype for displaying statistics from web archiving harvests.
      0000 Updated May 23, 2024
    • marcgrep

      Public
      A slow-moving search across MARC data
      Clojure
      Eclipse Public License 1.0
      3000 Updated May 10, 2024