Skip to content

Releases: tballison/commoncrawl-fetcher-lite

v1.0.0-alpha4 Development Build

01 Jun 16:19
90f49ab
Compare
Choose a tag to compare

Commits

  • 439b996: Bump jackson.version from 2.15.0 to 2.15.2 (dependabot[bot]) #23
  • 3a320b9: Bump checkstyle from 10.9.3 to 10.12.0 (dependabot[bot]) #24
  • 63ac51a: Bump jwarc from 0.21.0 to 0.22.0 (dependabot[bot]) #25
  • 94cbeaf: Add ability to file extraction from local index files. This fixes #27 (tallison)
  • 05c8869: Merge remote-tracking branch 'origin/main' (tallison)
  • dbef9f0: Update README.md, add link to AdvancedScenarios and update warning formatting. (tallison)
  • 90f49ab: Update README.md, add link to AdvancedScenarios and update warning formatting. (tallison)

v1.0.0-alpha3

01 Jun 16:14
aa2b962
Compare
Choose a tag to compare
v1.0.0-alpha3-release

v1.0.0-alpha2 Development Build

23 Mar 20:38
Compare
Choose a tag to compare
Pre-release

Commits

  • 079ae27: add checkstyle, forbiddenapis and maven enforcer (tballison)
  • 4ff3c45: revert to snapshot developement (tballison)
  • dd7a622: fix old parameter in default-config.json and drop reporting to every million records read (tballison)
  • bed2cd2: fix bug that let .yaml through to be processed, back off on logging to info and update documenation (tballison)
  • d89f36f: Handle reading index file exceptions more gracefully and prevent addition of ".json" on S3Emitter (tballison)
  • f5fafc8: Improve documentation (tballison)
  • bc76098: add mime_detected, truncated and extracted file length to logs (tballison)
  • 5eb61bc: Implement index fetcher and mime counter (tballison)
  • 7a5ec66: package organization clean ups and revert default-config.json (tballison)
  • d2bccab: prep next release (tballison)

v1.0.0-alpha1 Development Build

23 Mar 15:57
Compare
Choose a tag to compare
Pre-release
v1.0.0-alpha1-release