GitHub - Georgetown-University-Libraries/File-Analyzer: A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects

PURPOSE

Project Page: http://georgetown-university-libraries.github.io/File-Analyzer/

The File Analyzer and Metadata Harvester is a general purpose desktop (and command line) tool designed to automate simple, file-based operations. The File Analyzer assembles a toolkit of tasks a user can perform.

The tasks that have been written into the File Analyzer code base have been optimized for use by libraries, archives, and other cultural heritage institutions.

File Analyzer Wiki: https://github.com/Georgetown-University-Libraries/File-Analyzer/wiki

Demonstration Videos

History

This code has been derived from the NARA File Analyzer and Metadata Harvester which is available at https://github.com/usnationalarchives/File-Analyzer.

PREREQUISITES

JDK 1.8 or higher (for build)
JRE 1.8 or higher (for runtime)
(If you need to run with Java 6, see Releases for an older version)
Maven (or you will need to compile the modules manually)

INSTALLATION

Clone this code to your computer
Run mvn install
Detailed Installation Instructions: https://github.com/Georgetown-University-Libraries/File-Analyzer/wiki/Installation-instructions

DEPLOYMENTS

This code will build 3 flavors of the File Analyzer.

Core File Analyzer

All code runs from a self-extracting jar file

DSpace File Analyzer

This version of the file analyzer is a self-extracting jar file that references the core file analyzer jar file.
It contains tools for automating the creation of DSpace ingestion folders

Demo File Analyzer

This version contains extensions illustrating various capabilities of the File Analyzer.
This version of the file analyzer is a self-extracting jar file that references both the core and dspace file analyzer jar files.
This version of the application uses features of Apache Tika, BagIt, and Marc4j

Georgetown University Library IT Code Repositories

Name		Name	Last commit message	Last commit date
Latest commit History 470 Commits
bag-utils		bag-utils
core-utils		core-utils
core		core
demo		demo
doc		doc
dspace		dspace
.classpath		.classpath
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bag-utils

bag-utils

core-utils

core-utils

core

core

demo

demo

doc

doc

dspace

dspace

.classpath

.classpath

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

pom.xml

pom.xml

Repository files navigation

PURPOSE

Demonstration Videos

History

PREREQUISITES

INSTALLATION

DEPLOYMENTS

Core File Analyzer

DSpace File Analyzer

Demo File Analyzer

About

Releases 66

Packages

Languages

License

Georgetown-University-Libraries/File-Analyzer

Folders and files

Latest commit

History

Repository files navigation

PURPOSE

Demonstration Videos

History

PREREQUISITES

INSTALLATION

DEPLOYMENTS

Core File Analyzer

DSpace File Analyzer

Demo File Analyzer

About

Topics

Resources

License

Stars

Watchers

Forks

Languages