Skip to content

Checks correctness of generated CSV data, including within individual files (e.g. number of columns, data format, data ranges) and across a collection of files (e.g. all expected files are present, no dangling relationships in graph)

License

ldbc-dev/ldbc_datachecker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LDBC Data Checker

Build

mvn clean package -Dmaven.compiler.source=1.6 -Dmaven.compiler.target=1.6

Run

java -cp datachecker-0.1-SNAPSHOT.jar com.ldbc.datachecker.socialnet.SocialNetCheck -d <path> [-l] [-t]
    -d,--dir <path>     ldbc_socialnet_dbgen directory path
    -l,--log            Log errors to csv file
    -t,--terminate      Terminate on error

Where:

  • -d: is the top level directory of the ldbc_socialnet_dbgen project
    • It likely looks something like ...ldbc_socialnet_bm/ldbc_socialnet_dbgen/
  • -t: specifies how datachecker should behave when an invalid data is encountered
    • true --> throw an exception with descriptive error message
    • false --> log a descriptive error message
  • -l: (only used if -t is false) specifies if errors should be written to validation_errors.csv
    • true --> datachecker logs errors to, both, console and validation_errors.csv
    • false --> datachecker logs errors to console only

About

Checks correctness of generated CSV data, including within individual files (e.g. number of columns, data format, data ranges) and across a collection of files (e.g. all expected files are present, no dangling relationships in graph)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages