ingest2parquet Readme file needs to be cleaned up #47
Labels
bug
Something isn't working
documentation
Improvements or additions to documentation
fixed
Marks an issues as fixed in the dev branch
There are still references to IBM bluepile in the Readme (command line options)
AST string containing input/output paths. input_folder: Path to input folder of files to
be processed output_folder: Path to output folder of processed files Example: {
'input_folder': '/cos-optimal-llm-pile/bluepile-
processing/rel0_8/cc15_30_preproc_ededup', 'output_folder': '/cos-optimal-llm-
pile/bluepile-processing/rel0_8/cc15_30_preproc_ededup/processed' }
In the section: Run the script via command-line, shouldn't it be:
python ingest2parquet_local.py , instead of python ingest2parque.py ?
The text was updated successfully, but these errors were encountered: