CSVsniffer

Companion repository for the paper:

Detecting CSV File Dialects by Table Uniformity Measurement and Data Type Inference (PDF)

An application of the new methodology outlined in the paper can be found in the CSV interface repository.

Introduction

The results from the research can be reproduced by running the RunTests method from the macro-enabled Excel workbook CSVsniffer.xlsm. To review the results for CleverCSV it is necessary to run the scripts from the clevercsv_test.py file. The text files with the results output are stored in the Current research and cleverCSV folders

Data

The CSV folder contains the files copied from the Pollock framework and other collected test files. Also the dataset used for the CSV wrangling research is available in the CSV_Wranglin folder. Note that only link to the files can be provided, in this last case,due to the authors holds the copyright.

The expect configuration for each set CSV tested is saved in the Dialect_annotations.txtand Manual_dialect_annotation.txt files.

Requirements

Below are the requirements for reproducing the experiments.

Microsoft Office Excel.
CleverCSV and all its dependencies.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CSV

CSV

CSV_Wrangling

CSV_Wrangling

Current research

Current research

cleverCSV

cleverCSV

.gitignore

.gitignore

CSVsniffer.xlsm

CSVsniffer.xlsm

Dialect_annotations.txt

Dialect_annotations.txt

README.md

README.md

clevercsv_test.py

clevercsv_test.py

Repository files navigation

CSVsniffer

Introduction

Data

Requirements

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
CSV		CSV
CSV_Wrangling		CSV_Wrangling
Current research		Current research
cleverCSV		cleverCSV
.gitignore		.gitignore
CSVsniffer.xlsm		CSVsniffer.xlsm
Dialect_annotations.txt		Dialect_annotations.txt
README.md		README.md
clevercsv_test.py		clevercsv_test.py

ws-garcia/CSVsniffer

Folders and files

Latest commit

History

Repository files navigation

CSVsniffer

Introduction

Data

Requirements

About

Topics

Resources

Stars

Watchers

Forks

Languages