Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change CSV row representation, improve error handling & support for dynamic lookup #122

Merged
merged 8 commits into from
Apr 12, 2013
Merged

Conversation

ovatsus
Copy link

@ovatsus ovatsus commented Apr 9, 2013

  • Change csv row representation and improve error handling
    • The representation was previously just a string[] and on each cell access the conversion was being made (possibly repeatedly)
    • Now it's a tuple, and each row is converted as a block (but still each row on deman)
    • This improves the display on F# interactive and the debugger, and allows the csv rows to better display on tools that use reflection and support tuples, like generic data grids hooked into fsi
    • It also allows the IgnoreErrors flag to ignore rows with different data types on the columns, and not just different numbers of columns
    • With this change we achieve feature parity with the csv provider of TryFSharp.org
  • Support for dynamic lookup in CSV files

Gustavo Guerra added 7 commits April 9, 2013 20:30
* The representation was previously just a string[] and on each cell access the conversion was being made (possibly repeatedly)
* Now it's a tuple, and each row is converted as a block (but still each row on deman)
* This improves the display on F# interactive and the debugger, and allows the csv rows to better display on tools that use reflection and support tuples, like generic data grids hooked into fsi
* It also allows the IgnoreErrors flag to ignore rows with different data types on the columns, and not just different numbers of columns
* With this change we achieve feature parity with the csv provider of TryFShrap.org
In addition to the already supported "#N/A", add "NA" and ":"  to the defaults.
"NA" is the default in R datasets, ":" is the default in EU and UK government datasets
It turns out using ToString works exactly the same when displaying a single value, but not when displaying a seq or array of them
tpetricek added a commit that referenced this pull request Apr 12, 2013
Change CSV row representation, improve error handling & support for dynamic lookup
@tpetricek tpetricek merged commit 24fa7e1 into fsprojects:master Apr 12, 2013
@ovatsus ovatsus deleted the CsvImprovements branch April 12, 2013 23:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants