Skip to content

DataFrame LoadCsv improvements #7213

@vsfeedback

Description

@vsfeedback

This issue has been moved from a ticket on Developer Community.


The DataFrame.LoadCsv method could be improved in a number of ways:

  • Use double when the precision in the data allows it. Now float is always used when the data is floating point.
  • Allow the user to define NaN in floating point columns data. In R code that we are also using, NA is typically used.
  • Speed improvements if possible.
  • I think there is a problem with defining culture info in the parameter. I have to do:
    Thread.CurrentThread.CurrentCulture = new CultureInfo("en-US"); before the call to LoadCsv to make floating point data with decimal points load correctly (in Sweden).

Original Comments

Feedback Bot on 8/1/2024, 10:56 PM:

(private comment, text removed)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Microsoft.Data.AnalysisAll DataFrame related issues and PRsenhancementNew feature or requestuntriagedNew issue has not been triaged

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions