slow performance in MS Windows #5

@giantmoa

Hi there,
The following example from the package documentation runs very slowly on Windows.

julia> using InMemoryDatasets

julia> ds = Dataset(rand([1.1,2.2,3.4], 100, 100000), :auto);

julia> filewriter("_tmp.csv", ds, buffsize = 2^25, lsize = 500000);

julia> @time ds = filereader("_tmp.csv", buffsize = 2^21, lsize = 2^20, types = fill(Float64, 10^5));
  1.163346 seconds (900.02 k allocations: 180.966 MiB)

julia> @time ds = filereader("_tmp.csv", buffsize = 2^21, lsize = 2^20, guessingrows = 2);
  1.803125 seconds (4.10 M allocations: 289.193 MiB, 2.86% gc time)

The cause appears to be float parsing: parsing floats in Base seems to be very slow on Windows.

parse(Float64, "32423") is about 20 times slower than parse(Int, "32423") on Windows (on OSX they take almost the same time).
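
A minimal sketch for checking the parse timings on a given machine, using only Base. The helper name `time_parse` and the iteration count are assumptions made for illustration and are not part of InMemoryDatasets; the ratio it prints will vary with the OS and Julia version.

```julia
# Hypothetical helper: time n repeated calls of parse(T, s) and
# return (seconds per call, accumulated sum of the parsed values).
function time_parse(::Type{T}, s::AbstractString; n::Int = 10^6) where {T}
    parse(T, s)                # warm-up call so compilation is not timed
    acc = zero(T)
    t = @elapsed for _ in 1:n
        acc += parse(T, s)     # accumulate so the result is actually used
    end
    return t / n, acc
end

t_float, sum_float = time_parse(Float64, "32423")
t_int, sum_int = time_parse(Int, "32423")

println("parse(Float64, ...): ", round(t_float * 1e9, digits = 1), " ns per call")
println("parse(Int, ...):     ", round(t_int * 1e9, digits = 1), " ns per call")
println("Float64 / Int ratio: ", round(t_float / t_int, digits = 1))
```

Running the same script on Windows and on OSX should show whether the gap is specific to the platform rather than to `filereader` itself.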
