Skip to content

v0.7.0

Choose a tag to compare

@github-actions github-actions released this 27 Jun 18:05

CSV v0.7.0

Diff since v0.6.2

Closed issues:

  • CategoricalValues with non-Strings (#263)
  • Provided types only partially interpreted? (#431)
  • Memory usage for large files (#432)
  • Slow first run loading a tiny csv file (#464)
  • SIGBUS on CSV.write (#498)
  • Read CSV directly from URL (#506)
  • CSV.Rows chokes on large file on Windows (#519)
  • Cannot read a few rows of a file too big for memory (#542)
  • Using CSV.read on file handle breaks future reads (#549)
  • Correctly handling IO that does not support multithreading (#558)
  • First tab completion very slow after using CSV on julia v1.4.0 (#589)
  • Support for custom types and/or formats (#598)
  • Reading to a rowtable 20x slower than to a columntable (#614)
  • Full missing rows or random segfault when CSV with Windows newlines is read from Linux (#621)
  • possible test failure in upcoming Julia version 1.5 (#623)
  • Add a new function that reads data directly into Julia arrays (or static arrays) (#624)
  • error row positions for multithreading; (#625)
  • Single threaded CSV.read twice slower than single threaded R's fread (#627)
  • memory mapping failed with hyperthreading (#628)
  • MethodError: no method matching streamtypes(::Type{DataFrame}) (#629)
  • categorical option over-rules pool option (#630)
  • Cornercases in CSV.read with skipto and datarow keywords (#631)
  • CSV.read, then how to close the files opened? (#633)
  • MethodError: no method matching streamtypes(::Type{DataFrame}) (#634)
  • FilePathsBase 0.8 compat restriction causes a downgrade catastrophe (#636)
  • thoughts on the data structure choices (#637)
  • Memory bloat when reading floats (#640)
  • How to handle writing empty files? (#641)
  • incorrect line reported on parsing error (#642)
  • BoundsError writing a simple CSV (#643)
  • can't write file .ERROR: MethodError: no method matching schema(::DataFrame) (#644)
  • Forcing Measurements type causes parse error (#645)
  • CSV.read fails to read UTF-8 encoded file properly (#646)
  • Merge header and writeheader CSV.write arguments? (#655)
  • Switch (back) from at-threads to at-spawn to support multithreading from inside a Task (#657)

Merged pull requests:

  • Bump [compat] FilePathsBase="0.6,0.7,0.8,0.9" (#632) (@ablaom)
  • Refactor CSV internals to produce fully mutable columns by default (#639) (@quinnj)
  • Check for write buffer overflow and throw error (#647) (@quinnj)
  • Change promotion strategy for better memory usage (#648) (@quinnj)
  • Start work on supporting custom types (#649) (@quinnj)
  • Support lazystrings=true option to avoid fully allocating strings while parsing (#650) (@quinnj)
  • Cleanup limit support and allow it when multithreaded parsing (#651) (@quinnj)
  • Improve error row reporting (#652) (@quinnj)
  • Clean up skipto and datarow a bit (#653) (@quinnj)
  • Update the multithreaded chunking error message to be more helpful (#654) (@quinnj)
  • Deprecate Cmd and generic IO inputs to CSV.File/CSV.Rows (#658) (@quinnj)
  • Deprecate CSV.read (#659) (@quinnj)
  • Deprecate writeheader (#660) (@quinnj)
  • Switch from Threads-at-threads to Threads.at-spawn. Fixes #657 (#661) (@quinnj)
  • Deprecate the categorical keyword argument (#662) (@quinnj)
  • Improve CSV.Rows performance (#663) (@quinnj)
  • Lots of cleanup (#664) (@quinnj)
  • Add CSV.Chunks for iterating over chunks of large files (#665) (@quinnj)
  • Add some precompiles (#666) (@quinnj)