BREAKING(encoding/csv): add return type to csv's parse and remove a parse func from args #1724

mitch292 · 2021-12-18T00:20:58Z

Resolves #881

Took from a comment in the issue by @dsherret and removed the ability to pass a parse arg to the function which gave it the responsibility of parsing and, optionally, data transformation. Let me know wha you think!

…rom the args

mitch292 · 2021-12-18T02:52:45Z

encoding/csv.ts

-   * This is executed on each entry of the header.
-   * This can be combined with the Parse function of the rows.
-   */
-  parse?: (input: string) => unknown;


I may be reading too much into the comment thread in #881. To keep the logic consistent I removed the parse function from columns as well. Should this be done?

Now, if the parse is removed from column options maybe the function signature for columns?: string[] | ColumnOptions[] should just be simplified to columns?: string[] as below will evaluate to the same.

parse(data, { columns: ['a', 'b', 'c']}); parse(data, { columns: [{ name: 'a' }, { name: 'b' }, { name: 'c' } ] });

I'm in favor of removing this

kt3k

@mitch292
Thanks for working on this! Left a few comments. I think we can do a little further better typing

kt3k · 2021-12-21T09:22:18Z

encoding/csv.ts

-   * This is executed on each entry of the header.
-   * This can be combined with the Parse function of the rows.
-   */
-  parse?: (input: string) => unknown;


I'm in favor of removing this

kt3k · 2021-12-21T09:24:44Z

encoding/csv.ts

 */
 export async function parse(
  input: string | BufReader,
  opt: ParseOptions = {
    skipFirstRow: false,
  },
-): Promise<unknown[]> {
+): Promise<string[][] | Record<string, unknown>[]> {


I think if we do overloading like the below, it would be able to return more correct type depending of the options:

export async function parse( input: string | BufReader ): Promise<string[][]>; export async function parse( input: string | BufReader, opt: Omit<ParseOptions, "columns" | "skipFirstRow">, ): Promise<string[][]>; export async function parse( input: string | BufReader, opt: Omit<ParseOptions, "columns"> & { columns: string[] | ColumnOptions[], }, ): Promise<Record<string, unknown>[]>; export async function parse( input: string | BufReader, opt: Omit<ParseOptions, "skipFirstRow"> & { skipFirstRow: true, }, ): Promise<Record<string, unknown>[]>; export async function parse( input: string | BufReader, opt: ParseOptions = { skipFirstRow: false, }, ): Promise<string[][] | Record<string, unknown>[]> { ... }

mitch292 · 2021-12-22T04:06:31Z

Added some function overloads and updated the readme as well!

dsherret

LGTM.

Yeah, people can just take the output and put it through their own parse function now in a way that works well with TypeScript. This is nicer than typing everything as unknown.

kt3k

LGTM. Nice work!

bx2 · 2022-04-17T21:04:31Z

Hi! This change renders the point of having column options pointless. We do allow to change column names (which is transforming the data) but we do not allow to parse the given column to, for instance, change the type of the field on the fly. I will give you an example - say you are parsing a file with monetary values passed in as micros:

YearsExperience,Salary
10, 23345213000

To get the decimal value you would divide the Salary field by 1000000. With the parse added per column, you can do all the basic transformations while loading the CSV, something like what pandas in python does (https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html). Currently, you cannot do that anymore.

Also, having column options just to rename the fields but being required to always specify ALL the fields seems somewhat contrary what you are trying to accomplish.

Now, while I do understand that the basic parser should be just that and only parse, but then let's skip the entire concept of column options completely and we should update the docs for it which still claims that it can parse rows. I can prep a MR for that if you wish.

mitch292 added 3 commits December 17, 2021 19:00

Adds a return type to csv's parse and removes a custom parse option f…

dd78424

…rom the args

Removes references to parse option from jsdoc

ae8253e

Removes unnecessary variable declaration in csv's parse

6837d3f

mitch292 requested review from bartlomieju and kt3k as code owners December 18, 2021 00:20

Remove parse option from columns for csv's parse function

6452c7f

mitch292 commented Dec 18, 2021

View reviewed changes

bartlomieju requested a review from dsherret December 18, 2021 16:52

kt3k reviewed Dec 21, 2021

View reviewed changes

mitch292 added 2 commits December 21, 2021 22:29

Add function overloads to csv's parse function

ec91cb8

Reconciles readme for csv's parse with current function state

fd9e89f

dsherret approved these changes Dec 22, 2021

View reviewed changes

kt3k approved these changes Dec 23, 2021

View reviewed changes

kt3k added this to the 1.18 milestone Dec 23, 2021

kt3k changed the title ~~feat(encoding): Adds return type to csv's parse and remove a parse func from args~~ BREAKING(encoding/csv): add return type to csv's parse and remove a parse func from args Dec 23, 2021

kt3k merged commit 373ecdf into denoland:main Jan 20, 2022

kt3k mentioned this pull request Jan 24, 2022

Changed behavior in: std/encoding/csv.ts #1840

Closed

kt3k mentioned this pull request Jun 6, 2022

feat(encoding/csv): restructure proposal #2291

Closed

6 tasks

kt3k mentioned this pull request Aug 25, 2022

feat(encoding/csv): remove ColumnOptions #2536

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BREAKING(encoding/csv): add return type to csv's parse and remove a parse func from args #1724

BREAKING(encoding/csv): add return type to csv's parse and remove a parse func from args #1724

mitch292 commented Dec 18, 2021

mitch292 Dec 18, 2021

kt3k Dec 21, 2021

kt3k left a comment

kt3k Dec 21, 2021

kt3k Dec 21, 2021

mitch292 commented Dec 22, 2021

dsherret left a comment

kt3k left a comment

bx2 commented Apr 17, 2022 •

edited

BREAKING(encoding/csv): add return type to csv's parse and remove a parse func from args #1724

BREAKING(encoding/csv): add return type to csv's parse and remove a parse func from args #1724

Conversation

mitch292 commented Dec 18, 2021

mitch292 Dec 18, 2021

Choose a reason for hiding this comment

kt3k Dec 21, 2021

Choose a reason for hiding this comment

kt3k left a comment

Choose a reason for hiding this comment

kt3k Dec 21, 2021

Choose a reason for hiding this comment

kt3k Dec 21, 2021

Choose a reason for hiding this comment

mitch292 commented Dec 22, 2021

dsherret left a comment

Choose a reason for hiding this comment

kt3k left a comment

Choose a reason for hiding this comment

bx2 commented Apr 17, 2022 • edited

bx2 commented Apr 17, 2022 •

edited