trim config option? #241

amit777 · 2015-07-26T07:50:38Z

Hi, is there a simple way to trim whitespace from the ends of the fieldname as well as data values? This library has everything that I want so I'm guessing I'm just missing something simple.

thanks!

mholt · 2015-07-26T15:12:19Z

No, just do it yourself. :) str.trim()

bthorben · 2015-09-21T08:39:34Z

Doing it yourself is actually pretty unhandy. You have a neat option to output the data as dicts with field: value. If field always contains a space in the beginning (e.g. " password" or " username") it's something not so easy to correct.

I am writing this because I was about to use papa to parse files that looks like:

username, email, password
test-1, 1@example.com, Password1
test-2, 2@example.com, Password1
test-3, 3@example.com, Password1
test-4, 4@example.com, Password1
test-5, 5@example.com, Password1
test-6, 6@example.com, Password1
test-7, 7@example.com, Password1
test-8, 8@example.com, Password1
test-9, 9@example.com, Password1

mholt · 2015-09-21T13:31:22Z

Why is it not so easy to correct? Instead of results.data[i].password you do results.data[i].password.trim()

But you have to assume that the password doesn't have spaces on the edge. Could be a dangerous assumption. That's why I leave it up to the user to do. I'm not gonna go there.

bluej100 · 2015-09-21T15:31:55Z

I think he's saying he would have to do results.data[i][" password"], which is a little gross.

mholt · 2015-09-21T16:06:06Z

Oh, I see.

Unfortunately, the CSV spec specifically says: "Spaces are considered part of a field and should not be ignored." - if your CSV files are created with spaces after the commas, then the spaces are errors in the input and the generator needs to be fixed.

bthorben · 2015-09-23T15:26:24Z

Well, true, the spec says that, but you get all kind of wrong csv files all the time

KamalAman · 2016-08-16T14:49:37Z

While I think there should be an option for PapaParse so that you can enable trimming on the input, this problems is not too difficult to solve on your own:

Just pre-process you data with the following regex

"a ,b, c cc , d dd".replace(/\s*,\s*/g, ',')
//a,b,c cc,d dd

aendra-rininsland · 2016-08-17T18:10:31Z

I honestly think this should be reconsidered...

a. CSV files coming out of Excel quite often have superfluous spaces everywhere. Yes, that's valid for the format, but these are generally unintentional and break things further downstream.

b. The "preprocess with regex" approach suggested by @KamalAman modifies the input data, which is bad because it makes troubleshooting downstream errors more difficult.

c. Having to trim() every string coming out of PapaParse can require a lot of defensive programming.

I'm currently trying to use the step callback to do this, but all of my rows are now coming back as null for reasons I can't quite figure out...

lrossy · 2016-09-15T16:19:54Z

I used this guys trimObj() to solve this issue in my completeFn. Worked perfectly.

https://stackoverflow.com/questions/33510625/trim-white-spaces-in-both-object-key-and-value-recursively/33511005#33511005

rsand27 · 2017-09-15T17:34:45Z

I'm using Papa Parse (well Baby Parse for Node) to read local files from an upload folder. I had an issue with a space in front of a field that threw my app off. I get the data in Node using:

file = await BabyParse.parseFiles(`${ appDir }/${ req.file.path }`, {
  header: true,
  skipEmptyLines: true
});

To trim the white space and delete empty fields from each row object, I use this:

// Clean up the data
file.data.forEach(row => {
  for (let prop in row) {
    // Trim spaces from front and back 
    row[prop] = row[prop].trim();
    // Delete any empty fields
    if (row[prop] === '') {
      delete row[prop];
    }
  }
});

This returns the desired results for me before processing the data and saving it to MongoDB.

pokoli · 2017-09-18T08:00:12Z

Hi @rsand27, latests paparse version can be run also on Node, so I will recomend using PapaParse instead of BabyParse on Node.

If this does not work, please open a new issue.

larryboymi · 2018-12-12T21:35:26Z

What if you want to trim the parsed header? I just had a prepended \uFEFF sneak through in a header name that I could trim out with access to the header parsing function.

ttfreeman · 2019-01-09T05:33:39Z

@amit777 if you set {dynamicTyping: true} ,you shouldn't need to trime() white spaces. Papaparse will do it for you.

mtmacdonald · 2019-03-20T09:16:53Z

In case it helps anyone else, 4x version does have trimHeaders option (undocumented). And 5x version has transformHeader.

ataft · 2021-01-29T20:45:44Z

Has anybody been able to get this to work for the data values, not just the header? When I have spaces after the commas, the values have a space and double quotes. I'm using version 5.3.0.

For example, even with dynamicTyping, this CSV:
`
"Country","Alpha-2 code","Alpha-3 code","Numeric code","Latitude (average)","Longitude (average)"

"Australia", "AU", "AUS", "36", "-27", "133"
`

Gives me these values:

Alpha-2 code: " "AF""
Alpha-3 code: " "AFG""
Country: "Afghanistan"
Latitude (average): " "33""
Longitude (average): " "65""
Numeric code: " "4""

BilalIftikhar · 2022-09-14T07:21:18Z

here is code .
beforeFirstChunk: function(chunk) {
var rows = chunk.trim().replace(/\s*,\s*/g, ',');
return rows;
},

mislavmiocevic · 2022-12-05T10:58:19Z

I do not know if this is relevant anymore, but there is 'transform' function that you can use in config which is executed on every item.
https://www.papaparse.com/docs#config - transform

import { parse } from 'papaparse';

const { data } = parse('A, B\n1, 2', {
    transform: (value) => value.trim()
});

console.log(data);

P.S. I do not know how it will handle larger datasets and will it be slow.

mholt closed this as completed Jul 26, 2015

jimallman mentioned this issue Apr 20, 2022

Smarter CSV parsing OpenTreeOfLife/opentree#1279

Merged

janisdd mentioned this issue Aug 9, 2023

Quote character " ignored janisdd/vscode-edit-csv#124

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trim config option? #241

trim config option? #241

amit777 commented Jul 26, 2015

mholt commented Jul 26, 2015

bthorben commented Sep 21, 2015

mholt commented Sep 21, 2015

bluej100 commented Sep 21, 2015

mholt commented Sep 21, 2015

bthorben commented Sep 23, 2015

KamalAman commented Aug 16, 2016

aendra-rininsland commented Aug 17, 2016

lrossy commented Sep 15, 2016

rsand27 commented Sep 15, 2017 •

edited

Loading

pokoli commented Sep 18, 2017

larryboymi commented Dec 12, 2018

ttfreeman commented Jan 9, 2019

mtmacdonald commented Mar 20, 2019

ataft commented Jan 29, 2021 •

edited

Loading

BilalIftikhar commented Sep 14, 2022 •

edited

Loading

mislavmiocevic commented Dec 5, 2022 •

edited

Loading

trim config option? #241

trim config option? #241

Comments

amit777 commented Jul 26, 2015

mholt commented Jul 26, 2015

bthorben commented Sep 21, 2015

mholt commented Sep 21, 2015

bluej100 commented Sep 21, 2015

mholt commented Sep 21, 2015

bthorben commented Sep 23, 2015

KamalAman commented Aug 16, 2016

aendra-rininsland commented Aug 17, 2016

lrossy commented Sep 15, 2016

rsand27 commented Sep 15, 2017 • edited Loading

pokoli commented Sep 18, 2017

larryboymi commented Dec 12, 2018

ttfreeman commented Jan 9, 2019

mtmacdonald commented Mar 20, 2019

ataft commented Jan 29, 2021 • edited Loading

BilalIftikhar commented Sep 14, 2022 • edited Loading

mislavmiocevic commented Dec 5, 2022 • edited Loading

rsand27 commented Sep 15, 2017 •

edited

Loading

ataft commented Jan 29, 2021 •

edited

Loading

BilalIftikhar commented Sep 14, 2022 •

edited

Loading

mislavmiocevic commented Dec 5, 2022 •

edited

Loading