Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

csv2tsv: Discard UTF-8 Byte Order Mark (BOM) #302

Merged
merged 3 commits into from
Sep 7, 2020

Conversation

jondegenhardt
Copy link
Contributor

This PR changes csv2tsv to detect and discard a UTF-8 Byte Order Mark found at the beginning of an input file.

This PR also adds additional unit tests for the new buffered version of csv2tsv in the previous PR #301. A couple edge bugs were found while writing the new unit tests, these were fixed as well.

@codecov-commenter
Copy link

Codecov Report

Merging #302 into master will increase coverage by 0.01%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #302      +/-   ##
==========================================
+ Coverage   99.35%   99.36%   +0.01%     
==========================================
  Files          18       18              
  Lines        6792     6941     +149     
==========================================
+ Hits         6748     6897     +149     
  Misses         44       44              
Impacted Files Coverage Δ
csv2tsv/src/tsv_utils/csv2tsv.d 100.00% <100.00%> (ø)

@jondegenhardt jondegenhardt merged commit 615914f into eBay:master Sep 7, 2020
@jondegenhardt jondegenhardt deleted the csv2tsv-drop-utf8-bom branch September 7, 2020 21:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants