Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
BUG: read_csv with empty header row raising #12494
Comments
jreback
added Bug CSV Difficulty Intermediate Effort Low
labels
Feb 29, 2016
jreback
added this to the
Next Major Release
milestone
Feb 29, 2016
jreback
changed the title from
Exception during read_csv to BUG: read_csv with empty header row raising
Feb 29, 2016
|
thanks for the report. pull-requests welcome and desired! |
|
FYI: if you run your script, you actually will get a |
|
Addendum: It seems to be a certain number of comma's that break the parser. I did a quick test with the following function: def test(count):
s = StringIO(count * ',')
df = read_csv(s)
for i in range(1, 101):
try:
test(i)
except Exception as e:
print("{index} failed: {msg}".format(index=i, msg=str(e)))I get the following output (error messages truncated): 3 fails: Error tokenizing data...
6 fails: Error tokenizing data...
12 fails: Error tokenizing data...
24 fails: Error tokenizing data...
48 fails: Error tokenizing data...
96 fails: Error tokenizing data...So I guess what is it with |
gfyoung
referenced
this issue
Mar 1, 2016
Closed
BUG: Fixed grow_buffer to grow when capacity is reached #12504
gfyoung
added a commit
to gfyoung/pandas
that referenced
this issue
Mar 3, 2016
|
|
gfyoung + gfyoung |
8ba3dd0
|
jreback
modified the milestone: 0.18.0, Next Major Release
Mar 3, 2016
jreback
added a commit
that referenced
this issue
Mar 3, 2016
|
|
gfyoung + jreback |
c69037c
|
|
Also close the issue! |
jreback
closed this
Mar 3, 2016
VelizarVESSELINOV
commented
Mar 8, 2016
|
Thanks, much appreciated :) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
VelizarVESSELINOV commentedFeb 29, 2016
Code Sample, a copy-pastable example if possible
Expected Output
Current output:
Expected output:
output of
pd.show_versions()INSTALLED VERSIONS
commit: None
python: 3.5.1.final.0
python-bits: 64
OS: Darwin
OS-release: 15.3.0
machine: x86_64
processor: i386
byteorder: little
LC_ALL: None
LANG: None
pandas: 0.17.1
nose: None
pip: 7.1.2
setuptools: 18.3.2
Cython: None
numpy: 1.10.1
scipy: 0.16.1
statsmodels: None
IPython: 4.0.1
sphinx: None
patsy: None
dateutil: 2.4.2
pytz: 2015.7
blosc: None
bottleneck: None
tables: None
numexpr: None
matplotlib: 1.5.0
openpyxl: 2.3.2
xlrd: None
xlwt: None
xlsxwriter: None
lxml: None
bs4: 4.4.1
html5lib: None
httplib2: None
apiclient: None
sqlalchemy: None
pymysql: None
psycopg2: None
Jinja2: 2.8
None