I'm running into an issue extracting a dataframe from an HTML page when using pandas 0.23. Before my environment was updated, I was using pandas 0.22 and the issue did not occur. If I install a new environment with 0.22, the issue does not occur, so I think this is probably a regression. I've cached the relevant HTML at trailheads.html.txt.
Below I've included a subset of the resulting rows of the table to demonstrate what I mean by repeat. It seems the table has been repeated once, giving double the number of rows plus one (since the header is repeated as a row). I've excluded all rows except the top and bottom 2 of each copy for brevity.
| Date | Happy Isles->Little Yosemite Valley | Happy Isles->Sunrise/Merced Lake (pass through) | Glacier Point->Little Yosemite Valley | Sunrise Lakes | Lyell Canyon |
| --- | --- | --- | --- | --- | --- |
| 7/28/2018 | 0 | 0 | 0 | 0 | 0 |
| 7/29/2018 | 0 | 0 | 0 | 0 | 0 |
| 12/17/2018 | 18 | 6 | 6 | 9 | 15 |
| 12/18/2018 | 18 | 6 | 6 | 9 | 15 |
| Date | Happy Isles->Little Yosemite Valley | Happy Isles->Sunrise/Merced Lake (pass through) | Glacier Point->Little Yosemite Valley | Sunrise Lakes | Lyell Canyon |
| 7/28/2018 | 0 | 0 | 0 | 0 | 0 |
| 7/29/2018 | 0 | 0 | 0 | 0 | 0 |
| 12/17/2018 | 18 | 6 | 6 | 9 | 15 |
| 12/18/2018 | 18 | 6 | 6 | 9 | 15 |
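For anyone who wants to check whether their own result shows the same symptom, here is a minimal sketch. It does not call `read_html` or use the cached HTML. Instead it rebuilds the abbreviated rows shown above by hand, as strings, and tests for the "rows, header as a row, same rows again" pattern (2n + 1 rows for an n-row table). The helper name `first_copy_if_doubled` is hypothetical and not part of pandas, and treating the second half as a pure repeat is an assumption based only on the excerpt above.

```python
import pandas as pd

COLUMNS = [
    "Date",
    "Happy Isles->Little Yosemite Valley",
    "Happy Isles->Sunrise/Merced Lake (pass through)",
    "Glacier Point->Little Yosemite Valley",
    "Sunrise Lakes",
    "Lyell Canyon",
]

# The four data rows quoted in the issue, kept as strings for this illustration.
ROWS = [
    ["7/28/2018", "0", "0", "0", "0", "0"],
    ["7/29/2018", "0", "0", "0", "0", "0"],
    ["12/17/2018", "18", "6", "6", "9", "15"],
    ["12/18/2018", "18", "6", "6", "9", "15"],
]

# Hand-built stand-in for the doubled result: rows, header as a row, rows again.
doubled = pd.DataFrame(ROWS + [COLUMNS] + ROWS, columns=COLUMNS)


def first_copy_if_doubled(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: return the first copy if df has the doubled pattern.

    The pattern is n rows, one row equal to the column names, then the same
    n rows again. Any other frame is returned unchanged.
    """
    header = [str(c) for c in df.columns]
    rows = [[str(v) for v in row] for row in df.itertuples(index=False, name=None)]
    half, remainder = divmod(len(rows), 2)
    if remainder == 1 and rows[half] == header and rows[:half] == rows[half + 1:]:
        return df.iloc[:half].reset_index(drop=True)
    return df


cleaned = first_copy_if_doubled(doubled)
print(len(doubled), len(cleaned))
```

This only works around the symptom after the fact; it says nothing about why the parser produces the repeat.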
I can reproduce this in 0.23.3 but not on master. Can you verify? I don't see a PR that specifically targeted this behavior, but it appears to already be handled.