-
-
Notifications
You must be signed in to change notification settings - Fork 17.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Segfault in version 1.0.1, read_parquet after creating a clickhouse odbc connection #31981
Comments
Do you have the same issue with a stable version of pyarrow? We don't have a lot of C++ internally so I think root cause might be elsewhere |
this is from pyarrow pls open an issue there |
Are you using the same pyarrow version in both environments? |
Yes, this is the same pyarrow version in both cases. I'll open an issue there. Thank you. Stacktrace:
|
For reference, the arrow issue is here: https://issues.apache.org/jira/browse/ARROW-7873 |
We started getting segfaults in upgraded versions of pandas and tracked it down to an interaction with making odbc connections. The specific odbc driver is for Clickhouse built in Dec19 from:
The core dump is reproducable in the example below:
Code Sample, a copy-pastable example if possible
Problem description
In the code above, creating the clickhouse odbc connection (not using it) leads to a segfault when reading the parquet file:
This does not happen in pandas 0.25.3 but does in pandas 1.0.1.
Expected Output
No core dump.
Odbc driver dependencies
Output of
pd.show_versions()
[paste the output of
pd.show_versions()
here below this line]INSTALLED VERSIONS
commit : None
python : 3.6.9.final.0
python-bits : 64
OS : Linux
OS-release : 3.10.0-1062.4.3.el7.x86_64
machine : x86_64
processor : x86_64
byteorder : little
LC_ALL : None
LANG : None
LOCALE : None.None
pandas : 1.0.1
numpy : 1.18.1
pytz : 2019.3
dateutil : 2.8.0
pip : 9.0.1
setuptools : 45.2.0
Cython : 0.29.14
pytest : 5.3.2
hypothesis : 5.1.2
sphinx : None
blosc : None
feather : None
xlsxwriter : None
lxml.etree : None
html5lib : 0.999999999
pymysql : None
psycopg2 : None
jinja2 : 2.11.1
IPython : 7.12.0
pandas_datareader: None
bs4 : None
bottleneck : None
fastparquet : None
gcsfs : None
lxml.etree : None
matplotlib : 3.1.2
numexpr : None
odfpy : None
openpyxl : 3.0.2
pandas_gbq : None
pyarrow : 0.15.1.dev539+g8cf0c8e0a
pytables : None
pytest : 5.3.2
pyxlsb : None
s3fs : None
scipy : 1.4.1
sqlalchemy : 1.3.11
tables : None
tabulate : None
xarray : None
xlrd : 1.2.0
xlwt : None
xlsxwriter : None
numba : None
The text was updated successfully, but these errors were encountered: