Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

driver: bad connection #169

Closed
redelang opened this issue Dec 19, 2024 · 11 comments
Closed

driver: bad connection #169

redelang opened this issue Dec 19, 2024 · 11 comments
Assignees

Comments

@redelang
Copy link

After upgrading from 1.3.1 to 1.5.2 the error 'driver: bad connection' occurs randomly. On random moments and random metrics (both for default metrics as provided custom metrics) as it seems.

The setup in which it is used:

  • RHEL 8 virtual machine
  • Using env variables for connection (DB_CONNECT_STRING, DB_USERNAME, DB_PASSWORD)
  • Using Oracle instant client 19.14.0 and 19.25.0 (provided by LD_LIBRARY_PATH)
  • Target database is PDB version 19.22.0.0

I have not made any changes in configuration when upgrading.

@markxnelson
Copy link
Member

Thanks for reporting - we have been seeing this too, and are investigating already. Have not identified root cause yet.

@markxnelson
Copy link
Member

Just to confirm - you are running the binary right, not running the exporter in a container?

@redelang
Copy link
Author

Yes indeed, running the binary.

@markxnelson
Copy link
Member

I have a suspicion it might be in godror layer. we call into that (using go sql) and that calls into our oracle odpi-c layer then that into instant client's static libs. i've seen this issue with 19, 21 and 23, and amd and arm, in container and not. might be to do with conflicts between go's connection pooling and instant client's. working on testing/debugging.

have yhou noticed any issues with queries just returning no results at all? sometimes i see a query that works fine in sqlcl/sqlplus but always has no results when run through this same stack of deps. usually more complex ones. would be good to know about that too if you see that. and if you do - if you can share a query, that'd be great. i can give email address if you don't want to post it

@redelang
Copy link
Author

Didn't really notice queries returning no results yet, but I didn't ran it for long yet. Have been focussing on the errors and put the upgrade on hold now because of those errors.

@markxnelson
Copy link
Member

Ok thanks - and these look pretty random right, does not seem to be any particular pattern to them?

@markxnelson markxnelson self-assigned this Dec 19, 2024
@redelang
Copy link
Author

I didn't discover a pattern, different queries both from the default and custom provided metrics ran into this error.

@markxnelson
Copy link
Member

Ok, thanks a lot for the info, it helps a lot, will take that into account while trying to get to the bottom of that issue.

@redelang
Copy link
Author

btw this issue: #168 was found because of this issue. I was surprised that for some metrics there was no error in the log even though it was exposed in the scrape errors metric that these failed. It appears to be caused by the setting ignorezeroresult.

@anders-swanson
Copy link
Member

FYI #168 is fixed by #173. It may take us some time to get to the bottom of this issue :)

@markxnelson
Copy link
Member

We believe we have this fixed in 1.5.4 - if you still see the error, please reopen, or open a new issue to let you know. Thanks very much for your input.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants