Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could you provide an example to load big dataset like Percona? #9943

Closed
vkingnew opened this issue Mar 28, 2019 · 8 comments
Closed

Could you provide an example to load big dataset like Percona? #9943

vkingnew opened this issue Mar 28, 2019 · 8 comments

Comments

@vkingnew
Copy link

Always to test tidb performance by POC,sometimes we will use the production data,sometimes we can use public dataset such as ONTIME,like percona. the Genaral data format is CSV,could you provide an example like Percona-lab which used for TiDB.
https://github.com/Percona-Lab/ontime-airline-performance/tree/master/mysql

@jackysp
Copy link
Member

jackysp commented Mar 29, 2019

I think TiDB could also use the same data set as Percona.
@vkingnew

@morgo
Copy link
Contributor

morgo commented Mar 29, 2019

The tidb manual uses public bikeshare as a example database.

I have started using ontime myself, I think it is a good dataset. Here is a blog post I wrote about it today: https://pingcap.com/blog/introduction-to-analytics-queries-for-the-mysql-dba/

@morgo
Copy link
Contributor

morgo commented Mar 29, 2019

Here is the manual link for bike share: https://pingcap.com/docs/bikeshare-example-database/ - look at the using explain page for example to see it in use

@vkingnew
Copy link
Author

Good work,i will try.

@vkingnew
Copy link
Author

vkingnew commented Apr 2, 2019

The tidb manual uses public bikeshare as a example database.

I have started using ontime myself, I think it is a good dataset. Here is a blog post I wrote about it today: https://pingcap.com/blog/introduction-to-analytics-queries-for-the-mysql-dba/

could you show the code how to download the dataset and how to load the data,and total linenumers of ONTIME ,and how much time to run your SQL?

@morgo
Copy link
Contributor

morgo commented Apr 2, 2019

I use this script for importing with only small modifications to correct paths. Note that it requires TiDB 3.0 Beta 1, because earlier versions did not support the syntax OPTIONALLY ENCLOSED BY.

@vkingnew
Copy link
Author

vkingnew commented Apr 3, 2019

Thanks,In TiDB 2.1.7 the data which loaded into tidb display error format by Percona's script.
And support tidb 2.1.7 upgrade to 3.0 Beta 1?

@ghost
Copy link

ghost commented Aug 3, 2020

I am going to close this issue as stale. Please feel free to re-open if you have any additional questions. Thanks!

@ghost ghost closed this as completed Aug 3, 2020
This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants