Coin name is mislabeled in dataset for duplicate-symbols #2
Comments
Actually, my prioritization logic appears flawed:
So:
Am I parsing this data wrong, and is there a better way to deal with these duplicate time series, or is there some extraneous data creeping in here? Thanks!
Ooooh, sorry, I figured out that this is due to coins on coinmarketcap that share a ticker: PRO, BTG, ACC, etc. Not sure of a correct way to distinguish them in the dataset... especially since
Hi Mike, you're spot on: the issue is due to several tokens sharing the same symbol. I didn't know how to go about resolving it, but then figured I'd take the slug I'm using to generate the URLs for scraping and use that as a unique identifier instead. The change I just committed should resolve the duplication issues, and there are also a couple of extra features included. Let me know how you go, thanks.
Awesome, the slug works great, thank you!
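For readers hitting the same problem, here is a minimal sketch of why keying on the slug helps. It is not the maintainer's actual change: the field names `slug`, `symbol`, `date`, and `close`, the slugs `coin-a` and `coin-b`, and all values are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical rows: field names, slugs, and values are illustrative only,
# not taken from the real dataset.
rows = [
    {"slug": "coin-a", "symbol": "PRO", "date": "12-13", "close": 1.0},
    {"slug": "coin-b", "symbol": "PRO", "date": "12-13", "close": 2.0},
    {"slug": "coin-a", "symbol": "PRO", "date": "12-14", "close": 1.5},
]

def group_series(rows, key):
    """Group rows into per-coin time series keyed by the given field."""
    series = defaultdict(list)
    for row in rows:
        series[row[key]].append(row)
    return dict(series)

# Keyed by symbol, two different coins collapse into a single series,
# which produces two datapoints for the same date.
by_symbol = group_series(rows, "symbol")

# Keyed by slug, each coin gets its own series with one datapoint per date.
by_slug = group_series(rows, "slug")
```

With the sample rows above, `by_symbol` holds a single mixed `PRO` series, while `by_slug` separates the two coins.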
Thanks for doing this work, greatly appreciated.
Was going to do some analysis of it on my own, and got confused by the presence of duplicates on some coins.
For example, up until 12-12, we have one datapoint per date, whereas after that there are two datapoints per date:
Only one has a non-empty `market` value... so I'm going to go with that. (I assume `market` refers to market cap? I thought at first it might be showing data from two different market exchanges or something.)
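The check and workaround described in this post can be sketched roughly as follows. Only the `market` field name comes from the post; the `symbol` and `date` field names and all sample values are assumptions made for illustration. As the comments in this thread point out, the duplicates actually come from different coins sharing a ticker, so filtering on `market` is at best a stopgap.

```python
from collections import Counter

# Hypothetical rows: only the `market` field name comes from the post;
# `symbol`, `date`, and all values are illustrative.
rows = [
    {"symbol": "PRO", "date": "12-12", "market": "100"},
    {"symbol": "PRO", "date": "12-13", "market": "110"},
    {"symbol": "PRO", "date": "12-13", "market": ""},
]

def duplicated_keys(rows):
    """Return the (symbol, date) pairs that occur in more than one row."""
    counts = Counter((r["symbol"], r["date"]) for r in rows)
    return sorted(key for key, n in counts.items() if n > 1)

def rows_with_market(rows):
    """Keep only the rows whose `market` value is non-empty."""
    return [r for r in rows if r["market"] not in ("", None)]

dupes = duplicated_keys(rows)      # dates that have more than one datapoint
filtered = rows_with_market(rows)  # the workaround: drop empty-market rows
```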