Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple, (almost) undistinguishable entries in tonies.json? #55

Closed
steve8x8 opened this issue Jan 9, 2023 · 1 comment
Closed

Multiple, (almost) undistinguishable entries in tonies.json? #55

steve8x8 opened this issue Jan 9, 2023 · 1 comment

Comments

@steve8x8
Copy link
Contributor

steve8x8 commented Jan 9, 2023

While testing a new attempt at the track scraper, I found this (using a json_pp pretty-printed version of the "official" file):

$ grep -i -C5 the.birth.of tonies.json 
  {
    "no": "297",
    "model": "10000174",
    "audio_id": [],
    "hash": [],
    "title": "Maya The Bee - The birth of Maya",
    "series": "Maya The Bee",
    "episodes": "The birth of Maya",
    "tracks": [],
    "release": "0",
    "language": "en-gb",
    "category": "audio-play",
    "pic": "https://08ee523e746768fd7148-f76a52ba8f0c340564df978383fc4de2.ssl.cf3.rackcdn.com/10000227-50000588-b-5zuTgL8H.png",
--
  {
    "no": "319",
    "model": "10000227",
    "audio_id": [],
    "hash": [],
    "title": "Maya The Bee - The birth of Maya",
    "series": "Maya The Bee",
    "episodes": "The birth of Maya",
    "tracks": [],
    "release": "1585699200",
    "language": "en-gb",
    "category": "Audiobook",
    "pic": "https://tonies.com/assets/cache/10000227-50000588-b-5zuTgL8H.0e8873e5.png",

Of course there's only one page https://tonies.com/en-gb/tonies/maya-the-bee/the-birth-of-maya/ - which entry does this belong to, and what about the other one? (Apparently the single page refers to the 10000174 model, not the 10000227 one?)
It looks like the scraper should also get the model ID for matching?

@steve8x8
Copy link
Contributor Author

Usually, the "pic" image filename starts with the "model" string (there are examples where there's no number).

In this case, the "model":"10000174" seems to be the right one though, and should get updated with the "release" timestamp from the "10000227" one.

The "pic"s all appear to be hosted on *.rackcdn.com, US ones on shopify.com.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant