Scrapy crawler to collect data on the back catalog of songs listed for sale.
juno_crawler scrapy crawler May 24, 2017
README.md updated readme with info May 24, 2017
scrapy.cfg scrapy crawler May 24, 2017
setup.py scrapy crawler May 24, 2017

README.md

Juno Download Crawler

Crawls Juno Download and collects data on the entire back catalogue of music singles.

Fields collected:

  • Artist
  • Title
  • Record label
  • Catalog number
  • Release date
  • Music genre
  • Individual track names
  • MP3 sample URLs
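
As a rough illustration of how these fields fit together, the sketch below models one release as a plain Python dataclass. The class name `Release` is hypothetical and is not the project's own item class (the example output suggests the crawler uses an item called `JunoCrawlerItem`); only the field names are taken from this README.

```python
from dataclasses import dataclass, field, asdict
from typing import List, Tuple


@dataclass
class Release:
    """Hypothetical stand-in for one crawled release (not the project's class)."""
    artist: str
    title: str
    label: str
    catalog_number: str
    release_date: str  # kept as the raw string reported by the site
    genre: str
    # Each track pairs a display name with its MP3 sample URL.
    tracks: List[Tuple[str, str]] = field(default_factory=list)


release = Release(
    artist="CLEAR VIEW feat JESSICA",
    title="Tell Me",
    label="Songbird Holland",
    catalog_number="SB 215-0",
    release_date="10 Sep 08",
    genre="Progressive House",
    tracks=[
        ("Tell Me - (6:43)",
         "http://www.junodownload.com/MP3/SF1354749-02-01-01.mp3"),
    ],
)

# Convert to a plain dict, e.g. before serialising to JSON.
record = asdict(release)
```

Keeping `release_date` as a string mirrors the crawler's output; converting it to a real date is left to whatever consumes the data.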

Example output (JSON):

[
  {
    "_type": "JunoCrawlerItem",
    "catalog_number": "SB 215-0",
    "title": "Tell Me",
    "release_date": "10 Sep 08",
    "artist": "CLEAR VIEW feat JESSICA",
    "label": "Songbird Holland",
    "tracks": [
      [
        "Tell Me - (6:43)",
        "http://www.junodownload.com/MP3/SF1354749-02-01-01.mp3"
      ],
      [
        "Tell Me (Max Graham remix) - (8:49)",
        "http://www.junodownload.com/MP3/SF1354749-02-01-02.mp3"
      ]
    ],
    "genre": "Progressive House"
  }
]
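
The output above can be post-processed with the standard library alone. The following is a minimal sketch, not part of the crawler: the helper names `parse_track` and `parse_release_date` are hypothetical, and it assumes every track label ends in ` - (M:SS)` and every release date follows the `DD Mon YY` pattern seen in the sample. Labels that do not match are passed through with no duration.

```python
import json
import re
from datetime import datetime

# Assumed label shape: "<name> - (<minutes>:<seconds>)".
TRACK_RE = re.compile(r"^(?P<name>.*) - \((?P<min>\d+):(?P<sec>\d{2})\)$")


def parse_track(entry):
    """Split a [label, url] pair into name, duration in seconds, and URL."""
    label, url = entry
    match = TRACK_RE.match(label)
    if match is None:
        # Unknown label shape: keep the label, leave the duration unset.
        return {"name": label, "seconds": None, "url": url}
    seconds = int(match["min"]) * 60 + int(match["sec"])
    return {"name": match["name"], "seconds": seconds, "url": url}


def parse_release_date(text):
    """Parse a date such as '10 Sep 08' (assumes English month names)."""
    return datetime.strptime(text, "%d %b %y").date()


sample = json.loads("""
[
  {
    "_type": "JunoCrawlerItem",
    "catalog_number": "SB 215-0",
    "title": "Tell Me",
    "release_date": "10 Sep 08",
    "artist": "CLEAR VIEW feat JESSICA",
    "label": "Songbird Holland",
    "tracks": [
      ["Tell Me - (6:43)",
       "http://www.junodownload.com/MP3/SF1354749-02-01-01.mp3"],
      ["Tell Me (Max Graham remix) - (8:49)",
       "http://www.junodownload.com/MP3/SF1354749-02-01-02.mp3"]
    ],
    "genre": "Progressive House"
  }
]
""")

item = sample[0]
tracks = [parse_track(t) for t in item["tracks"]]
released = parse_release_date(item["release_date"])
```

Note that `%b` depends on the active locale, so the date helper may need adjusting on systems that are not configured for English month abbreviations.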