Skip to content

Piste/ferrissETL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

ferrissETL

Extracting media recommendations from a podcast feed

To do

  1. Download XML
  2. Ingest as dict
  3. Download existing "DB" of media (uniqueness)
  4. Parse through dict
  5. for each link/item
    1. if already in DB, skip
    2. identify media type
    3. book probably has Amazon link? Goodreads?
    4. movie should have IMDB link
    5. Add to
  6. Append to DB

Releases

No releases published

Packages

 
 
 

Languages