Skip to content

tivvit/EventCrawlerCZ

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

  • GAE application
  • BeautifulSoap for parsing

normalizes event data from these pages

stores them in datastore (if not duplicate)

crawls source pages every 4 hours

  • check title page for new events
  • crawls event detail page

print upcoming ordered events (from all sources)