Collects a movie's Baidu Index data for a particular time period.
- main.py: Calls BaiduIndex_Crawl.py and Get_date.py.
- BaiduIndex_Crawl.py: Main code that collects the data from Baidu Index.
- Get_date.py: Gets the movie's release date.
Based on Python 3.5 and Selenium. Install the following first:
- selenium
- pytesseract
- Pillow
- phantomjs
- chromedriver
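
Before running the crawler, it can help to confirm that everything in the list above is actually available. The following is a minimal sketch, not part of this repository: it assumes the three Python packages are importable as `selenium`, `pytesseract`, and `PIL` (the import name of Pillow), and that `phantomjs` and `chromedriver` are executables on `PATH`. The helper name `missing_dependencies` is hypothetical.

```python
import importlib.util
import shutil

# Package name -> import name (Pillow is imported as PIL).
PYTHON_MODULES = {"selenium": "selenium", "pytesseract": "pytesseract", "Pillow": "PIL"}
# Executables that are expected to be found on PATH (an assumption).
BINARIES = ["phantomjs", "chromedriver"]


def missing_dependencies(modules=PYTHON_MODULES, binaries=BINARIES):
    """Return the names of packages and executables that cannot be found."""
    missing = []
    for package, module in modules.items():
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module) is None:
            missing.append(package)
    for binary in binaries:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(binary) is None:
            missing.append(binary)
    return missing


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing: {}".format(", ".join(missing)))
    else:
        print("All dependencies found.")
```

The sketch uses `str.format` rather than f-strings so that it also runs on Python 3.5.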
Fill in account | star.sh | main.py |
---|---|---|
Open Get_data.py, find 'AccountList' on line 11, and fill in several accounts like this: ['account','passwd'] | Start with star.sh, which runs main.py to do the task | main.py calls Get_data.py and BaiduIndex_Crawl.py |
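
As an illustration of the account format described above, a filled-in `AccountList` might look like the sketch below. The credentials are placeholders, and `validate_accounts` is a hypothetical helper that is not part of this repository; it only checks that each entry has the `['account','passwd']` shape.

```python
# Placeholder credentials; replace them with real Baidu accounts.
AccountList = [
    ['account1', 'passwd1'],
    ['account2', 'passwd2'],
]


def validate_accounts(accounts):
    """Return True if the list is non-empty and every entry is a
    two-item [account, passwd] pair of non-empty strings."""
    return bool(accounts) and all(
        isinstance(entry, (list, tuple))
        and len(entry) == 2
        and all(isinstance(field, str) and field for field in entry)
        for entry in accounts
    )
```

Running a check like this before starting a long crawl catches a malformed entry early instead of partway through a login attempt.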
- Take 山楂树之恋 (Under the Hawthorn Tree) as an example.
- First, use its name to get the release date from MTime.
- Then use its name and date to get the data from Baidu Index.
- Store the information like this:
movie_name
[date1:data1,date2:data2....]
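
The flow and storage format described above can be sketched as follows. This is an illustration under stated assumptions, not the code in this repository: `collect` and `format_record` are hypothetical names, the two lookups are passed in as callables standing in for whatever Get_date.py and BaiduIndex_Crawl.py expose, and dates are assumed to be `YYYY-MM-DD` strings.

```python
def format_record(movie_name, index_by_date):
    """Render one movie as two lines:
    movie_name
    [date1:data1,date2:data2,...]
    """
    # Sort by date string so the output order is deterministic.
    pairs = ",".join(
        "{}:{}".format(date, value)
        for date, value in sorted(index_by_date.items())
    )
    return "{}\n[{}]".format(movie_name, pairs)


def collect(movie_name, get_release_date, crawl_index):
    """Look up the release date by name, fetch the index data by name
    and date, and return the formatted record."""
    release_date = get_release_date(movie_name)
    index_by_date = crawl_index(movie_name, release_date)
    return format_record(movie_name, index_by_date)


# Stand-in callables with placeholder values, only to show the call shape.
def fake_release_date(name):
    return "2000-01-01"


def fake_crawl_index(name, release_date):
    return {release_date: 10, "2000-01-02": 20}


if __name__ == "__main__":
    print(collect("movie_name", fake_release_date, fake_crawl_index))
```

Passing the two lookups in as arguments keeps the sketch independent of the real module interfaces, which are not shown here.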