scrapy shell <url>
Opens an interactive shell with a response object we can use to access the page's data

print(response.text)
Prints the whole page content

response.css('div.author')
Returns a list of selector objects matching that div

response.css('div.author').extract()
Returns the actual HTML of the selected elements

response.css('div.author::text').extract()
Returns a list containing only the text of those elements

response.css('div.author::text')[0].extract()
Returns the text of the first matching element as a string
scrapy genspider <spider-name> <domain-name>
After running this, a file named <spider-name>.py is created in the current directory

scrapy runspider filename.py
Runs the spider file

scrapy runspider filename.py -o file-name.json
Runs the spider and saves the scraped items to file-name.json

more file-name.json
Shows the file content
sudo apt install docker.io
Installs Docker

sudo docker pull scrapinghub/splash
Downloads the Splash JavaScript rendering engine image

docker run -p 8050:8050 scrapinghub/splash
Runs Splash on port 8050

pip install scrapy-splash
Installs the scrapy-splash plugin for integrating Splash with Scrapy
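After installing, scrapy-splash also has to be enabled in the project's settings.py. A sketch following the plugin's documented settings (the port matches the docker run command above):

```python
# settings.py — scrapy-splash configuration
# Point Scrapy at the Splash instance started by docker
SPLASH_URL = "http://localhost:8050"

# Enable the scrapy-splash downloader middlewares
DOWNLOADER_MIDDLEWARES = {
    "scrapy_splash.SplashCookiesMiddleware": 723,
    "scrapy_splash.SplashMiddleware": 725,
    "scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware": 810,
}

# Enable the scrapy-splash spider middleware
SPIDER_MIDDLEWARES = {
    "scrapy_splash.SplashDeduplicateArgsMiddleware": 100,
}

# Use the Splash-aware duplicate filter
DUPEFILTER_CLASS = "scrapy_splash.SplashAwareDupeFilter"
```

With this in place, spiders can issue SplashRequest objects instead of plain Requests so pages are rendered by Splash before Scrapy sees them.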