This is a collection of small tools for Python, and also part of my everyday work environment; maybe it has some widgets you need.
1、run a Scrapy spider quickly, without creating a full project environment
from minitools.scrapy import miniSpider

class MySpider(miniSpider):
    start_urls = ['http://www.baidu.com']

    def parse(self, response):
        print(response.url)

if __name__ == '__main__':
    MySpider.run(__file__)  # run this file directly, no `scrapy crawl` command needed
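For reference, miniSpider.run presumably wraps Scrapy's standard CrawlerProcess so a spider file can be executed as a plain script. A minimal sketch of that idea, using only vanilla Scrapy (an assumption about the wrapper, not the actual minitools internals):

from scrapy import Spider
from scrapy.crawler import CrawlerProcess

class PlainSpider(Spider):
    name = 'plain'
    start_urls = ['http://www.baidu.com']

    def parse(self, response):
        print(response.url)

if __name__ == '__main__':
    # CrawlerProcess starts the Twisted reactor and runs the spider
    # without needing a scrapy.cfg project on disk.
    process = CrawlerProcess(settings={'LOG_LEVEL': 'INFO'})
    process.crawl(PlainSpider)
    process.start()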
2、get a Request for the next page
from minitools.scrapy import miniSpider, next_page_request

class MySpider(miniSpider):
    def parse(self, response):
        # fill in a regex that captures the page number in the URL
        yield next_page_request(response, r'page=(\d+)')
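Presumably, next_page_request finds the page number in response.url with the given regex, increments it, and builds a Request for the resulting URL. A hypothetical sketch of that idea (an assumption for illustration, not the actual minitools implementation):

import re
from scrapy import Request

def build_next_page_request(response, pattern):
    # Hypothetical helper: bump the page number captured by `pattern`
    # in response.url and request the resulting URL.
    match = re.search(pattern, response.url)
    if not match:
        return None  # no page number found, no next page
    start, end = match.span(1)
    next_number = int(match.group(1)) + 1
    next_url = response.url[:start] + str(next_number) + response.url[end:]
    return Request(next_url)  # default callback is the spider's parse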