Skip to content

eli719/crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SimpleCrawlerDemo-Update

With HttpClient 4.5.10 and POI,Crawl data from websites after login and download data you select into Excel.

效果图 效果图

Knowledge points:

  • HttpClient
  • POI and JXL
  • Write and parse json file
  • SpringBoot Web related:
    • listener
    • WebMvcConfigurer
    • controller
  • SpringBoot task: @Schedled
  • Thymealf in simple use

Add Functions:

  • Watch log from page:
    • read log file with websocket
  • add a progress bar in the page to see the progress of tasks
  • add button to restart the program
  • add button to download files completed
  • CompletableFuture can be used to speed up tasks

ADD BaiDuAI OCR

  • Util package :cn.eli486.ocr
  • Example :cn.eli486.imp.CodeW703846Abstraction

About

SimpleCrawlerDemo in reading json file

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published