blockScraper implementation #88

@lurenss

Description

Is your feature request related to a problem? Please describe.
A scraper pipeline capable of retrieving all the similar (repeated) blocks on a page, for example product listings on e-commerce sites, forecast entries on weather sites, or results on flight-booking sites.

Describe the solution you'd like
I found this paper: https://www.researchgate.net/publication/261360247_A_Web_Page_Segmentation_Approach_Using_Visual_Semantics
It deals specifically with this issue.
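
To make the request more concrete, here is a minimal sketch of what "retrieving similar blocks" could mean in practice. It is not the visual-semantics method from the linked paper, and it is not an existing API of this project: it is a much simpler structural heuristic that groups sibling elements sharing the same tag, classes, and child tags. All names (`Node`, `TreeBuilder`, `find_repeated_blocks`, `min_repeats`) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Elements that never have a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """Hypothetical minimal DOM node used only for this sketch."""

    def __init__(self, tag, classes, parent=None):
        self.tag = tag
        self.classes = classes
        self.parent = parent
        self.children = []  # child elements only
        self.content = []   # child elements and text, in document order

    def signature(self):
        # Assumption: blocks are "similar" when tag, classes and
        # direct child tags all match.
        return (self.tag, self.classes, tuple(c.tag for c in self.children))


class TreeBuilder(HTMLParser):
    def __init__(self):
        super().__init__()
        self.root = Node("root", ())
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        classes = tuple(sorted((dict(attrs).get("class") or "").split()))
        node = Node(tag, classes, self.current)
        self.current.children.append(node)
        self.current.content.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            self.current.content.append(data.strip())


def text_of(node):
    """Collect the text fragments of a node in document order."""
    out = []
    for item in node.content:
        out.extend([item] if isinstance(item, str) else text_of(item))
    return out


def find_repeated_blocks(html, min_repeats=3):
    """Return groups of sibling blocks that share a structural signature."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    groups, stack = [], [builder.root]
    while stack:
        parent = stack.pop()
        by_sig = defaultdict(list)
        for child in parent.children:
            if child.children:  # skip leaf elements such as plain links
                by_sig[child.signature()].append(child)
        for siblings in by_sig.values():
            if len(siblings) >= min_repeats:
                groups.append([text_of(n) for n in siblings])
        stack.extend(parent.children)
    return groups


sample_html = """
<ul>
  <li class="item"><h2>Shoes</h2><span>$40</span></li>
  <li class="item"><h2>Hat</h2><span>$15</span></li>
  <li class="item"><h2>Scarf</h2><span>$22</span></li>
</ul>
"""
print(find_repeated_blocks(sample_html))
```

A purely structural signature like this would miss blocks that look alike visually but differ in markup, which is the gap the paper's visual approach is meant to address.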

Describe alternatives you've considered
None.

Additional context
[Image: Screenshot 2024-04-27 at 15 04 05]
