Issues: meilisearch/scrapix
Issues list
By default use cheerio instead of Puppeteer
breaking-change
The related changes are breaking for the users
enhancement
New feature or request
#113
opened Nov 9, 2024 by
qdequele
Remove the useless headless option
breaking-change
The related changes are breaking for the users
enhancement
New feature or request
#112
opened Nov 9, 2024 by
qdequele
Create Markdown scraping strategy
enhancement
New feature or request
#110
opened Nov 9, 2024 by
qdequele
Create Cheerio crawling backend option
enhancement
New feature or request
#109
opened Nov 9, 2024 by
qdequele
Create Puppeteer crawling backend option
enhancement
New feature or request
#108
opened Nov 9, 2024 by
qdequele
Create Playwright crawling backend option
enhancement
New feature or request
#107
opened Nov 9, 2024 by
qdequele
Detect/handle localized pages
enhancement
New feature or request
#105
opened Nov 9, 2024 by
qdequele
Load the sitemap as starter point for crawling.
enhancement
New feature or request
#102
opened Nov 9, 2024 by
qdequele
Provide option to slow or rate limit requests
enhancement
New feature or request
#99
opened May 10, 2024 by
klvs
Cannot run under Windows (path contains invalid characters)
bug
Something isn't working
#98
opened May 9, 2024 by
AXYZE9
Scrapix Docker image configuration: JSON string parsing and Liquid syntax compatibility
bug
Something isn't working
#95
opened Feb 20, 2024 by
CaroFG
user_agents in configuration file doesn't change HTTP User-Agent header
bug
#94
opened Feb 15, 2024 by
TonyRL
Add possibility to exclude selectors
enhancement
New feature or request
#93
opened Jan 23, 2024 by
CaroFG
Throw error when redis server is not answering
bug
Something isn't working
#56
opened Jul 3, 2023 by
bidoubiwa
Retrieve page titles from meta tags
enhancement
New feature or request
#53
opened Jun 29, 2023 by
Strift
Ensure same documents are not pushed more than once.
enhancement
New feature or request
#52
opened Jun 29, 2023 by
bidoubiwa
url_to_index does not work
bug
Something isn't working
needs more info
This issue needs a minimal complete and verifiable example
Handle pages not found
bug
Something isn't working
enhancement
New feature or request
#48
opened Jun 28, 2023 by
bidoubiwa