feat: migrate from Chrome DevTools Protocol to arXiv HTTP API#11
Merged
feat: migrate from Chrome DevTools Protocol to arXiv HTTP API#11
Conversation
This change enables the application to work as a single binary without requiring Chrome to be installed on the system. Changes: - Replace CDP-based scraping with arXiv's public HTTP API - Add arxiv_client.rs with API-based implementation - Remove src/cdp/ module (browser.rs, connection.rs, page.rs, mod.rs) - Remove src/scripts/ directory (JavaScript scraping scripts) - Remove src/arxiv_search.rs (old CDP-based implementation) - Update Cargo.toml: add quick-xml, chrono; remove tokio-tungstenite, futures, uuid - Update config.rs: remove headless and browser_path settings - Update main.rs: remove --head flag and CDP imports API features: - search(): Query arXiv API with pagination support - fetch(): Get paper details by ID with PDF text extraction - fetch_pdf(): Download raw PDF bytes - Date filtering with arXiv API's submittedDate format Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
This pull request sets up GitHub code scanning for this repository. Once the scans have completed and the checks have passed, the analysis results for this pull request branch will appear on this overview. Once you merge this pull request, the 'Security' tab will show more code scanning analysis results (for example, for the default branch). Depending on your configuration and choice of analysis tool, future pull requests will be annotated with code scanning analysis results. For more information about GitHub code scanning, check out the documentation. |
sonesuke
added a commit
that referenced
this pull request
Feb 21, 2026
sonesuke
added a commit
that referenced
this pull request
Feb 21, 2026
* Revert "feat: migrate from Chrome DevTools Protocol to arXiv HTTP API (#11)" This reverts commit 011989c. * test: improve CDP coverage by adding unit tests and E2E execution tests * feat: auto-start devcontainer in pr-healer script * chore: strengthen pre-commit and fix undetected clippy failures * chore: cleanup temporary files --------- Co-authored-by: Claude <claude@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
arxiv_client.rswith API-based implementationKey Changes
Benefits
This change enables the application to work as a single binary without requiring Chrome to be installed on the system.
Test plan
🤖 Generated with Claude Code