linktrace 0.2.0
This release tightens up the first public PyPI package release with rebrand cleanup, crawler correctness fixes, and improved robots.txt behavior.
Highlights
- Completed remaining
WebCrawler→linktracecleanup in developer commands. - Fixed coverage and demo commands in the
justfileto target the newlinktracepackage. - Improved robots.txt handling by checking whether a URL is allowed before fetching.
- Fixed per-document link collection so links discovered on one page no longer leak into later
Documentobjects. - Preserved the simple public API:
from linktrace import Spider.
Fixes
just test-covnow reports coverage againstlinktrace.just runnow runspython -m linktrace.Spider.parse_document()now uses a document-localfound_linkslist instead of crawler-wide link state.- Disallowed URLs are skipped before fetch and represented with a
403status document.
Upgrade
pip install --upgrade linktrace