v0.1.3 - Improved preprint resolution
What's Changed
Added
- `--no-cache` flag to disable caching entirely for fresh lookups
- `--clear-cache` flag to clear existing cache files before running
- `--s2-api-key` flag for Semantic Scholar API authentication (also reads the `S2_API_KEY` env var)
- `MATCH_THRESHOLD` constant (0.85) in `Resolver` and `AsyncResolver` for consistent matching
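As a rough illustration of how the three flags might be wired together, here is a minimal `argparse` sketch. Only the flag names and the `S2_API_KEY` variable come from these notes; the program name `resolver-cli`, the helper names `build_parser` and `resolve_api_key`, and the rule that the flag takes precedence over the environment variable are assumptions made for the example, not the project's actual code.

```python
import argparse
import os
from typing import Mapping, Optional


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: the program name and help strings are illustrative.
    parser = argparse.ArgumentParser(prog="resolver-cli")
    parser.add_argument("--no-cache", action="store_true",
                        help="disable caching entirely for fresh lookups")
    parser.add_argument("--clear-cache", action="store_true",
                        help="clear existing cache files before running")
    parser.add_argument("--s2-api-key", default=None,
                        help="Semantic Scholar API key")
    return parser


def resolve_api_key(args: argparse.Namespace,
                    env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    # Assumption: an explicit flag wins over the S2_API_KEY environment variable.
    env = os.environ if env is None else env
    return args.s2_api_key or env.get("S2_API_KEY")


if __name__ == "__main__":
    args = build_parser().parse_args()
    print("cache disabled:", args.no_cache)
    print("clear cache first:", args.clear_cache)
    print("API key provided:", resolve_api_key(args) is not None)
```

Passing the environment as a parameter keeps the fallback logic easy to test without touching the real process environment.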
Fixed
- arXiv ID extraction from journal/howpublished fields (major bug fix)
  - Previously, entries like `journal={arXiv preprint arXiv:2310.15213}` would not extract the arXiv ID, causing cache key collisions and missed API lookups
  - Now correctly extracts arXiv IDs from all relevant fields
- Match threshold consistency: lowered from hardcoded 0.9 to 0.85 across all search stages, aligned with `FieldFiller.MATCH_THRESHOLD`
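The two fixes above can be sketched roughly as follows. This is not the project's implementation: the regular expression, the list of fields scanned, the helper names `extract_arxiv_id` and `titles_match`, and the use of `difflib.SequenceMatcher` as the similarity score are all assumptions for illustration. Only the 0.85 threshold and the `journal`/`howpublished` fields come from these notes.

```python
import re
from difflib import SequenceMatcher
from typing import Optional

MATCH_THRESHOLD = 0.85  # value stated in the release notes

# New-style arXiv identifiers (YYMM.NNNN or YYMM.NNNNN), optional version suffix.
ARXIV_ID = re.compile(r"arXiv:\s*(\d{4}\.\d{4,5})(v\d+)?", re.IGNORECASE)

# Assumed set of fields to scan; the notes name only journal and howpublished.
FIELDS = ("eprint", "journal", "howpublished", "note", "url")


def extract_arxiv_id(entry: dict) -> Optional[str]:
    """Return the first arXiv ID found in any scanned field, without version."""
    for field in FIELDS:
        match = ARXIV_ID.search(entry.get(field, ""))
        if match:
            return match.group(1)
    return None


def titles_match(a: str, b: str, threshold: float = MATCH_THRESHOLD) -> bool:
    """Case-insensitive similarity check against a single shared threshold."""
    score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
    return score >= threshold


if __name__ == "__main__":
    entry = {"journal": "arXiv preprint arXiv:2310.15213"}
    print(extract_arxiv_id(entry))
    print(titles_match("Attention Is All You Need", "attention is all you need"))
```

Defining the threshold once and passing it as a default argument is one way to keep every search stage on the same value instead of repeating a hardcoded number.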
Improved
- A real-world test on a 162-entry bibliography shows a 117% improvement in upgrade rate (12→26 papers) and a 31% reduction in failures (45→31)
Full Changelog: v0.1.2...v0.1.3