v0.14.0

ksanyok released this 26 Feb 17:22

· 119 commits to main since this release

7314c37

v0.14.0 -- Reliability, Analysis Tools & New APIs

New API Functions

humanize_sentences() -- per-sentence AI scoring with graduated intensity; only rewrites sentences above a configurable AI probability threshold
humanize_variants() -- generates 1-10 humanization variants with different random seeds, sorted by quality
humanize_stream() -- generator that yields humanized text chunk-by-chunk with progress tracking

New Analysis Modules (zero-dependency, offline)

perplexity_v2 -- character-level trigram cross-entropy model with cross_entropy() and perplexity_score() returning naturalness score (0-100) and verdict
dict_trainer -- corpus analysis for custom dictionary building with train_from_corpus() and export_custom_dict()
plagiarism -- offline originality detection via n-gram fingerprinting with check_originality() and compare_originality()

Pipeline Improvements

Error isolation -- each processing stage wrapped in _safe_stage() with try/except; failing stages are skipped gracefully instead of crashing the pipeline
Partial rollback -- pipeline records checkpoints after each stage; on validation failure, rolls back stage-by-stage to find the last valid state
Pipeline profiling -- stage_timings dict and total_time included in metrics_after for performance analysis

Bug Fixes & Code Quality

Fixed adversarial_calibrate intensity parameter (float 0-1 changed to int 0-100 to match API)
Added input sanitization: TypeError for non-str, ValueError for >500K chars, early return for empty text
Thread-safe lazy loading with double-checked locking on all module loaders
Instance-level plugins preventing cross-instance interference
Fixed humanize_sentences crash (detect_ai_sentences returns list, not dict)

Tests

1,604 tests -- up from 1,560 (44 new tests for all v0.14.0 features)
100% pass rate

Assets 2