Skip to content

Releases: CypherousSkies/reading-for-listeners

r4l v0.0.4post2 Audio Distortion Hotfix

03 Nov 03:20
Compare
Choose a tag to compare

Fixed vocoder issue creating cursed audio.
The outputs are now substantially smoother and take much less time.
If you used 0.0.4 and had issues, this should fix them.
pypi link

r4l v0.0.4

27 Oct 01:15
Compare
Choose a tag to compare

Quality of Life & Memory Awareness

tl;dr This is a small update, I'm still working on #9 and #13, which will be fixed by 0.0.5 and 0.1.0, respectively. Otherwise, mostly quality of life adjustments and small fixes to minor bugs.

Memory

  • I added a few lines to decrease memory usage by dropping the wav file in RAM to compressed mp3 whenever this process is using more than 60% of available ram (this will be configurable in cl/ui later, is configurable in api)

QOL

  • The decoder steps are now dependent on the sentence being read, which decreases the number of cursed tangents due to short sentences.
  • Fixed sentence splitting issues
  • Added some incremental changes to solve header/footer issues (openCV solution next version)

Stress tests

I unintentionally ran this for 50+ hours on texts and it survived the affair, not even crashing out! I'm very proud of it :)
pypi link

r4l v0.0.3

05 Oct 00:48
Compare
Choose a tag to compare

Slight hotfix -- I forgot to pass language into ocrmypdf :/
pypi release

R4L v0.0.2

04 Oct 23:26
Compare
Choose a tag to compare

Quick update! Now supports non-english languages!

Reading for Listeners (r4l) v0.0.1

04 Oct 22:49
Compare
Choose a tag to compare

First release! Big hype!
Features:

  • Easy CLI: r4l [--in_path in/] [--out_path out/] will convert all (english language) pdfs, txts, and muses in the folder in/ into mp3 in the folder out/
  • OCR improvement API: r4l.util.text.TextProcessor is all setup to do the hard work of scanning PDFs and correcting their text!