iphswift/rehearser

Rehearser

Rehearser is an application that aids in reading research papers by extracting their text and converting it to audio.

Image of UI

(Image includes text from the "Demo Paper" [1])

Usage

The only requirements for using the application are Docker and Docker Compose.

To run the server, execute the command docker-compose up -d in the repository's root directory.

Once the server is running, open the application in a browser at localhost:8000. Note that the application needs ports 8000 (used by http-server) and 5000 (used by flask). If either port is already in use, the application will not run.
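If the application fails to start, a quick way to rule out a port conflict is to check both ports before running docker-compose. The script below is a minimal sketch and is not part of the repository: the helper names port_is_free and busy_ports are hypothetical, and it assumes the services are reached on 127.0.0.1.

```python
import socket

# Ports named in this README; the labels are only used for the printed message.
REQUIRED_PORTS = {8000: "http-server", 5000: "flask"}


def port_is_free(port, host="127.0.0.1"):
    """Return True if nothing accepts a TCP connection on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(0.5)
        # connect_ex returns 0 when the connection succeeds,
        # which means some process is already listening there.
        return sock.connect_ex((host, port)) != 0


def busy_ports(ports=REQUIRED_PORTS):
    """Return the subset of the given ports that are already in use."""
    return [port for port in ports if not port_is_free(port)]


if __name__ == "__main__":
    busy = busy_ports()
    if busy:
        for port in busy:
            print(f"Port {port} ({REQUIRED_PORTS[port]}) is already in use.")
    else:
        print("Ports 8000 and 5000 look free.")
```

Run it before starting the server; if it reports a busy port, stop the process using that port and try again.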

Current Features

  • Scrape a research paper to extract the narrational text.
  • Convert the scraped text to audio, producing segmented text-audio pairs aligned to the original text.
  • Provide a minimal front end for uploading documents, downloading the extracted text, and viewing the text-aligned audio with basic playback controls.
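To illustrate what "segmented text-audio pairs" could look like, here is a minimal sketch. It is an assumption for illustration only and does not describe the project's actual data format: the Segment class, its fields, the file names, and the segment_at helper are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One text-audio pair (hypothetical structure)."""
    text: str          # a chunk of the extracted text
    audio_file: str    # assumed path to the audio clip for that chunk
    duration: float    # clip length in seconds


def segment_at(segments, t):
    """Return the index of the segment playing at time t (seconds), or None."""
    elapsed = 0.0
    for index, segment in enumerate(segments):
        if elapsed <= t < elapsed + segment.duration:
            return index
        elapsed += segment.duration
    return None


# Example with made-up segments.
segments = [
    Segment("First sentence.", "seg_000.wav", 2.5),
    Segment("Second sentence.", "seg_001.wav", 3.0),
]
current = segment_at(segments, 3.0)  # 3.0 s falls inside the second clip
print(segments[current].text)
```

Keeping each chunk of text paired with its own clip is one way a front end could highlight the text that is currently being played.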

Technical Details

The application uses a few key components:

Disclaimer

This tool is still in an early iteration. Please use it with caution.

  1. The parsing may miss text (e.g., a few lines at the top of a page) or add undesired text (e.g., the contents of a figure).

  2. Citations are currently missing from the extracted text and audio.

Citation

The "demo paper" used in this project is:

[1] Meuschke, Norman, et al. "A benchmark of pdf information extraction tools using a multi-task and multi-domain evaluation framework for academic documents." International Conference on Information. Cham: Springer Nature Switzerland, 2023.
