App that transcribes audio to text (voice dictation) using the Google Gemini API and formats the result. Free alternative to Wispr Flow.
LabFlow is a Python-based desktop application designed to efficiently convert audio files to text and subsequently enhance the transcribed text for improved readability and coherence.
- Audio-to-Text Transcription: Converts audio files (such as MP3, WAV, etc.) into plain text using speech recognition technology.
- Automatic Text Enhancement: Once transcribed, the software processes the text to:
- Add punctuation (commas, periods, etc.).
- Correct capitalization (capital letters at the beginning of sentences).
- Improve the overall structure for easier reading.
- Transcription History: Saves a record of all transcriptions, allowing you to easily access them later.
- Usage Statistics: Tracks application usage.
Video Tutorial: https://www.youtube.com/watch?v=vwsnipfGLms
There are two ways to use this tool: by running the Python script directly or by using the executable file.
This option provides more flexibility if you want to modify the code.
Prerequisites:
- Python 3 installed on your system.
pip(Python's package manager) installed.
Steps:
-
Clone the repository:
git clone <your-repository-url>
-
Navigate to the project directory:
-
Create a virtual environment (recommended):
python -m venv .venv
-
Activate the virtual environment:
- On Windows:
.venv\Scripts\activate
- On macOS/Linux:
source .venv/bin/activate
- On Windows:
-
Install the dependencies: (Note: It is highly recommended to create a
requirements.txtfile listing all the necessary libraries for your project. If you have one, use the following command).pip install -r requirements.txt
-
Run the application:
python nuevo_mainsoft.py
If you just want to use the application without dealing with Python installations or dependencies, this is your best option.
- Navigate to https://github.com/joselabweb/LabFlow/releases/tag/executable
- Download
LabFlow.exefile. - Double-click on
LabFlow.exeto launch the application. That's it!
Here is a description of the most important files and their functions:
nuevo_mainsoft.py: The application's entry point. Run this file to start the program.nuevo_text_enhancer.py: Contains all the logic for processing and enhancing the transcribed text (adding punctuation, capitalization, etc.).nuevo_transcription_history.py: Manages the transcription history by saving and retrieving data.
nuevo_config.json: Configuration file. Important settings for the program's operation are stored here.nuevo_transcription_history.json: A JSON-format database where the history of all transcriptions is stored.nuevo_usage_statistics.json: A JSON file that saves data on how the application is used.logs/wisprflow_soft_nuevo.log: A log file that records information about events or errors that may occur during execution.dist/: The folder containing the ready-to-use executable file (.exe)..venv/: The folder for the Python virtual environment (if created).