wikiaudify

Generate audio summaries of Wikipedia articles using OpenAI and ElevenLabs

Introduction

This was a hackathon project made during the Wikimedia NL Mini Hackathon 2024 to generate audio summaries like the ones from NotebookLM. Obviously it's not as good as that one, but it makes quite enjoyable fun short audio conversations.

Here's an example of a generated audio summary about the article on Grilled cheese.

grilled_cheese.mp4

And you can find a transcription here.

Install

What you'll need:

An OpenAI API key
An ElevenLabs API key
Python 3.13+ (it probably works with older versions too, but no guarantees)

There is an option to use a local LLM (like Ollama) but i didn't get very good results, but you could try to make it work!

To use this script:

Clone this repo

git clone https://github.com/hay/wikiaudify.git

Make a virtual environment and install the requirements.txt

python -m venv env
source env/bin/activate
pip install -r requirements.txt

Copy the example-config.toml to a new file (e.g. test.toml) and fill in your API keys and other details
Try running it from the command line like this:

python generate.py -a "Grilled_Cheese" -q "At what temperature will my cheese melt?" -c test.toml

Note that the Wikipedia article you give with the -a option should have underscores, e.g. the path in the URL of the article.

By default this will generate two files in the root of this project: a summary.mp3 containing the summary and a summary.txt with a transcription.

Troubleshooting

If you add the -v (verbose) flag audio2text will give much more debug information.

All options

You'll get this when doing python generate.py -h

usage: generate.py [-h] [-a ARTICLE] [-c CONFIG] [-na] [-nt] [-o OUT]
                   [-ot OUT_TRANSCRIPT] [-q QUESTION] [-v]

Generate an audio summary of a Wikipedia article

options:
  -h, --help            show this help message and exit
  -a, --article ARTICLE
                        Article you want the audio summary to be about
  -c, --config CONFIG   Path to a TOML file with configuration
  -na, --no-audio       Don't generate audio output
  -nt, --no-transcript  Don't generate an audio transcript
  -o, --out OUT         Path of output MP3 file
  -ot, --out-transcript OUT_TRANSCRIPT
                        Path of output transcript
  -q, --question QUESTION
                        User question that will be included in the summary
  -v, --verbose         Show debug information

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
wikiaudify		wikiaudify
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
example-config.toml		example-config.toml
generate.py		generate.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wikiaudify

Introduction

Install

Troubleshooting

All options

License

About

Releases

Packages

Languages

License

hay/wikiaudify

Folders and files

Latest commit

History

Repository files navigation

wikiaudify

Introduction

Install

Troubleshooting

All options

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages