This script is meant to search for new papers, summarize them using an LLM and send out a pdf file via email. We further provide functionality to let it run once a day on windows.
In this version, we are using the free Mistral model.
If you want to change it, simply provide a different function to the make_summaries function.
conda create --name arxiv_update
conda activate arxiv_update
pip install -r requirements.txt
You can set everything you need in the config.json and last_request.json files:
keywords: Keywords to search on ArXiV. The lists represent a disjunktive normal form (DNF). That is, the lists are concatenated with OR, the entries of each list with AND.preferences: Your personal preferences for creating the summaries. This will be added to the LLM prompt.emails: To whom the emails will be sent.buffer_days: number of last days for which we store scraped results (to avoid uncovered time periods)smtp_*: SMTP stuff for sending the mail (see here for an example)download_dir: Where papers are downloaded internallysummary_dir: Where summaries are savedmax_results: Maximum number of papers per categorymax_pages: Maximum number of pages for LLM to read
last_date: here you can set the earliest date that needs to be considered by the searchrequest_buffer: only used internally to avoid uncovered time periods
On Windows, follow the instructions here using the run_updater.bat file.