Perspicacité (French for "insight") is an AI-powered research assistant designed for scientists, researchers, and students who want to find and understand academic literature without getting lost in hundreds of papers.
Perspicacité helps you:
- 🔍 Search across academic databases (PubMed, arXiv, OpenAlex, and more)
- 📚 Build personal knowledge bases from papers you care about
- 🤖 Get AI-powered answers grounded in real research papers
- 💾 Save and organize papers for future reference
- 📖 Download and read full-text papers when available
You need Python 3.12 installed on your computer. Then:
# Download the project
git clone <repository-url>
cd perspicacite
# Install dependencies using uv (recommended)
uv sync --dev
# Or if you don't have uv, use pip:
# The ".[dev]" means "install this package with development dependencies"
pip install -e ".[dev]"Copy the example environment file and add your API keys:
cp .env.example .envEdit .env and add at least one of these:
| Service | Why you need it | Get a key at |
|---|---|---|
| DeepSeek | AI model for answering questions | DeepSeek |
| OpenAI | Alternative AI model | OpenAI |
| Anthropic | Alternative AI model | Anthropic |
💡 Don't have API keys? Start with DeepSeek - they offer free credits for new users.
If you used uv in Step 1:
uv run python web_app_full.pyIf you used pip in Step 1:
python web_app_full.pyThen open your browser and go to: http://localhost:8000
In the left sidebar, you'll see a "Knowledge Base" section:
- "No KB (web search only)" - Searches the entire web for papers
- Your own KBs - Searches only papers you've added
To create a new Knowledge Base:
- Click "+ Create new KB"
- Enter a KB name and drag-and-drop a
.bibfile - Click "Create from BibTeX" to import papers
Type your research question in the chat box. Examples:
- "What are the effects of green tea extract on metabolism?"
- "How is feature-based molecular networking used in metabolomics?"
- "Compare transformer models to CNNs for medical imaging"
Select a mode from the dropdown:
| Mode | Best for | Speed |
|---|---|---|
| Basic | Quick answers from your KB | Fast ⚡ |
| Advanced | Better answers with query expansion | Medium |
| Profound | Deep research with multiple cycles | Slower |
| Agentic | Complex questions needing web search | Variable |
| Literature Survey | Systematic review of a research field | Slower |
Perspicacité will:
- Show its "thinking process" (click ▶ to expand)
- Search relevant papers
- Filter and score them for relevance
- Download full texts when possible
- Generate an answer with citations
At the bottom of each response, you'll see papers found during research. Click "Add to KB" to save them to your knowledge base.
When Perspicacité finds papers during research, click the "Add to KB" button on any paper you want to save.
If you have a reference manager (Zotero, Mendeley, EndNote):
- Export your references as BibTeX (.bib file)
- In Perspicacité, click "+ Create new KB"
- Drag-and-drop your
.bibfile into the drop zone - Enter a KB name and click "Create from BibTeX"
Coming soon: Upload PDFs directly through the web interface.
For now, use the command line:
# Add a single PDF to a KB
perspicacite add-pdf <kb-name> <path-to-pdf.pdf>
# Add all PDFs in a folder
perspicacite add-pdf <kb-name> <folder-path>/Your conversations are saved automatically. You can:
- Click any previous chat to resume it
- Click 🗑️ next to "Chat History" to clear all history
- Start a "New Chat" anytime with the button
Hover over your selected KB to see:
- Number of papers stored
- Description
- Creation date
When Perspicacité is working, it shows:
- What it's doing (searching, filtering, downloading)
- Progress on paper downloads
- Which sources it's using
✅ Good questions:
- "What are the antioxidant properties of green tea catechins?"
- "How does FBMN compare to traditional molecular networking?"
- "What are recent advances in transformer models for medical imaging?"
❌ Questions to avoid:
- "Tell me about tea" (too broad)
- "What is the meaning of life?" (not research-related)
- "Write me an essay" (Perspicacité summarizes research, doesn't write original content)
- Keep KBs focused: Create separate KBs for different projects
- Add papers gradually: Start with 10-20 key papers, expand as needed
- Review relevance: Perspicacité scores papers - pay attention to high-scoring ones
- Basic: You have a well-curated KB and want quick answers
- Advanced: Your KB might need broader search
- Profound: Complex questions needing multiple perspectives
- Agentic: Questions requiring web search beyond your KB
- Literature Survey: Mapping a research field with AI-identified themes and recommended papers
The Literature Survey mode helps you systematically map a research field:
- Searches multiple academic databases (Semantic Scholar, OpenAlex, PubMed, arXiv, etc.)
- Analyzes paper abstracts with AI to assess relevance
- Identifies research themes automatically from key concepts
- Recommends papers based on citations, recency, and theme diversity
- Lets you select papers to add to your Knowledge Base
How to use:
- Select "📖 Literature Survey" mode
- Ask a broad research question (e.g., "What are the recent advances in CRISPR gene editing?")
- Wait while it searches and analyzes (may take 1-2 minutes)
- Review papers grouped by AI-identified themes
- Select papers and click "Add Selected to KB" to save them
Note: Literature Survey requires papers to have abstracts for quality analysis.
You need to set up at least one LLM API key in your .env file. See Step 2 above.
Not all papers are freely accessible. Perspicacité tries:
- Unpaywall (open access database)
- arXiv (preprint server)
- Direct publisher access
- Alternative sources (if configured)
- Try "Basic" or "Advanced" mode instead of "Agentic"
- Check your internet connection
- DeepSeek API can be slow during peak times
Make sure papers have:
- A valid title
- Author information
- Year of publication
Knowledge base names must:
- Be unique
- Contain only letters, numbers, hyphens, and underscores
- Not be empty
Edit config.yml to change the AI provider:
llm:
default_provider: "openai" # or "anthropic", "deepseek"
default_model: "gpt-4" # or "claude-3", "deepseek-chat"For better PDF access, you can:
-
Set your email for Unpaywall (already done in default config):
pdf_download: unpaywall_email: "your-email@example.com"
-
Add publisher API keys (optional, for institutional access):
pdf_download: wiley_tdm_token: "your-token" elsevier_api_key: "your-key"
Click the 🌙/☀️ button in the top right to toggle between light and dark mode.
- Your data stays local: Knowledge bases are stored on your computer
- API calls only: Questions are sent to AI providers (DeepSeek, OpenAI, etc.)
- No tracking: We don't collect usage data
- Your papers: PDFs you add stay in your local database
- See
CONTRIBUTING.mdfor contribution guidelines
Problem: App won't start
Solution: Check that port 8000 isn't already in use, or change it in config.yml
Problem: "Module not found" errors
Solution:
- If you used
uv: Run withuv run python web_app_full.py(not justpython) - If you used
pip: Make sure you ranpip install -e ".[dev]"
Problem: "command not found: uv"
Solution: Install uv from https://github.com/astral-sh/uv or use pip instead
Problem: AI responses are slow
Solution: Try a different mode (Basic is fastest) or check your internet connection
uv is a modern Python package manager that's faster than pip:
- It automatically creates a virtual environment
- You run commands with
uv run python ... - Recommended for new users
pip is the traditional Python installer:
- You need to manage virtual environments yourself
- You run commands with
python ... - Works everywhere, good if you have issues with uv
Both work fine - use whichever you prefer!
Perspicacité is open source! If you find bugs or want to suggest features:
- Check existing issues first
- Create a new issue with:
- What you were trying to do
- What happened instead
- Your operating system and Python version
For contribution workflow and contributor agreement details, see CONTRIBUTING.md.
This repository is distributed under the Apache License 2.0.
See LICENSE for the full license text and NOTICE for attribution information.
Perspicacité v2 builds on:
- ChromaDB for vector storage
- OpenAlex for academic search
- DeepSeek/OpenAI/Anthropic for AI models
- Unpaywall for open access papers
Perspicacité builds on the following components:
Original Perspicacité AI (ISWC-C 2025 Demo Paper):
@inproceedings{pradi2025perspicacite,
title = {An AI Pipeline for Scientific Literacy and Discovery: a Demonstration of Perspicacit\\'{e}-AI Integration with Knowledge Graphs},
author = {Pradi, Lucas and Jiang, Tao and Feraud, Matthieu and Bekbergenova, Madina and Taghzouti, Yousouf and Nothias, Louis-Felix},
booktitle = {Joint Proceedings of Industry, Doctoral Consortium, Posters and Demos of the 24th International Semantic Web Conference (ISWC-C 2025)},
pages = {462--467},
year = {2025},
month = nov,
address = {Nara, Japan},
url = {https://hal.science/hal-05290005}
}SciLEx - Science Literature Exploration Toolkit:
@softwareversion{scilex2026,
TITLE = {{SciLEx, Science Literature Exploration Toolkit}},
AUTHOR = {Ringwald, C\\'{e}lian and Navet, Benjamin},
URL = {https://github.com/Wimmics/SciLEx},
INSTITUTION = {{University C\\^{o}te d'Azur ; CNRS ; Inria}},
YEAR = {2026},
MONTH = Fev,
SWHID = {swh:1:dir:944639eb0260a034a5cbf8766d5ee9b74ca85330},
VERSION = {1.0},
REPOSITORY = {https://github.com/Wimmics/SciLEx},
LICENSE = {MIT Licence},
}Ready to start? Run python web_app_full.py and ask your first question!