A self-hosted platform for conducting AI-powered conversational interviews at scale.
Built for survey researchers, qualitative methodologists, and anyone who needs structured or open-ended data collection through voice or text.
Website · Docs · About · License
OASIS is an open-source research platform that lets you run conversational AI interviews from your own infrastructure. You define the study, configure an agent with a system prompt and model of your choice, and share a link (or phone number) with participants. Transcripts are diarized and stored in your database. You stay in control of the data, the models, and the entire pipeline.
It was built out of a simple frustration: existing commercial tools for conversational AI are powerful, but they are not designed with research methodology in mind. Things like follow-up probing, semi-structured interview guides, participant identifiers, and study-level organization tend to be afterthoughts. OASIS puts those features front and center.
1. Setup a Study
demo1-fast.mp4
2. Create and Configure an Agent
demo2-fast.mp4
3. Collect and Manage Data
demo3-fast.mp4
Two interview modalities
- Voice interviews (real-time speech with STT, LLM, and TTS)
- Text chat interviews (clean chat UI with customizable avatars)
Flexible pipeline architecture
- Modular pipeline: chain any STT, LLM, and TTS provider independently
- Voice-to-voice pipeline: stream audio directly to multimodal models (OpenAI Realtime, Gemini Live)
Semi-structured interview mode
- Define a question guide with preset questions, follow-up probes, and transition logic
- The agent follows a structured backbone while still allowing natural conversation
- Upload interview guides via CSV or JSON, or configure them in the dashboard
Multi-provider model support
- OpenAI (GPT-4o, GPT-4.1, GPT-5, Realtime models)
- Google Gemini (including native audio preview)
- Scaleway (European-hosted, OpenAI-compatible)
- Azure OpenAI and GCP Vertex AI (self-hosted endpoints)
- Any LiteLLM-compatible provider, plus a custom model identifier option
STT and TTS providers
- Deepgram, Scaleway Whisper (STT)
- ElevenLabs, Cartesia, Scaleway (TTS)
Research-first design
- Study-level organization with multiple agents per study
- Participant identifiers: random, predefined lists, or self-reported
- Diarized transcripts with timestamps
- Session analytics and data export
- Knowledge base (RAG) with pgvector for document-grounded conversations
Telephony support (beta)
- Twilio Media Streams integration for phone-based interviews
- Currently supports incoming calls
Self-hosted and private
- Everything runs in Docker on your own machine or server
- No data leaves your infrastructure unless you explicitly configure external API calls
- Optional basic authentication for the admin dashboard
- API keys configurable via
.envor the dashboard settings page
OASIS runs as five Docker containers on a single internal network:
| Container | Stack | Role |
|---|---|---|
| Caddy | Caddy 2 | Reverse proxy, automatic HTTPS |
| Frontend | React, Tailwind CSS, Nginx | Admin dashboard and participant interview widget |
| Backend | FastAPI, Pipecat, LiteLLM | WebSocket audio/text transport, AI pipeline orchestration, REST API |
| PostgreSQL | pgvector/pg16 | Agent configs, transcripts, participant data, vector embeddings |
| Redis | Redis 7 | Session tracking, real-time transcript pub/sub, API key overrides |
┌──────────────────────────────────────────────────────────┐
│ Caddy (ports 80/443) │
│ ┌──────────┬──────────┐ │
│ │ Frontend │ Backend │ │
│ │ (React) │ (FastAPI)│ │
│ └────┬─────┴────┬─────┘ │
│ │ │ │
│ ┌────┴────┐ ┌───┴───┐ │
│ │PostgreSQL│ │ Redis │ │
│ └─────────┘ └───────┘ │
└──────────────────────────────────────────────────────────┘
OASIS exists to fill a gap. Commercial conversational AI platforms are built for customer support, sales, and marketing. They are good at what they do, but research has different requirements:
- Methodological control. Researchers need semi-structured guides, probing logic, and participant tracking built into the tool, not bolted on.
- Transparency. When you publish a study, reviewers and readers should be able to understand exactly what system was used, what models were called, and where data was stored.
- Affordability. Academic budgets are not enterprise budgets. Self-hosting with pay-as-you-go API keys is often the only realistic option.
- Data sovereignty. Especially in Europe, running your own infrastructure is not a nice-to-have. It is a compliance requirement.
The long-term goal is for OASIS to become a standard open-source tool that survey and qualitative researchers can rely on, the same way they rely on tools like Qualtrics for traditional surveys but with conversational AI as the medium.
Contributions are very welcome. If you are interested in contributing, improving the platform, or building on it for your own research, please reach out first. Open an issue or send an email to max.lang@stx.ox.ac.uk.
For feature requests and bug reports, please use GitHub Issues.
OASIS is released under the Open Non-Commercial Research License (ONCRL) v1.0.
You are free to use, modify, and share OASIS for non-commercial research purposes. If you want to use it in a funded project, a commercial product, or any context that goes beyond personal or academic exploration, you need written approval from the copyright holder.
See the full LICENSE file for details.
