Reasona is a modular AI/ML pipeline framework for streaming, preprocessing, embedding, indexing, retrieval, reranking, and inference on large-scale datasets. It is built around streaming-first workflows for efficient end-to-end processing, combines vector-based retrieval with transformer-based inference, and ships a Flask web app for interactive use.
- Streaming Data Ingestion: Stream large datasets directly from [Hugging Face Datasets] or Wikimedia without downloading entire files.
- Data Cleaning & Transformation: Remove duplicates, handle missing values, and convert to instruction-based JSON for embedding.
- Chunking & Embedding: Split long documents into configurable chunks with optional overlap; embed using [SentenceTransformers].
- Vector Indexing: Store embeddings in [FAISS] for fast similarity search.
- Retrieval & Reranking: Retrieve top-k results via FAISS and optionally rerank results using transformer-based models.
- Inference: Generate answers using pretrained models with configurable parameters.
- Flask Web App: Interactive interface for querying Reasona pipelines, managing chats, and visualizing results.
- Scalable & Configurable: Centralized YAML configuration for all pipelines.
- Logging & Monitoring: JSON logs and runtime metrics for all stages, including checkpoints and progress tracking.
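The chunking-with-overlap step above can be sketched as follows. This is an illustrative word-based splitter, not Reasona's actual implementation; the function name and parameters are assumptions:

```python
def chunk_text(text, chunk_size=256, overlap=32):
    """Split text into word-based chunks of up to chunk_size tokens,
    repeating `overlap` tokens between consecutive chunks so that
    context is not lost at chunk boundaries."""
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break
    return chunks

# 10 words, chunks of 4 with overlap of 1 -> step of 3
print(chunk_text("a b c d e f g h i j", chunk_size=4, overlap=1))
# -> ['a b c d', 'd e f g', 'g h i j']
```

Each chunk would then be embedded independently, with the overlap keeping sentences that straddle a boundary retrievable from at least one chunk.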
Reasona/
│
├── src/
│ └── Reasona/
│ ├── config/
│ │ ├── config_manager.py
│ │ └── params.yaml
│ ├── data/
│ │ ├── loader.py
│ │ ├── cleaner.py
│ │ ├── formatter.py
│ │ ├── chunker.py
│ │ └── embedder.py
│ ├── pipeline/
│ │ ├── preprocess_pipeline.py # Streaming + preprocessing
│ │ ├── indexing_pipeline.py # Chunking + embedding + FAISS
│ │ ├── reranking_pipeline.py # Transformer reranking
│ │ ├── inference_pipeline.py # Inference & retrieval
│ │ └── training_pipeline.py # Qwen/Qwen2.5-1.5B-Instruct
│ ├── vectorstore/
│ │ └── faiss_store.py
│ ├── services/
│ │ └── reasona_service.py
│ └── utils/
│ ├── logger.py
│ └── helpers.py
├── config/
│ ├── config.yaml
│ └── params.yaml
├── artifacts/
├── logs/
├── main.py
├── app.py
└── README.md
- Clone the repository:
git clone https://github.com/louayamor/Reasona.git
cd Reasona
- Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # Linux/macOS
venv\Scripts\activate # Windows
- Install dependencies:
pip install -r requirements.txt
Control pipelines via config/config.yaml and config/params.yaml. Key sections:
- preprocess: dataset, split, max samples, batch size, shuffle, prefetch buffer.
- indexing: embedding model, chunk size/overlap, batch size, queue size, vector store directory, checkpoint frequency.
- reranking: reranker model, tokenizer, top-k reranking.
- retrieval: top-k results, embedding model, vector store path.
- inference: model path, tokenizer path, engine type, max tokens, temperature.
- flask: host, port, debug mode.
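A configuration following the sections above might look like this. This is a sketch of the shape only; the actual key names and defaults in config/config.yaml may differ:

```yaml
# Illustrative shape only -- key names may differ from the shipped config.
preprocess:
  dataset: wikimedia/wikipedia
  split: train
  max_samples: 100000
  batch_size: 64
indexing:
  embedding_model: sentence-transformers/all-MiniLM-L6-v2
  chunk_size: 256
  chunk_overlap: 32
  vectorstore_dir: artifacts/faiss
retrieval:
  top_k: 5
inference:
  max_tokens: 512
  temperature: 0.7
flask:
  host: 0.0.0.0
  port: 5000
  debug: false
```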
python main.py
Pipeline flow:
- Preprocessing Pipeline (Producer)
  - Streams data from Hugging Face or Wikimedia.
  - Cleans, formats, and converts to instruction-based JSON.
  - Stops at max_samples.
- Indexing Pipeline (Consumer)
  - Chunks data.
  - Embeds chunks using SentenceTransformers.
  - Stores vectors in FAISS with checkpoints.
Pipelines communicate via queues to handle large datasets efficiently.
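The producer/consumer pattern described above can be sketched with Python's standard library. The cleaning and indexing steps are stand-ins, and the sentinel convention is an assumption, not Reasona's actual wiring:

```python
import queue
import threading

SENTINEL = None  # signals the end of the stream

def producer(q, samples, max_samples):
    """Preprocessing side: push cleaned samples, stop at max_samples."""
    for i, sample in enumerate(samples):
        if i >= max_samples:
            break
        q.put(sample.strip().lower())  # stand-in for real cleaning/formatting
    q.put(SENTINEL)

def consumer(q, out):
    """Indexing side: consume until the sentinel arrives."""
    while True:
        item = q.get()
        if item is SENTINEL:
            break
        out.append(item)  # stand-in for chunk + embed + index

q = queue.Queue(maxsize=8)  # bounded queue applies backpressure to the producer
results = []
t = threading.Thread(target=consumer, args=(q, results))
t.start()
producer(q, ["  Alpha ", " Beta ", " Gamma "], max_samples=2)
t.join()
print(results)  # -> ['alpha', 'beta']
```

A bounded queue is the key design point: when indexing falls behind, `q.put` blocks, so the producer never buffers the whole dataset in memory.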
python src/Reasona/pipeline/reranking_pipeline.py
python src/Reasona/pipeline/inference_pipeline.py
- Retrieve top-k results from FAISS.
- Optionally rerank using transformer models.
- Generate answers or code snippets.
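The top-k retrieval step can be illustrated without FAISS at all: FAISS accelerates exactly this brute-force similarity search. The function and index layout below are assumptions for illustration only:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve_top_k(query_vec, index, k=2):
    """Brute-force stand-in for a FAISS similarity search:
    score every stored vector and return the k best document ids."""
    scored = [(cosine(query_vec, vec), doc_id) for doc_id, vec in index.items()]
    scored.sort(reverse=True)
    return [doc_id for _, doc_id in scored[:k]]

index = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}
print(retrieve_top_k([1.0, 0.0, 0.0], index, k=2))  # -> ['doc_a', 'doc_b']
```

The optional reranking stage would then re-score just these k candidates with a heavier transformer model, which is why top-k retrieval comes first.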
python app.py
- Access the interactive web interface at http://localhost:5000.
- Query datasets, retrieve top-k results, and perform inference.
- Hugging Face Datasets (streamable)
- wikimedia/wikipedia
- JSON-based logs:
  - logs/pipeline/preprocess_pipeline.json
  - logs/pipeline/indexing_pipeline.json
  - logs/pipeline/reranking_pipeline.json
  - logs/pipeline/inference_pipeline.json
- Logs include progress, checkpoints, runtime, and embedding statistics.
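JSON logs can be summarized with a few lines of Python. The field names below (stage, processed, runtime_s) are hypothetical, not Reasona's actual log schema:

```python
import json

# Hypothetical entries -- the real schema under logs/pipeline/ may differ.
entries = [
    {"stage": "preprocess", "processed": 1000, "runtime_s": 12.4},
    {"stage": "indexing", "processed": 1000, "runtime_s": 48.9},
]

for entry in entries:
    print(f"{entry['stage']}: {entry['processed']} samples in {entry['runtime_s']}s")

total = sum(e["runtime_s"] for e in entries)
print(f"total runtime: {total:.1f}s")  # -> total runtime: 61.3s

# In practice you would read the entries from a log file instead:
# with open("logs/pipeline/preprocess_pipeline.json") as f:
#     entries = json.load(f)
```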