Sailors Parrot: Multimodal Sailing Assistant

A sophisticated AI-powered chatbot designed to help new sailors understand the ways of the sea. This assistant combines the wisdom of an experienced sea captain with modern retrieval-augmented generation (RAG) technology and multimodal capabilities to provide comprehensive, accurate information about sailing and boating.

Features

Multimodal Interaction: Process both text queries and images to provide comprehensive sailing advice
Retrieval-Augmented Generation: Pulls information from sailing forums and reference materials
Visual Search: Analyze sailing-related images to provide context-specific information
Nautical Persona: Responds with the voice and expertise of a seasoned sea captain
Markdown Formatting: Delivers well-structured, easy-to-read responses
Conversation Memory: Maintains context throughout the conversation
Forum Topic Classification: Automatically categorizes queries by relevant sailing topics
Platform Adaptability: Optimized for both Apple Silicon and GPU environments

Architecture

The system is built on a modern microservices architecture with these key components:

Retriever: Core component that handles document retrieval and response generation
Combined Services: Standalone service that provides a unified API for:
- Visual Search: Processes and indexes images for visual search capabilities
- ChromaDB: Manages the vector database for semantic search of forum content
LangChain Integration: Leverages LangChain for prompt templates and LLM interactions
Multimodal LLM: Uses advanced language models capable of processing both text and images
Platform Abstraction: Automatically adapts to different hardware environments

This microservices architecture improves resource efficiency and startup time by:

Loading resource-intensive components (visual index and ChromaDB) only once in the dedicated service
Allowing the main application to start faster
Reducing memory usage in the main application
Enabling better scalability and separation of concerns
Providing a unified API for all data retrieval operations

For more details on the architecture, see ARCHITECTURE.md.

Installation

# Clone the repository
git clone https://github.com/yourusername/sea-captain.git
cd sea-captain

# Create and activate a conda environment
conda create -n sea-captain python=3.11
conda activate sea-captain

# Install dependencies
pip install -r requirements.txt

# Install custom Byaldi module
pip install -e custom_modules/byaldi

# Set up Python path
export PYTHONPATH=$PYTHONPATH:$(pwd)

# Set up environment variables
cp .env.example .env
# Edit .env with your API keys and configuration

Usage

Starting the Application

Start the Corpus Service:

# Start the Corpus Service
python service/run_corpus_service.py

Then, in a separate terminal, start the main application:

# Start the main application
python src/main.py

The Combined Services will be available at http://localhost:8081 The main application will be available at http://localhost:4001

Example Queries

The assistant can handle a variety of sailing-related questions:

"What's the best way to tie a bowline knot?"
"How do I read a nautical chart?"
"What should I do if I encounter rough weather while sailing?"
[Upload an image of sailing equipment] "What is this and how do I use it?"
"What safety equipment should I have on board for coastal cruising?"

Configuration

The system is highly configurable through the RetrieverConfig class:

Adjust retrieval parameters (number of documents, window sizes)
Modify the system prompt and query templates
Configure visual search parameters
Set forum collection names and result limits

Dependencies

LangChain: For RAG implementation and LLM interaction
ChromaDB: Vector database for semantic search
PIL/Pillow: Image processing
Byaldi: Custom multimodal RAG implementation
Google Gemini or similar multimodal LLM

Development

Project Structure

.
├── src/
│   ├── app.py                # Main application entry point
│   ├── retriever.py          # Core retrieval and generation logic
│   ├── models.py             # Data models and state management
│   ├── session_manager.py    # User session handling
│   └── visual_index/         # Image processing and visual search
│       ├── search.py         # Visual search implementation
│       └── index_provider.py # Platform-adaptive index management
├── custom_modules/           # Custom extensions
│   └── byaldi/               # Custom multimodal RAG implementation
├── .env                      # Environment variables
├── Dockerfile                # CPU-optimized Docker configuration
├── Dockerfile.gpu            # GPU-optimized Docker configuration
├── docker-compose.yml        # Docker Compose for CPU environments
├── docker-compose.gpu.yml    # Docker Compose for GPU environments
├── ARCHITECTURE.md           # Detailed architecture documentation
├── CHANGES.md                # Summary of platform adaptability changes
├── GPU_SETUP.md              # GPU-specific setup instructions
└── README.md                 # This file

Adding New Features

To extend the assistant's capabilities:

Add new forum topics in models.py (ForumTopic)
Enhance the retriever with additional data sources
Improve the visual search capabilities
Customize the system prompt for different use cases

Docker Deployment

This application can be run in a Docker container for easier deployment and consistency across environments. We provide two different Docker configurations:

CPU Mode (for Apple Silicon and other CPU-only environments)
GPU Mode (for environments with NVIDIA GPUs)

Prerequisites

Docker and Docker Compose installed on your system
Google API key for the language model
For GPU mode: NVIDIA GPU with CUDA support and NVIDIA Container Toolkit installed

Running on Apple Silicon (or CPU-only systems)

# Build and start the container in CPU mode
docker-compose up -d

This will:

Build the Docker image optimized for CPU usage
Mount the data directories as volumes
Start the application on port 8000

Running on GPU Systems

# Build and start the container in GPU mode
docker-compose -f docker-compose.gpu.yml up -d

This will:

Build the Docker image optimized for GPU usage
Configure the container to use NVIDIA GPUs
Mount the data directories as volumes
Start the application on port 8000

For detailed GPU setup instructions, see GPU_SETUP.md.

Accessing the Application

Access the application at http://localhost:8000

Environment Variables

You can set environment variables in the following ways:

In a .env file in the project root (for docker-compose)
Directly in the docker-compose.yml file
By passing them to the docker-compose up command:

GOOGLE_API_KEY=your_key_here CHAINLIT_AUTH_SECRET=your_secret docker-compose up -d

Data Persistence

The Docker setup uses volume mounts to persist data:

./data:/app/data - Application data
./.byaldi:/app/.byaldi - Byaldi index
./chroma_db:/app/chroma_db - ChromaDB data

This ensures that your data remains intact even if the container is removed.

Platform-Specific Optimizations

Apple Silicon / CPU-only Environments

The application includes specific optimizations for Apple Silicon:

Uses memory-mapped tensors for efficient memory usage
Forces CPU usage to avoid CUDA initialization errors
Applies custom patches to optimize for Apple Silicon

GPU Environments

For systems with NVIDIA GPUs, the application:

Uses CUDA acceleration for faster processing
Configures memory allocation settings for optimal performance
Leverages GPU memory for model inference

Checking Your Environment

Run the environment checker script to determine the best configuration for your system:

python check_environment.py

This will analyze your system and recommend the appropriate Docker configuration.

License

[Specify your license here]

Contributing

[Guidelines for contributing to the project]

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.chainlit		.chainlit
custom_modules/byaldi		custom_modules/byaldi
public		public
scripts		scripts
services		services
src		src
tests		tests
.cursorignore		.cursorignore
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
CHANGES.md		CHANGES.md
GPU_SETUP.md		GPU_SETUP.md
README.md		README.md
SETUP.md		SETUP.md
SUMMARY.md		SUMMARY.md
chainlit.md		chainlit.md
chainlit.yaml		chainlit.yaml
environment.yml		environment.yml
pytest.ini		pytest.ini
requirements.backup.txt		requirements.backup.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sailors Parrot: Multimodal Sailing Assistant

Features

Architecture

Installation

Usage

Starting the Application

Example Queries

Configuration

Dependencies

Development

Project Structure

Adding New Features

Docker Deployment

Prerequisites

Running on Apple Silicon (or CPU-only systems)

Running on GPU Systems

Accessing the Application

Environment Variables

Data Persistence

Platform-Specific Optimizations

Apple Silicon / CPU-only Environments

GPU Environments

Checking Your Environment

License

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

celtiberi/sail-chat

Folders and files

Latest commit

History

Repository files navigation

Sailors Parrot: Multimodal Sailing Assistant

Features

Architecture

Installation

Usage

Starting the Application

Example Queries

Configuration

Dependencies

Development

Project Structure

Adding New Features

Docker Deployment

Prerequisites

Running on Apple Silicon (or CPU-only systems)

Running on GPU Systems

Accessing the Application

Environment Variables

Data Persistence

Platform-Specific Optimizations

Apple Silicon / CPU-only Environments

GPU Environments

Checking Your Environment

License

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages