Abstract Vehicle Control System

A multi-modal vehicle control platform that interprets natural language commands and converts them into actionable vehicle control sequences through advanced NLP, computer vision, and speech processing.

📋 Table of Contents

  • Features
  • Project Structure
  • Requirements
  • Installation
  • Usage
  • Architecture
  • Model Information
  • Data Format
  • Contributing
  • License
  • Contact

✨ Features

1. Natural Language Processing

  • Parse natural language vehicle commands using fine-tuned T5 models
  • Multi-task learning for plan generation, intent classification, and slot detection
  • Special token support for structured command parsing

2. Plan Normalization

  • Validate and normalize parsed plans for consistency
  • Convert raw language into standardized command formats

3. Token Assembly

  • Convert normalized plans into executable token sequences
  • Structured output for vehicle control systems

4. Computer Vision

  • Real-time object detection using YOLOv8
  • Distance calculation between vehicle and detected objects
  • Direction detection for spatial awareness

5. Speech Recognition

  • Voice-to-text conversion for hands-free control
  • Integration with the command pipeline (see the sketch below)
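
The repository's speech_recognition/speech_recognition.py is the authoritative implementation; the following is only a minimal sketch of voice capture with the SpeechRecognition library, assuming the pipeline accepts a plain-text command string:

import speech_recognition as sr

recognizer = sr.Recognizer()
with sr.Microphone() as source:
    recognizer.adjust_for_ambient_noise(source)  # calibrate for background noise
    audio = recognizer.listen(source)

# Google's free web API; swap in another recognize_* backend as needed
command_text = recognizer.recognize_google(audio)
print(command_text)  # e.g. "move forward 5 meters", fed into the NLP pipeline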

6. Speed & Duration Mapping

  • Maps parsed command parameters to concrete vehicle dynamics values
  • Speed scaling and duration estimation for each command (see the sketch below)
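
A minimal sketch of what such a mapper might look like; the lookup table, the assumed 2 m/s top speed, and the function name are illustrative, not the actual logic in speed_duration_mapping/:

# Hypothetical lookup table: qualitative speed word -> normalized throttle
SPEED_LEVELS = {"slow": 0.3, "normal": 0.6, "fast": 1.0}
MAX_SPEED_MPS = 2.0  # assumed top speed of the vehicle in meters per second

def map_speed_and_duration(speed_word: str, distance_m: float) -> tuple[float, float]:
    """Map a qualitative speed word and a target distance to (throttle, seconds)."""
    throttle = SPEED_LEVELS.get(speed_word, SPEED_LEVELS["normal"])
    duration_s = distance_m / (throttle * MAX_SPEED_MPS)
    return throttle, duration_s

throttle, duration_s = map_speed_and_duration("fast", 5.0)  # (1.0, 2.5)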

7. Interactive Frontend

  • Real-time visualization of parsed commands
  • View parsed plans, normalized outputs, and assembled tokens
  • User-friendly interface for system interaction

πŸ“ Project Structure

Abstract-Vehicle-Control/
├── NLP_modeling/              # NLP model training and inference
│   ├── main.py                # Main training script
│   ├── t5_multitask.py        # Multi-task T5 model
│   ├── models/                # Trained model checkpoints
│   └── multitask_tokenizer/   # Custom tokenizer
├── normalization_module/      # Plan normalization
│   ├── normalization.py
│   └── model_evaluation.py
├── object_distance_detection/ # Computer vision module
│   ├── yolo.py                # YOLOv8 integration
│   ├── o_d_d.py               # Distance detection
│   └── object direction.py    # Direction detection
├── speech_recognition/        # Voice input processing
│   └── speech_recognition.py
├── speed_duration_mapping/    # Parameter mapping
│   ├── speed_mapper.py
│   └── duration_mapper.py
├── frontend/                  # React TypeScript UI
│   ├── src/
│   │   ├── App.tsx            # Main application
│   │   ├── App.css
│   │   └── assets/CarSimulation.tsx
│   ├── package.json
│   └── vite.config.ts
├── data/                      # Training datasets
│   ├── train.jsonl
│   └── val.jsonl
├── diagnostic_script.py       # System diagnostics
├── token_assembler.py         # Token assembly utility
└── requirements.txt           # Python dependencies

🔧 Requirements

  • Python 3.8+
  • Node.js 16+ (for frontend)
  • PyTorch 2.0+
  • Transformers library
  • CUDA-capable GPU (recommended)

📦 Installation

Backend Setup

# Clone the repository
git clone <repository-url>
cd Abstract-Vehicle-Control

# Create Python virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install Python dependencies
pip install -r requirements.txt

Frontend Setup

cd frontend

# Install dependencies
npm install

# Build frontend
npm run build

🚀 Usage

Running the NLP Model

cd NLP_modeling
python main.py

Running the Full Pipeline

python diagnostic_script.py

Starting the Frontend

cd frontend
npm run dev

The application will be available at http://localhost:5173

πŸ—οΈ Architecture

NLP Pipeline

  1. Input: Natural language command
  2. Processing: T5 multi-task model with adapters
  3. Output: Parsed plan with intent, slots, and structure
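
A minimal inference sketch with Hugging Face Transformers, assuming the checkpoint and tokenizer directories match the project tree above (the exact paths and output format may differ):

from transformers import AutoTokenizer, T5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("NLP_modeling/multitask_tokenizer")
model = T5ForConditionalGeneration.from_pretrained("NLP_modeling/models")

inputs = tokenizer("move forward 5 meters", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=64)

# Keep special tokens so the <plan>/<intent> structure stays visible
print(tokenizer.decode(output_ids[0], skip_special_tokens=False))
# e.g. <plan> MOVE FORWARD 5 </plan> <intent> movement </intent>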

Normalization Stage

  1. Input: Parsed plan
  2. Processing: Validation and normalization rules
  3. Output: Standardized plan format
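
The real rules live in normalization_module/normalization.py; below is a minimal sketch of the idea, assuming an "ACTION DIRECTION VALUE" plan string (the rule table and function name are illustrative):

# Hypothetical canonicalization table for direction synonyms
CANONICAL_DIRECTIONS = {"forwards": "FORWARD", "ahead": "FORWARD", "back": "BACKWARD"}

def normalize_plan(raw_plan: str) -> str:
    """Uppercase fields, canonicalize the direction, and validate the distance."""
    action, direction, value = raw_plan.split()
    direction = CANONICAL_DIRECTIONS.get(direction.lower(), direction.upper())
    if float(value) < 0:
        raise ValueError(f"negative distance in plan: {raw_plan!r}")
    return f"{action.upper()} {direction} {value}"

normalize_plan("move ahead 5")  # 'MOVE FORWARD 5'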

Token Assembly

  1. Input: Normalized plan
  2. Processing: Convert to executable tokens
  3. Output: Token sequence for vehicle control
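
token_assembler.py holds the actual scheme; as an illustration only, assembly can be as simple as wrapping the normalized plan fields in framing tokens (the token names below are hypothetical):

def assemble_tokens(normalized_plan: str) -> list[str]:
    """Wrap a normalized plan's fields in start/end control tokens."""
    return ["<START>", *normalized_plan.split(), "<END>"]

assemble_tokens("MOVE FORWARD 5")
# ['<START>', 'MOVE', 'FORWARD', '5', '<END>']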

Multi-Modal Integration

  • Vision: YOLOv8 for object detection and distance measurement
  • Audio: Speech-to-text for voice commands
  • Dynamics: Speed/duration mapping for realistic execution
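
A minimal detection-plus-distance sketch using the ultralytics API; the pinhole-camera constants (assumed object height and focal length) are placeholders, and the repo's o_d_d.py may estimate distance differently:

from ultralytics import YOLO

KNOWN_HEIGHT_M = 1.5  # assumed real-world height of the detected object
FOCAL_PX = 700.0      # assumed camera focal length in pixels

model = YOLO("yolov8n.pt")    # nano variant; the small variant is yolov8s.pt
results = model("frame.jpg")  # also accepts numpy frames from a camera feed

for box in results[0].boxes:
    x1, y1, x2, y2 = box.xyxy[0].tolist()
    label = model.names[int(box.cls)]
    # Pinhole estimate: distance scales inversely with the box's pixel height
    distance_m = KNOWN_HEIGHT_M * FOCAL_PX / max(y2 - y1, 1.0)
    print(f"{label}: ~{distance_m:.1f} m")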

📊 Model Information

  • Base Model: T5 (Text-to-Text Transfer Transformer)
  • Vision Model: YOLOv8 (nano, small variants)
  • Fine-tuning: LoRA and adapter-based approaches for parameter efficiency
  • Custom Tokens: <plan>, </plan>, <intent>, </intent>, <none>
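
Registering these custom tokens with a Transformers tokenizer is a standard step; a sketch, assuming t5-small as the base checkpoint (the actual base size is not specified here):

from transformers import AutoTokenizer, T5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

tokenizer.add_special_tokens(
    {"additional_special_tokens": ["<plan>", "</plan>", "<intent>", "</intent>", "<none>"]}
)
model.resize_token_embeddings(len(tokenizer))  # grow embeddings to match the new vocab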

🔄 Data Format

Training Data (JSONL)

{
  "command": "move forward 5 meters",
  "plan": "MOVE FORWARD 5",
  "intent": "movement",
  "slots": ["direction: forward", "distance: 5"]
}
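
Each line of train.jsonl / val.jsonl is one standalone JSON object, so loading them takes only a few lines (the helper below is illustrative, not part of the repo):

import json

def load_jsonl(path: str) -> list[dict]:
    """Read one JSON object per line, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]

train = load_jsonl("data/train.jsonl")
print(train[0]["command"], "->", train[0]["plan"])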

🤝 Contributing

Contributions are welcome! Please feel free to submit pull requests or open issues for bugs and feature requests.

πŸ“ License

[Add your license information here]

📧 Contact

[Add contact information here]


Note: Ensure all required pre-trained models (YOLOv8n.pt, YOLOv8s.pt) are in the project root before running the system.
