Genify - Advanced Data Generation Platform

🚧 Work in Progress - Currently Under Development 🚧

Genify is a powerful and scalable data generation platform that creates realistic mock datasets for testing, development, and data science applications. Built with a modern React frontend and FastAPI backend, supporting 15+ data types and capable of generating up to 100,000+ records.

✨ Features

🎯 Core Capabilities

📊 Massive Scale: Generate datasets from 1 to 100,000+ records
🎨 Modern UI: Beautiful, responsive interface with dark/light theme support
⚡ Real-time Preview: See your data before generating the full dataset
🔧 Advanced Schema Generation: AI-powered schema creation from natural language descriptions
📱 Mobile Responsive: Works seamlessly on all device sizes

📋 Supported Data Types

📖 View Complete Data Types Reference - Comprehensive guide to all the available data generation categories

🏗️ Architecture

Backend (FastAPI)

Framework: FastAPI with Python
Data Generation: Faker, NumPy, Custom algorithms
Validation: Pydantic models
File Operations: CSV export with configurable options

Frontend (React)

Framework: React 19 with Vite
Styling: Tailwind CSS 4.x
State Management: React hooks and context
UI Components: Custom component library

🚀 Getting Started

Prerequisites

Python 3.8+
Node.js 18+
npm or yarn

Installation

Clone the Genify repository

git clone https://github.com/Jousure/Genify.git
cd Genify

Backend Setup

cd backend
pip install -r requirements.txt

Frontend Setup
```
cd frontend
npm install
```

Running the Application

Start the Backend Server

cd backend
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Start the Frontend Development Server
```
cd frontend
npm run dev
```
Access the Application
- Frontend: http://localhost:5173
- Backend API: http://localhost:8000
- API Documentation: http://localhost:8000/docs

📖 Usage

Basic Data Generation

Describe your dataset in natural language (e.g., "Generate customer data with names, emails, and phone numbers")
Set the number of records you want to generate
Preview the generated data
Download as CSV

Advanced Features

Custom Schemas: Define specific column types and constraints
Data Validation: Ensure generated data meets your requirements
Batch Generation: Generate multiple datasets in parallel
Export Options: Multiple formats (CSV, JSON, Excel coming soon)

🔧 Configuration

Environment Variables

Create a .env file in the backend directory:

# Optional: OpenAI API key for enhanced schema generation
OPENAI_API_KEY=your_openai_api_key_here

# Optional: Custom data generation settings
MAX_RECORDS_PER_REQUEST=100000
DEFAULT_OUTPUT_FORMAT=csv

📚 API Documentation

Main Endpoints

Generate Data

POST /api/generate
Content-Type: application/json

{
  "description": "Generate customer data with names and emails",
  "num_records": 1000,
  "schema": null  // Optional custom schema
}

Get Available Data Types

GET /api/data-types

Preview Data

POST /api/preview
Content-Type: application/json

{
  "description": "Generate customer data",
  "num_records": 5
}

🧪 Testing

Backend Tests

cd backend
pytest tests/

Frontend Tests

cd frontend
npm test

🤝 Contributing

🚧 This project is currently under active development 🚧

Genify welcomes contributions! Here's how you can help:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Areas for Contribution

New Data Types: Add support for additional data generation patterns
UI/UX Improvements: Enhance the user interface and experience
Performance: Optimize data generation for larger datasets
Export Formats: Add support for JSON, Excel, Parquet formats
Validation: Improve data validation and schema enforcement
Documentation: Help improve documentation and examples

🐛 Known Issues

Large dataset generation (>50,000 records) may take several minutes
Memory usage increases significantly with very large datasets
Some complex data types may need additional validation
UI responsiveness during large generation operations

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📞 Support

For questions, suggestions, or issues about Genify:

Create an issue on GitHub
Check the documentation for available data types
Review the API docs at /docs when running the backend

🎉 About Genify

Genify simplifies the process of creating realistic test data for developers, QA engineers, and data scientists. Whether you need customer data, financial records, healthcare information, or any other type of structured data, Genify provides an intuitive interface and powerful backend to generate exactly what you need.

⚠️ Disclaimer: This is a development version. Features may change, and some functionality may be incomplete. Use at your own risk for production workloads.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Genify - Advanced Data Generation Platform

✨ Features

🎯 Core Capabilities

📋 Supported Data Types

🏗️ Architecture

Backend (FastAPI)

Frontend (React)

🚀 Getting Started

Prerequisites

Installation

Running the Application

📖 Usage

Basic Data Generation

Advanced Features

🔧 Configuration

Environment Variables

📚 API Documentation

Main Endpoints

Generate Data

Get Available Data Types

Preview Data

🧪 Testing

Backend Tests

Frontend Tests

🤝 Contributing

Areas for Contribution

🐛 Known Issues

📄 License

📞 Support

🎉 About Genify

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages