π§ Work in Progress - Currently Under Development π§
Genify is a powerful and scalable data generation platform that creates realistic mock datasets for testing, development, and data science applications. Built with a modern React frontend and FastAPI backend, supporting 15+ data types and capable of generating up to 100,000+ records.
- π Massive Scale: Generate datasets from 1 to 100,000+ records
- π¨ Modern UI: Beautiful, responsive interface with dark/light theme support
- β‘ Real-time Preview: See your data before generating the full dataset
- π§ Advanced Schema Generation: AI-powered schema creation from natural language descriptions
- π± Mobile Responsive: Works seamlessly on all device sizes
π View Complete Data Types Reference - Comprehensive guide to all the available data generation categories
- Framework: FastAPI with Python
- Data Generation: Faker, NumPy, Custom algorithms
- Validation: Pydantic models
- File Operations: CSV export with configurable options
- Framework: React 19 with Vite
- Styling: Tailwind CSS 4.x
- State Management: React hooks and context
- UI Components: Custom component library
- Python 3.8+
- Node.js 18+
- npm or yarn
-
Clone the Genify repository
git clone https://github.com/Jousure/Genify.git cd Genify -
Backend Setup
cd backend pip install -r requirements.txt -
Frontend Setup
cd frontend npm install
-
Start the Backend Server
cd backend uvicorn app.main:app --reload --host 0.0.0.0 --port 8000 -
Start the Frontend Development Server
cd frontend npm run dev -
Access the Application
- Frontend: http://localhost:5173
- Backend API: http://localhost:8000
- API Documentation: http://localhost:8000/docs
- Describe your dataset in natural language (e.g., "Generate customer data with names, emails, and phone numbers")
- Set the number of records you want to generate
- Preview the generated data
- Download as CSV
- Custom Schemas: Define specific column types and constraints
- Data Validation: Ensure generated data meets your requirements
- Batch Generation: Generate multiple datasets in parallel
- Export Options: Multiple formats (CSV, JSON, Excel coming soon)
Create a .env file in the backend directory:
# Optional: OpenAI API key for enhanced schema generation
OPENAI_API_KEY=your_openai_api_key_here
# Optional: Custom data generation settings
MAX_RECORDS_PER_REQUEST=100000
DEFAULT_OUTPUT_FORMAT=csvPOST /api/generate
Content-Type: application/json
{
"description": "Generate customer data with names and emails",
"num_records": 1000,
"schema": null // Optional custom schema
}GET /api/data-typesPOST /api/preview
Content-Type: application/json
{
"description": "Generate customer data",
"num_records": 5
}cd backend
pytest tests/cd frontend
npm testπ§ This project is currently under active development π§
Genify welcomes contributions! Here's how you can help:
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
- New Data Types: Add support for additional data generation patterns
- UI/UX Improvements: Enhance the user interface and experience
- Performance: Optimize data generation for larger datasets
- Export Formats: Add support for JSON, Excel, Parquet formats
- Validation: Improve data validation and schema enforcement
- Documentation: Help improve documentation and examples
- Large dataset generation (>50,000 records) may take several minutes
- Memory usage increases significantly with very large datasets
- Some complex data types may need additional validation
- UI responsiveness during large generation operations
This project is licensed under the MIT License - see the LICENSE file for details.
For questions, suggestions, or issues about Genify:
- Create an issue on GitHub
- Check the documentation for available data types
- Review the API docs at
/docswhen running the backend
Genify simplifies the process of creating realistic test data for developers, QA engineers, and data scientists. Whether you need customer data, financial records, healthcare information, or any other type of structured data, Genify provides an intuitive interface and powerful backend to generate exactly what you need.