AutoEditAI - AI Photoshop Assistant

Version 0.2.0 - August 2025

AutoEditAI is an AI-powered Photoshop assistant that automates image edits requested on Reddit's r/PhotoshopRequest using Google's Gemini AI models.

0902.1.mp4

🚀 Features

Automated Reddit Integration: Fetches new posts from r/PhotoshopRequest with image attachments
AI-Powered Request Parsing: Uses Gemini 2.5 Flash to parse natural language requests into structured edit forms
Intelligent Image Editing: Leverages AutoEditAI (Gemini 2.5 Flash Image Preview) for automated image processing
Modern Dashboard UI: Clean, responsive interface built with Next.js and shadcn/ui
Before/After Comparison: Side-by-side or slider view for comparing edits
Download & Export: Easy download of processed images
Processing History: Track all completed and failed requests

🏗️ Architecture

Tech Stack

Frontend: Next.js 14 (App Router) + TypeScript + TailwindCSS
UI Components: shadcn/ui + Radix UI
Backend: Next.js API Routes (Edge Runtime)
Reddit API: snoowrap (JavaScript Reddit API client)
AI Models:
- Gemini 2.5 Flash → Text parsing and request understanding
- AutoEditAI (Gemini 2.5 Flash Image Preview) → Image editing and generation
Icons: Lucide React

Core Workflow

Fetch → Reddit posts with images from r/PhotoshopRequest
Parse → Extract request text and convert to structured edit form using Gemini
Edit → Process images using AutoEditAI with the generated edit form
Display → Show before/after results in the dashboard

📋 Prerequisites

Node.js 18+
npm or yarn
Google AI API key (for Gemini models)
Reddit API credentials (optional - uses mock data if not configured)

🛠️ Installation

Clone and install dependencies:

git clone <repository-url>
cd autoeditai
npm install

Configure environment variables:

Create a .env.local file in the root directory:

# Google AI (Gemini) Configuration
GOOGLE_AI_API_KEY=your_google_ai_api_key_here

# Reddit API Configuration (optional - uses mock data if not set)
REDDIT_CLIENT_ID=your_reddit_client_id
REDDIT_CLIENT_SECRET=your_reddit_client_secret
REDDIT_USERNAME=your_reddit_username
REDDIT_PASSWORD=your_reddit_password

# Optional: Storage Configuration (for later)
AWS_ACCESS_KEY_ID=your_aws_access_key
AWS_SECRET_ACCESS_KEY=your_aws_secret_key
AWS_S3_BUCKET=your_s3_bucket_name

Start the development server:

npm run dev

Open your browser: Navigate to http://localhost:3000

🔧 Configuration

Google AI Setup

Visit Google AI Studio
Create a new API key
Add it to your .env.local file

Reddit API Setup (Optional)

Go to Reddit Apps
Create a new application (type: script)
Note your Client ID and Client Secret
Add Reddit credentials to .env.local

Note: If Reddit credentials are not configured, the app will use mock data for demonstration.

🎯 Usage

Dashboard Overview

The app has three main sections:

Queue - View and process new Reddit requests
Editor - Review before/after comparisons
History - Browse processed requests

Processing a Request

Fetch Posts: Click "Refresh Posts" in the Queue tab
Select Request: Browse available posts with images
Parse & Edit: Click the "Parse & Edit" button on any post
Review Results: Switch to Editor tab to see the processed image
Download: Save the edited image to your device

Edit Form Structure

The AI parses requests into this structured format:

{
  "task_type": "object_removal",
  "instructions": "Remove the man in the background",
  "objects_to_remove": ["man in background"],
  "objects_to_add": [],
  "style": "realistic",
  "mask_needed": true
}

🏗️ Project Structure

autoeditai/
├── src/
│   ├── app/                    # Next.js app router
│   │   ├── api/               # API routes
│   │   │   ├── reddit/        # Reddit integration
│   │   │   └── gemini/        # AI processing
│   │   ├── globals.css        # Global styles
│   │   ├── layout.tsx         # Root layout
│   │   └── page.tsx           # Landing page
│   │   └── app/               # Dashboard pages
│   ├── components/            # React components
│   │   ├── ui/               # shadcn/ui components
│   │   ├── queue-view.tsx    # Reddit posts queue
│   │   ├── editor-view.tsx   # Image editor/comparison
│   │   └── history-view.tsx  # Processing history
│   ├── lib/                  # Utilities and services
│   │   ├── database.ts       # Supabase integration
│   │   ├── auth-context.tsx  # Authentication
│   │   └── utils.ts          # Helper functions
│   └── types/                # TypeScript definitions
│       └── index.ts          # Type definitions
├── vercel.json               # Vercel deployment config
├── .vercelignore             # Files to exclude from Vercel build
├── supabase-schema.sql       # Database schema (reference)
└── README.md                 # Project documentation

🔌 API Endpoints

Reddit Integration

GET /api/reddit/posts - Fetch posts from r/PhotoshopRequest

AI Processing

POST /api/gemini/parse - Parse request text into edit form
POST /api/gemini/edit - Process image with edit form

🎨 UI Components

Built with shadcn/ui components:

Card - Post/request containers
Button - Actions and interactions
Tabs - Navigation between views
Table - History and data display
Slider - Before/after image comparison
Badge - Status indicators

🚀 Deployment

Vercel (Recommended)

Connect your GitHub repository to Vercel
Add environment variables in Vercel dashboard
Deploy automatically on push

Other Platforms

The app can be deployed to any platform supporting Next.js:

Netlify
Railway
DigitalOcean App Platform

🔮 Future Enhancements

Persistent Storage: S3/Supabase for image storage
Database Integration: Store processing history and user preferences
Batch Processing: Process multiple requests simultaneously
Advanced Editing: More sophisticated edit types and styles
User Authentication: Multi-user support with access controls
Webhooks: Real-time Reddit post notifications
Analytics: Processing metrics and performance insights

📝 Development

Available Scripts

npm run dev      # Start development server
npm run build    # Build for production
npm run start    # Start production server
npm run lint     # Run ESLint

Code Quality

TypeScript: Full type safety
ESLint: Code linting and formatting
Prettier: Code formatting (via ESLint)

🤝 Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

📄 License

MIT License - see LICENSE file for details.

🆘 Support

For issues and questions:

Check the Issues page
Create a new issue with detailed information
Include error messages and steps to reproduce

🙏 Acknowledgments

Google Gemini AI for powerful AI models
Reddit for the PhotoshopRequest community
shadcn/ui for beautiful UI components
Next.js for the React framework

Built with ❤️ for the creative community

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
public		public
src		src
supabase/.temp		supabase/.temp
.gitignore		.gitignore
.vercelignore		.vercelignore
README.md		README.md
next-env.d.ts		next-env.d.ts
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
supabase-schema.sql		supabase-schema.sql
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json
vercel.json		vercel.json

Xenonesis/AutoEditAI

Folders and files

Latest commit

History

Repository files navigation