The Instant Architect 🛋️✨

Built for the Google Live Agent Hackathon 2026 🏆

Welcome to The Instant Architect, an AI-powered interior design partner that "sees" through your device's camera, listens to your natural language instructions, and live-generates (in-paints) stunning furniture into your room using Generative AI.

🚀 Core Concept

📱 Recommended Experience: This application is best experienced on a smartphone. The UI is optimized for mobile screens, so you can use your rear camera like an AR lens while talking to the AI. To try it, open the deployed URL in your phone's browser.

The application serves as an enthusiastic interior architect. By leveraging low-latency audio and vision streaming via WebSockets, you can have a natural conversation with the AI about your room.

When you agree on a design suggestion (e.g., "Yes, put a blue Bauhaus couch there!"), the AI triggers a tool that captures a high-resolution snapshot of your living space and sends it to an image-generation backend. Within seconds, a photorealistic rendering of the suggested furniture is placed seamlessly into your room's live feed.

🏗️ Architecture

The project is structured as a monorepo (a pnpm workspace) containing a modern web frontend and a lightweight backend orchestrator.

1. Frontend (`/client`)

Built with: React and Vite (optimized for mobile Safari/Chrome).
Functionality:
- Captures full-screen video (object-fit: cover) from the user's mobile camera.
- Maintains a persistent WebSocket connection to the Node.js Backend, which securely relays data to the GenAI Live API.
- Streams 16kHz PCM audio and base64 video frames in real-time.
- Receives AI audio responses and plays them back dynamically via the Web Audio API (AudioContext).
- Listens for Gemini's specific render_furniture Function Calls to trigger the visual magic.
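
The audio side of the list above comes down to converting between the float samples the Web Audio API works with and raw 16-bit PCM. The following is a minimal sketch of that conversion, not the repository's actual code: the function names `floatTo16BitPCM` and `pcm16ToFloat32` are hypothetical, and little-endian byte order is an assumption. Base64 encoding and the WebSocket transport are left out.

```typescript
// Sketch only: function names are illustrative and not taken from the repository.

/** Convert Float32 samples in [-1, 1] to 16-bit signed little-endian PCM bytes. */
function floatTo16BitPCM(samples: Float32Array): Uint8Array {
  const bytes = new Uint8Array(samples.length * 2);
  const view = new DataView(bytes.buffer);
  for (let i = 0; i < samples.length; i++) {
    // Clamp first so out-of-range input cannot wrap around.
    const s = Math.max(-1, Math.min(1, samples[i]));
    // int16 is asymmetric: negatives scale by 32768, positives by 32767.
    const scaled = s < 0 ? s * 0x8000 : s * 0x7fff;
    view.setInt16(i * 2, Math.round(scaled), true);
  }
  return bytes;
}

/** Convert 16-bit signed little-endian PCM bytes back to Float32 samples. */
function pcm16ToFloat32(bytes: Uint8Array): Float32Array {
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  // A trailing odd byte, if any, is ignored.
  const out = new Float32Array(bytes.byteLength >> 1);
  for (let i = 0; i < out.length; i++) {
    out[i] = view.getInt16(i * 2, true) / 0x8000;
  }
  return out;
}
```

In a browser, the decoded samples could then be copied into an `AudioBuffer` created with the incoming sample rate and scheduled on the `AudioContext`.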

2. Backend (`/server`)

Built with: Node.js, Express, and Multer.
Functionality:
- Keeps the API keys secure.
- Exposes the /api/inpaint endpoint.
- Receives high-resolution frame snapshots from the client when the image generation tool is triggered.
- Calls Google's multimodal models through the Google GenAI SDK (gemini-live-2.5-flash-native-audio for the live conversation, gemini-2.5-flash-image for image generation) to process the snapshot and prompt and generate the in-painted result.
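
To make "process the image and prompt" more concrete, here is one way the server could turn the tool parameters into a text prompt for the image model. This is a sketch under assumptions: the `FurnitureParams` shape, the `color` field, the `buildInpaintPrompt` name, and the prompt wording are all hypothetical. The README only mentions fabric and type as example parameters.

```typescript
// Hypothetical parameter shape; only "fabric" and "type" are mentioned in this README.
interface FurnitureParams {
  type: string;
  fabric?: string;
  color?: string;
}

/** Build a text prompt that asks the image model to add one item to the room. */
function buildInpaintPrompt(params: FurnitureParams): string {
  // Keep only descriptors that are present and non-empty.
  const descriptors = [params.color, params.fabric]
    .map((d) => d?.trim())
    .filter((d): d is string => Boolean(d));
  const item = [...descriptors, params.type.trim()].join(" ");
  return (
    `Add a photorealistic ${item} to this room. ` +
    "Keep the existing walls, floor, lighting, and perspective unchanged."
  );
}
```

The second sentence of the prompt is there to discourage the model from redrawing the rest of the room; the real prompt may differ.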

🔄 The "Magic" Workflow

1. The Conversation: The web app captures your microphone and camera. Data is continuously streamed to Gemini. The model is prompted with a specific persona ("Enthusiastic interior architect").
2. The Output: Gemini speaks back to you. The frontend decodes the incoming base64 24kHz PCM audio chunks and plays them instantly.
3. The Trigger: You tell the agent: "I want to see the couch." The Gemini model triggers the pre-defined render_furniture or render_room tool.
4. The Snapshot: The frontend intercepts this tool call, instantly grabs a high-resolution snapshot of the video feed, and sends it to the local Express backend alongside the AI's parameters (e.g., fabric, type).
5. The In-Painting: The Node.js server routes the image and the formulated prompt to Google's Image Generation API.
6. The Result: The backend returns the final generated image (Base64), which the frontend overlays beautifully onto your screen as a "Wow" moment.
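
The trigger and snapshot steps above hinge on the frontend recognizing a render tool call and extracting its parameters. A minimal sketch of that routing follows. The `ToolCall` shape, the `RenderAction` type, and `planRenderAction` are assumptions made for illustration; only the tool names render_furniture and render_room come from this README.

```typescript
// Assumed minimal shape of a function call surfaced by the live session.
interface ToolCall {
  id: string;
  name: string;
  args: Record<string, unknown>;
}

type RenderAction =
  | { kind: "render"; tool: "render_furniture" | "render_room"; fields: Record<string, string> }
  | { kind: "ignore"; reason: string };

const RENDER_TOOLS = ["render_furniture", "render_room"] as const;

/** Decide whether a tool call should trigger a snapshot and which fields to send. */
function planRenderAction(call: ToolCall): RenderAction {
  const tool = RENDER_TOOLS.find((t) => t === call.name);
  if (!tool) {
    return { kind: "ignore", reason: `unknown tool: ${call.name}` };
  }
  // Forward only flat string/number arguments as form fields.
  const fields: Record<string, string> = {};
  for (const [key, value] of Object.entries(call.args)) {
    if (typeof value === "string" || typeof value === "number") {
      fields[key] = String(value);
    }
  }
  return { kind: "render", tool, fields };
}
```

For a "render" action, the frontend would then capture the current video frame and send it together with `fields` to /api/inpaint, for example as multipart form data.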

🛠️ Local Setup Instructions

Prerequisites

- Node.js (v18+ recommended).
- pnpm (used below to install the workspace dependencies).
- A Google Cloud Project with the Vertex AI API enabled.
- Google Cloud CLI (gcloud) installed and authenticated.

1. Clone & Install

git clone https://github.com/timremmert/Architect-Agent.git
cd Architect-Agent

# Install all workspace dependencies at once
pnpm install

2. Environment Variables

You only need to configure one variable: the Google Cloud Project ID, which is set in the backend's environment file.

Backend (server/.env):

GOOGLE_CLOUD_PROJECT=YOUR_PROJECT_ID
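
A server can fail fast at startup if this variable is missing or still holds the placeholder. The sketch below shows one way to do that. The `requireProjectId` helper is hypothetical, and how the repository actually loads server/.env is not shown in this README.

```typescript
/**
 * Return the configured project ID or throw a descriptive error.
 * Takes the environment as a parameter so it can be checked without touching process.env.
 */
function requireProjectId(env: Record<string, string | undefined>): string {
  const projectId = env["GOOGLE_CLOUD_PROJECT"]?.trim();
  // Treat the README placeholder as "not configured".
  if (!projectId || projectId === "YOUR_PROJECT_ID") {
    throw new Error("GOOGLE_CLOUD_PROJECT is not set; add it to server/.env");
  }
  return projectId;
}

// Typical call at server startup (Node.js): requireProjectId(process.env)
```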

3. Google Cloud Authentication

Since the application connects directly to Vertex AI, you need to authorize your local environment using Application Default Credentials (ADC) and specify your project ID:

gcloud auth application-default login
gcloud config set project YOUR_PROJECT_ID

4. Run the Development Server

From the repository root (the Architect-Agent directory created by the clone above), you can start both the frontend and the backend simultaneously:

pnpm run dev

The React frontend will run at http://localhost:5173
The Express backend will run at http://localhost:3001

Note: Browsers only grant microphone and camera access in a secure context, so open the app via localhost or over HTTPS. A plain http:// address on your local network, for example when testing from a phone, will typically be blocked.
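
As a rough illustration of which addresses qualify, the sketch below approximates the browser's rule for a given URL. The `isLikelySecureOrigin` helper is hypothetical and simplified. In the browser itself, `window.isSecureContext` is the authoritative check.

```typescript
/**
 * Approximate whether a page at this URL can request camera/microphone access.
 * Simplified: HTTPS and loopback hosts are treated as secure, everything else is not.
 */
function isLikelySecureOrigin(url: string): boolean {
  const { protocol, hostname } = new URL(url);
  if (protocol === "https:") return true;
  return (
    hostname === "localhost" ||
    hostname === "127.0.0.1" ||
    hostname.endsWith(".localhost")
  );
}
```

For example, `http://localhost:5173` passes, while the same dev server reached from a phone at a LAN address such as `http://192.168.1.20:5173` does not.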

⚠️ Notes on API

You may need to set up a billing account for your Google Cloud Project and activate the Vertex AI API to use the API.

⚠️ Notes on API Limits

Image generation models (like gemini-2.5-flash-image) have strict rate limits and quotas under the free tier. If you encounter a 429 Resource Exhausted or Quota error, you may need to either link your API key to a billed Google Cloud Project or utilize a mock-mode.
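
One common way to soften quota errors is to retry with exponential backoff. The following is a generic sketch, not the repository's behavior: `withQuotaRetry`, `backoffDelayMs`, and `isQuotaError` are hypothetical helpers, and because the way the SDK exposes the HTTP status is not covered here, the caller supplies a `getStatus` function.

```typescript
/** True for the HTTP status that signals an exhausted quota or rate limit. */
function isQuotaError(status: number): boolean {
  return status === 429;
}

/** Exponential backoff: base, 2x base, 4x base, ... capped at maxMs. */
function backoffDelayMs(attempt: number, baseMs = 1000, maxMs = 30000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}

/**
 * Run `call`, retrying only on quota errors, up to `maxAttempts` total attempts.
 * `getStatus` extracts an HTTP status from a thrown error (assumption: caller knows how).
 */
async function withQuotaRetry<T>(
  call: () => Promise<T>,
  getStatus: (err: unknown) => number | undefined,
  maxAttempts = 4,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await call();
    } catch (err) {
      const status = getStatus(err);
      const retryable = status !== undefined && isQuotaError(status);
      // Give up on non-quota errors or once the attempt budget is spent.
      if (!retryable || attempt + 1 >= maxAttempts) throw err;
      await sleep(backoffDelayMs(attempt));
    }
  }
}
```

Retries only help with short-lived rate limits. If the daily quota is exhausted, the request will keep failing until the quota resets or billing is enabled.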

☁️ Deployment (Google Cloud Run)

The application can be deployed reproducibly to Google Cloud Run using Terraform. This deploys a single container that serves both the built React frontend and the Express backend.

Prerequisites

- Google Cloud CLI (gcloud) installed and authenticated.
- Terraform installed.
- A Google Cloud Project with billing enabled.
- Docker installed and running (or Podman with Docker compatibility mode).

Deployment Steps

We've bundled the entire deployment process into a single executable script, so you don't have to build Docker images or manage Terraform state by hand.

1. Copy Environment Variables: Make sure you have copied server/.env.example to server/.env and filled in your GOOGLE_CLOUD_PROJECT.
2. Run the Deployment Script: From the root repository directory, simply run:
```
./deploy.sh
```

The script will automatically:

1. Authenticate with your Google Cloud account
2. Enable required Google Cloud APIs (Artifact Registry, Cloud Run, Vertex AI)
3. Initialize Terraform and create an Artifact Registry
4. Build the Node.js+React container locally
5. Push the Docker Image to your Google Cloud project
6. Deploy the Cloud Run application

After applying, the console will output your public service_url.

Continuous Updates

To deploy a new version of your code later, simply run ./deploy.sh again. It handles both fresh provisions and code updates automatically!

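Alternatively, if the image has already been pushed to Artifact Registry, you can redeploy it with a single command (replace PROJECT-ID with your project ID):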

gcloud run deploy instant-architect \
  --image europe-west1-docker.pkg.dev/PROJECT-ID/instant-architect-repo/instant-architect:latest \
  --region europe-west1
