🤖 feat: Drag-and-drop image support + fixes #308

ammar-agent · 2025-10-17T23:26:18Z

Summary

Adds drag-and-drop support for images in the chat input, refactors ChatInput.tsx for maintainability, and fixes image handling bugs.

New Feature: Drag-and-Drop Images

Users can now drag image files directly into the chat input box alongside existing paste (Ctrl+V/Cmd+V) functionality.

How it works:

Drag image file(s) over the chat input textarea
Drop indicator shows (cursor changes to copy icon)
Images convert to base64 and appear as thumbnails
Multiple images supported in one operation

Code Refactoring

Reduced ChatInput.tsx from 1072 → 882 lines (-190 lines, -17.7%):

New Files:

src/utils/imageHandling.ts (73 lines) - Image conversion utilities
src/components/ChatInputToasts.tsx (200 lines) - Toast creation logic
src/utils/imageHandling.test.ts (180 lines) - 10 comprehensive tests

Benefits:

Unified logic for paste and drop (no duplication)
Async/await instead of nested FileReader callbacks
Toast utilities separated from UI component
Easy to test in isolation

Bug Fixes

1. macOS Drag-and-Drop MIME Type Issue

Problem: Dragging PNG files from macOS Finder failed with "image part must include base64 string content"

Root Cause: macOS doesn't populate file.type for dragged files, resulting in empty mediaType

Solution: Added MIME type detection fallback:

Primary: Use file.type if available
Fallback: Detect from file extension (png, jpg, jpeg, gif, webp, bmp, svg)
Default: Use image/png if unrecognized

2. Better Error Messages for Image Validation

Old: "image part must include base64 string content"

New: "image part [0] must include url string content (got undefined): {\"id\":\"...\"}"

Errors now include:

Index of failing image
Type of invalid field
First 50-200 chars of actual data
Specific validation that failed

Testing

✅ 583 unit tests pass
✅ 10 new image handling tests
✅ All integration tests pass
✅ Type checking passes
✅ Manually verified drag-and-drop on macOS

Files Changed

src/components/ChatInput.tsx         | 254 ++++----  (1072 → 882 lines)
src/components/ChatInputToasts.tsx   | 200 +++++++  (new)
src/utils/imageHandling.test.ts      | 180 +++++++  (new)
src/utils/imageHandling.ts           |  95 +++++++  (new)
src/services/agentSession.ts         |  17 +++---  (better errors)

Commits

🤖 feat: Add drag-and-drop image support + refactor ChatInput
🤖 improve: Add detailed debugging info for image validation errors
🤖 fix: Handle drag-and-drop files with missing MIME type

Generated with cmux

Images were being saved to history but not transmitted to the AI model. **Root cause:** - Our CmuxImagePart used type:'image' with fields 'image' and 'mimeType' - AI SDK's convertToModelMessages() only processes type:'file' parts - Images were filtered out before reaching the model **Fix:** - Changed CmuxImagePart to match AI SDK's FileUIPart format: - type: 'image' → 'file' - image → url - mimeType → mediaType - Updated all references across frontend, backend, and type definitions - Updated message aggregation to filter for type:'file' instead of type:'image' **Files changed:** - Types: message.ts, ipc.ts - Backend: agentSession.ts, ipcMain.ts, StreamingMessageAggregator.ts, modelMessageTransform.ts - Frontend: ChatInput.tsx, ImageAttachments.tsx, UserMessage.tsx - Stories: UserMessage.stories.tsx Images now flow correctly from UI → history → AI model.

- Test image transmission to AI model and response - Test image persistence in chat history - Uses 1x1 pixel PNG as minimal test fixture - Verifies both Anthropic and OpenAI providers

Reduces duplication and net LoC: - Add waitForStreamSuccess() - combines create collector + wait + assert - Add readChatHistory() - reads and parses chat.jsonl - Add TEST_IMAGES constant - reusable 1x1 pixel fixtures Image tests now: - 36 lines shorter (removed boilerplate) - More declarative and readable - Easier to add similar tests in future Net change: -36 lines in sendMessage.test.ts, +39 in helpers.ts (+3 total)

Consolidates repeated patterns: - createEventCollector + waitForEvent + assertStreamSuccess - Now just: await waitForStreamSuccess() Reduces 3 lines to 1 in multiple tests for: - bash tool tests - conversation continuity test - additional system instructions test Net: -8 lines

- Import fs/promises correctly in readChatHistory - Add type annotation for line parameter - Cast deltas to StreamDeltaEvent for textDelta access - Add proper null checks for userMessage and imagePart

- Extract image handling utilities to src/utils/imageHandling.ts - Unified paste and drop logic for cleaner code - processImageFiles handles async conversion to base64 - extractImagesFromClipboard/Drop filter image files - Extract toast utilities to src/components/ChatInputToasts.tsx - createCommandToast and createErrorToast - Removed 190 lines from ChatInput.tsx (1072 → 882, -17.7%) - Add drag-and-drop support to ChatInput - onDragOver handler checks for Files and sets dropEffect - onDrop handler processes dropped images - Works alongside existing paste support - Add comprehensive unit tests - src/utils/imageHandling.test.ts covers all utilities - Mock FileReader for Node.js test environment - 10 tests, all passing Users can now drag images directly into the chat input instead of only pasting them.

When image parts fail validation, errors now include: - Index of the failing image part - Type of the invalid field (got typeof X) - First 50-200 chars of actual data received - Specific check that failed (url, data URL format, mediaType) Frontend validation logs errors to console before sending, making it easier to catch issues client-side. Backend validation provides detailed context in assertion messages, making it clear what was received vs. what was expected. Example new error: "image part [0] must include url string content (got undefined): {\"id\":\"...\"}" vs old error: "image part must include base64 string content"

Some browsers/OS combinations (e.g., macOS drag-and-drop) don't populate file.type for dragged files. This causes mediaType to be empty string, which fails validation in the AI SDK. Solution: Fall back to detecting MIME type from file extension when file.type is empty. Defaults to image/png if extension is unrecognized. Supported extensions: png, jpg, jpeg, gif, webp, bmp, svg

- Remove unused ParsedCommand import - Fix async event handler warnings (use void with .then()) - Fix prefer-nullish-coalescing warnings (?? instead of ||) - Fix consistent-type-assertions in tests (use as at call site) - Remove unused beforeEach import

CI has stricter TypeScript settings - need to cast through unknown when mocking complex types like DataTransfer.

ammario

Manually verified — macOS

ammario · 2025-10-18T01:50:37Z

cc @kylecarbs if your image paste breaks it's probably because of this — although I'm pretty sure it won't

## Summary Adds drag-and-drop support for images in the chat input, refactors ChatInput.tsx for maintainability, and fixes image handling bugs. --- ## New Feature: Drag-and-Drop Images Users can now drag image files directly into the chat input box alongside existing paste (Ctrl+V/Cmd+V) functionality. **How it works:** - Drag image file(s) over the chat input textarea - Drop indicator shows (cursor changes to copy icon) - Images convert to base64 and appear as thumbnails - Multiple images supported in one operation --- ## Code Refactoring Reduced ChatInput.tsx from 1072 → 882 lines (-190 lines, -17.7%): **New Files:** - `src/utils/imageHandling.ts` (73 lines) - Image conversion utilities - `src/components/ChatInputToasts.tsx` (200 lines) - Toast creation logic - `src/utils/imageHandling.test.ts` (180 lines) - 10 comprehensive tests **Benefits:** - Unified logic for paste and drop (no duplication) - Async/await instead of nested FileReader callbacks - Toast utilities separated from UI component - Easy to test in isolation --- ## Bug Fixes ### 1. macOS Drag-and-Drop MIME Type Issue **Problem:** Dragging PNG files from macOS Finder failed with "image part must include base64 string content" **Root Cause:** macOS doesn't populate `file.type` for dragged files, resulting in empty mediaType **Solution:** Added MIME type detection fallback: - Primary: Use `file.type` if available - Fallback: Detect from file extension (png, jpg, jpeg, gif, webp, bmp, svg) - Default: Use `image/png` if unrecognized ### 2. Better Error Messages for Image Validation **Old:** `"image part must include base64 string content"` **New:** `"image part [0] must include url string content (got undefined): {\"id\":\"...\"}"` Errors now include: - Index of failing image - Type of invalid field - First 50-200 chars of actual data - Specific validation that failed --- ## Testing - ✅ 583 unit tests pass - ✅ 10 new image handling tests - ✅ All integration tests pass - ✅ Type checking passes - ✅ Manually verified drag-and-drop on macOS --- ## Files Changed ``` src/components/ChatInput.tsx | 254 ++++---- (1072 → 882 lines) src/components/ChatInputToasts.tsx | 200 +++++++ (new) src/utils/imageHandling.test.ts | 180 +++++++ (new) src/utils/imageHandling.ts | 95 +++++++ (new) src/services/agentSession.ts | 17 +++--- (better errors) ``` --- ## Commits - 🤖 feat: Add drag-and-drop image support + refactor ChatInput - 🤖 improve: Add detailed debugging info for image validation errors - 🤖 fix: Handle drag-and-drop files with missing MIME type _Generated with `cmux`_

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

src/utils/imageHandling.ts

src/utils/messages/StreamingMessageAggregator.ts

**Issue 1: Drag-drop files with empty MIME types were filtered out** - extractImagesFromDrop() rejected files where file.type === "" - This happened BEFORE the MIME type fallback could run - Fix: Also accept files with image extensions when file.type is empty - Test: Added test for macOS drag-drop scenario (empty MIME type) **Issue 2: Breaking change for existing users with saved images** - Changed from type: "image" to type: "file" in PR #308 - StreamingMessageAggregator only looked for type === "file" - Users who saved images before upgrade would lose them - Fix: Accept both "file" (new) and "image" (legacy) types - Uses type casting with eslint-disable for backwards compat Both fixes maintain backwards compatibility with existing chat history while fixing the macOS drag-and-drop bug. Codex comments: - #308 (comment) - #308 (comment)

TypeScript CI required proper type narrowing for the legacy image filter. Added type predicate (p): p is CmuxImagePart to narrow the union type.

ammar-agent added 8 commits October 17, 2025 18:22

test: Add integration tests for image support through IPC

0286da3

- Test image transmission to AI model and response - Test image persistence in chat history - Uses 1x1 pixel PNG as minimal test fixture - Verifies both Anthropic and OpenAI providers

fix: Remove stray closing brace in helpers.ts

920e676

fix: Import modelString from helpers

3780414

fix: TypeScript errors in test helpers and image tests

b0362af

- Import fs/promises correctly in readChatHistory - Add type annotation for line parameter - Cast deltas to StreamDeltaEvent for textDelta access - Add proper null checks for userMessage and imagePart

fix: Access textDelta from delta property in StreamDeltaEvent

2d23a2d

ammar-agent force-pushed the image branch from ce1e5a9 to 2d23a2d Compare October 17, 2025 23:48

ammar-agent added 5 commits October 17, 2025 18:52

fix: Add imageParts to sendMessage helper type signature

bccf320

fix: Format test files with Prettier

fd14724

ammar-agent changed the title ~~🤖 Fix: Images not visible to AI model~~ 🤖 feat: Drag-and-drop image support + fixes Oct 18, 2025

ammar-agent added 2 commits October 17, 2025 20:21

🤖 fix: TypeScript strict casting in test mocks

0b14c62

CI has stricter TypeScript settings - need to cast through unknown when mocking complex types like DataTransfer.

ammario approved these changes Oct 18, 2025

View reviewed changes

ammario marked this pull request as ready for review October 18, 2025 01:50

ammario added this pull request to the merge queue Oct 18, 2025

chatgpt-codex-connector bot reviewed Oct 18, 2025

View reviewed changes

src/utils/imageHandling.ts Show resolved Hide resolved

src/utils/messages/StreamingMessageAggregator.ts Show resolved Hide resolved

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 18, 2025

ammar-agent added 2 commits October 17, 2025 21:09

🤖 fix: Add missing import and type guard for CmuxImagePart

d8374fd

TypeScript CI required proper type narrowing for the legacy image filter. Added type predicate (p): p is CmuxImagePart to narrow the union type.

ammario added this pull request to the merge queue Oct 18, 2025

Merged via the queue into main with commit d0dc0c1 Oct 18, 2025
8 checks passed

ammario deleted the image branch October 18, 2025 02:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

🤖 feat: Drag-and-drop image support + fixes #308

🤖 feat: Drag-and-drop image support + fixes #308

Uh oh!

ammar-agent commented Oct 17, 2025 •

edited

Loading

Uh oh!

ammario left a comment

Uh oh!

ammario commented Oct 18, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

🤖 feat: Drag-and-drop image support + fixes #308

🤖 feat: Drag-and-drop image support + fixes #308

Uh oh!

Conversation

ammar-agent commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

New Feature: Drag-and-Drop Images

Code Refactoring

Bug Fixes

1. macOS Drag-and-Drop MIME Type Issue

2. Better Error Messages for Image Validation

Testing

Files Changed

Commits

Uh oh!

ammario left a comment

Choose a reason for hiding this comment

Uh oh!

ammario commented Oct 18, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ammar-agent commented Oct 17, 2025 •

edited

Loading