AI_Grounded with SAM2

상위 문서로 이동 : AI Wiki

Grounded with SAM2

We use Grounded SAM2 to automatically detect and segment objects in an image based on a text prompt (e.g., "monitor. keyboard. mouse.").
It combines Grounding DINO (for text-based object detection) with SAM2 (for high-quality segmentation), producing pixel-accurate masks for each object.

Masks are used

visualize object boundaries
generate per-class binary masks
or apply inpainting models to replace or modify specific regions

Example Output

Test1

Origin Image
Grounded SAM2.1
Mask

desk	deskmat	laptop	monitor	mouse

Test2

Origin Image
Grounded SAM2.1
Mask

desk	deskmat	monitor

keyboard	mouse	speaker

Test3

Origin Image
Grounded SAM2.1
Mask

desk	deskmat	monitor

laptop	keyboard	mouse	speaker

Test4

Origin Image
Grounded SAM2.1
Mask

desk	monitor	desktop

keyboard	mouse	speaker

Issues

When saving masks, objects with the same class name overwrite each other. Only the last instance is preserved
In some cases (e.g., test1), unintended objects such as the other person's mouse may be detected
If the label confidence is low or ambiguous, a single object may be split into multiple segments (e.g., "desktop_monitor" detected as two parts)

Next Steps

Merge all masks into a single combined mask
Apply SDXL inpainting using the original image and the merged mask
Fine-tune the SDXL inpainting model for better domain-specific results

Reference

IDEA-Research / Grounded-SAM-2

Woody's AI Backend Engineering Log

Home

💼 About

Deepvisions | AI Engineer 2026.03 ~ 재직중

🚀 Projects (최신순)

CCTV 자전거 경로 & 공회전 탐지 — 한동대학교 리빙랩

2026.05 ~ | @ Deepvisions 캠퍼스 CCTV 4대 · 자전거 OCR + 차량 공회전 다중 신호

야생동물 탐지 — RPi 엣지 배포

2026.04 ~ | @ Deepvisions 포도밭 침입 탐지 (5종 multi-class · 라즈베리파이 4 실시간)

포도밭 병해충 탐지 및 수확량 예측

2026.03 ~ | @ Deepvisions 드론 이미지 기반 객체 탐지 + GSD calibration + 수확량 예측

📦 종료된 프로젝트

OnTheTop

2025.03 ~ 2025.08 | 카카오테크부트캠프 | ✅ 종료 AI 기반 데스크테리어 추천 서비스

AI Notes

About

Name: Woody (이동재)
Focus: Vision AI, LLM Integration, Backend Engineering
GitHub: @ehdwo0427
Email: ehdwo0427@naver.com
포트폴리오 : 포트폴리오

AI_Grounded with SAM2

Grounded with SAM2

Masks are used

Example Output

Test1

Test2

Test3

Test4

Issues

Next Steps

Reference

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Woody's AI Backend Engineering Log

Home

💼 About

🚀 Projects (최신순)

CCTV 자전거 경로 & 공회전 탐지 — 한동대학교 리빙랩

야생동물 탐지 — RPi 엣지 배포

포도밭 병해충 탐지 및 수확량 예측

📦 종료된 프로젝트

OnTheTop

AI Notes

About

Clone this wiki locally