-
Notifications
You must be signed in to change notification settings - Fork 0
AI_Grounded with SAM2
ehdwo0427 edited this page Aug 4, 2025
·
2 revisions
상위 문서로 이동 : AI Wiki
- We use Grounded SAM2 to automatically detect and segment objects in an image based on a text prompt (e.g.,
"monitor. keyboard. mouse."). - It combines Grounding DINO (for text-based object detection) with SAM2 (for high-quality segmentation), producing pixel-accurate masks for each object.
- visualize object boundaries
- generate per-class binary masks
- or apply inpainting models to replace or modify specific regions
-
Origin Image

-
Grounded SAM2.1

-
Mask
| desk | deskmat | laptop | monitor | mouse |
|---|---|---|---|---|
![]() |
![]() |
![]() |
![]() |
![]() |
-
Origin Image

-
Grounded SAM2.1

-
Mask
| desk | deskmat | monitor |
|---|---|---|
![]() |
![]() |
![]() |
| keyboard | mouse | speaker |
|---|---|---|
![]() |
![]() |
![]() |
-
Origin Image

-
Grounded SAM2.1

-
Mask
| desk | deskmat | monitor |
|---|---|---|
![]() |
![]() |
![]() |
| laptop | keyboard | mouse | speaker |
|---|---|---|---|
![]() |
![]() |
![]() |
![]() |
-
Origin Image

-
Grounded SAM2.1

-
Mask
| desk | monitor | desktop |
|---|---|---|
![]() |
![]() |
![]() |
| keyboard | mouse | speaker |
|---|---|---|
![]() |
![]() |
![]() |
- When saving masks, objects with the same class name overwrite each other. Only the last instance is preserved
- In some cases (e.g., test1), unintended objects such as the other person's mouse may be detected
- If the label confidence is low or ambiguous, a single object may be split into multiple segments (e.g.,
"desktop_monitor"detected as two parts)
- Merge all masks into a single combined mask
- Apply SDXL inpainting using the original image and the merged mask
- Fine-tune the SDXL inpainting model for better domain-specific results
Deepvisions | AI Engineer 2026.03 ~ 재직중
2026.05 ~ | @ Deepvisions 캠퍼스 CCTV 4대 · 자전거 OCR + 차량 공회전 다중 신호
2026.04 ~ | @ Deepvisions 포도밭 침입 탐지 (5종 multi-class · 라즈베리파이 4 실시간)
2026.03 ~ | @ Deepvisions 드론 이미지 기반 객체 탐지 + GSD calibration + 수확량 예측
- 프로젝트 메인
- 관련 연구 종합 + 한계 (2026-05) ← 최신
- 수확량 close-up 4장 + 3-Model (2026-05-19)
- 드론 포도 수확량 예측 — 파이프라인 (2026-05)
- 드론 포도송이 탐지 — 학습 변천사 (2026-04)
- SAM3 vs Fine-tuned YOLO
- Grounding DINO vs YOLO Top3 비교 요약
- YOLO Baseline Top3 비교 요약
- YOLO Model Comparison Summary
- 포도 탐지를 위한 데이터 수집
- 포도 수확량 측정을 위한 Object Detection
2025.03 ~ 2025.08 | 카카오테크부트캠프 | ✅ 종료 AI 기반 데스크테리어 추천 서비스
- Name: Woody (이동재)
- Focus: Vision AI, LLM Integration, Backend Engineering
- GitHub: @ehdwo0427
- Email: ehdwo0427@naver.com
- 포트폴리오 : 포트폴리오























