Friday August 14, 2026 14:25 - 14:50 KST
AI agents are increasingly expected to work with visual information: documents, screenshots, cameras, inspection images, and other domain-specific signals. MCP gives developers a powerful way to expose these capabilities as discoverable tools and resources, but real-world computer vision brings challenges that do not disappear behind a simple tool call, including image quality, sensor differences, domain shift, uncertainty, validation, and human review.

In this session, I will explore practical patterns for integrating computer vision into the MCP ecosystem, including OCR, object detection, visual monitoring, and domain-specific inspection tools. The talk will focus on what MCP can standardize for agent-facing vision systems, what should remain domain-specific, and how developers can design safer, more reliable interfaces between AI agents and visual perception.
Speakers
Seowoo Han

Computer Vision Engineer / AI Engineer, B GARAGE
Seowoo Han is an AI Engineer and Computer Vision Engineer based in Korea, with 5+ years of experience building production-grade AI systems across OCR, object detection, drone-based visual inspection, edge video analytics, multimodal AI, RAG, and agentic workflows. She currently works...
Orchid 2
