Visual Programming Language Using Converter

Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Abstract: Vision-and-Language Navigation (VLN) agents are tasked with navigating an unseen environment using natural language instructions. In this work, we study if visual representations of ...

IEEE

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Abstract: Automatic detection and prevention of open-set failures are crucial in closed-loop robotic systems. Recent studies often struggle to simultaneously identify unexpected failures reactively ...

Microsoft

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation - Microsoft Research

CLIP is one of the most important multimodal foundational models today, aligning visual and textual signals into a shared feature space using a simple contrastive learning loss on large-scale ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation - Microsoft Research

Trending now