[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
[NeurIPS 2025] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning
[CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"
✨ [NeurIPS 2025] Official implementation of BridgeVLA
🔥 A curated list of research accompanying "A Survey on Efficient Vision-Language-Action Models". We will continue to maintain and update the repository, so follow us to keep up with the latest developments!
Release of the code, datasets, and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials
Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method
Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
🌐 A curated collection of vision-language-action (VLA) models for autonomous driving applications
Track 2: Social Navigation
[NeurIPS 2025] AGI-Elo: How Far Are We From Mastering A Task?
PickAgent: OpenVLA-powered Pick-and-Place Agent | Gradio & Simulation | Vision-Language-Action Model
VLAGen: Automated Data Collection for Generalizing Robotic Policies