An open API service providing repository metadata for many open source software ecosystems.

Topic: "embodied-ai"

TianxingChen/Embodied-AI-Guide

[Lumina Embodied AI Community] 具身智能技术指南 Embodied-AI-Guide

Size: 22 MB - Last synced at: 32 minutes ago - Pushed at: about 1 hour ago - Stars: 5,175 - Forks: 330

EvolvingLMMs-Lab/Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python - Size: 7.39 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 3,253 - Forks: 212

dora-rs/dora

DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.

Language: Rust - Size: 11.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2,179 - Forks: 178

unrealcv/unrealcv

UnrealCV: Connecting Computer Vision to Unreal Engine

Language: C++ - Size: 18.1 MB - Last synced at: 5 days ago - Pushed at: 27 days ago - Stars: 2,002 - Forks: 444

hyp1231/awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

Size: 159 KB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 1,999 - Forks: 157

facebookresearch/theseus

A library for differentiable nonlinear optimization

Language: Python - Size: 11.3 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1,888 - Forks: 133

haosulab/ManiSkill

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Language: Python - Size: 827 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,572 - Forks: 271

HCPLab-SYSU/Embodied_AI_Paper_List

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

Size: 11.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1,416 - Forks: 95

zchoi/Awesome-Embodied-Robotics-and-Agent

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

Size: 1.59 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,338 - Forks: 77

TianxingChen/RoboTwin

[CVPR 25 Highlight & ECCV Workshop 24 Best Paper] RoboTwin Dual-arm Robot Manipulation Simulation Platform

Language: Python - Size: 34 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 841 - Forks: 96

OpenRL-Lab/openrl

Unified Reinforcement Learning Framework

Language: Python - Size: 8 MB - Last synced at: 30 days ago - Pushed at: 8 months ago - Stars: 720 - Forks: 66

OpenDriveLab/DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

Language: Python - Size: 13.4 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 718 - Forks: 31

Zhefan-Xu/NavRL

[IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)

Language: C++ - Size: 215 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 653 - Forks: 67

huangwl18/VoxPoser

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

Language: Python - Size: 7.11 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 637 - Forks: 82

Skylark0924/Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Language: Python - Size: 1.01 GB - Last synced at: 4 days ago - Pushed at: 15 days ago - Stars: 620 - Forks: 55

ChaofanTao/Autoregressive-Models-in-Vision-Survey

[TMLR 2025🔥] A survey for the autoregressive models in vision.

Size: 7.96 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 571 - Forks: 15

simpler-env/SimplerEnv

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Language: Jupyter Notebook - Size: 16.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 555 - Forks: 76

geng-haoran/Simulately

A universal summary of current robotics simulators

Language: TypeScript - Size: 670 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 454 - Forks: 24

haoranD/Awesome-Embodied-AI

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

Size: 116 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 416 - Forks: 16

RobotecAI/rai

RAI is an agentic framework for robotics, utilizing Langchain and ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more.

Language: Python - Size: 52.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 313 - Forks: 39

MarSaKi/ETPNav

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Language: Python - Size: 8.53 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 290 - Forks: 24

huangwl18/language-planner

Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

Language: Jupyter Notebook - Size: 20.3 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 271 - Forks: 33

microsoft/CogACT

A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Language: Python - Size: 95.7 KB - Last synced at: 4 days ago - Pushed at: 20 days ago - Stars: 259 - Forks: 20

allenai/procthor

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Language: Python - Size: 3.96 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 219 - Forks: 18

HCPLab-SYSU/Book-of-MLM

《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞

Language: HTML - Size: 33.7 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 205 - Forks: 21

FlagOpen/RoboBrain

[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.

Language: Python - Size: 13.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 189 - Forks: 10

MarSaKi/VLN-BEVBert

[ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"

Language: Python - Size: 9.73 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 163 - Forks: 4

HaoyiZhu/SPA

[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

Language: Python - Size: 9.98 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 143 - Forks: 5

CyberOrigin2077/Cyber

This repo is designed for General Robotic Operation System

Language: Jupyter Notebook - Size: 34.5 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 136 - Forks: 21

UnrealZoo/unrealzoo-gym Fork of zfw1226/gym-unrealcv

Large-scale photo-realistic virtual worlds for embodied AI

Language: Python - Size: 112 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 128 - Forks: 8

thunlp/LEGENT

Open Platform for Embodied Agents

Language: Python - Size: 1.72 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 128 - Forks: 6

rllab-snu/RNR-Map

Official Github repository for "Renderable Neural Radiance Map for Visual Navigation‬". (CVPR 2023 Highlight)

Language: Python - Size: 148 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 126 - Forks: 1

leofan90/Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related websites.

Size: 123 KB - Last synced at: 7 days ago - Pushed at: 17 days ago - Stars: 119 - Forks: 3

bagh2178/UniGoal

[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Language: Python - Size: 30.2 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 114 - Forks: 1

2toinf/UniAct

[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"

Language: Python - Size: 220 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 111 - Forks: 7

bagh2178/SG-Nav

[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation

Language: Jupyter Notebook - Size: 76.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 111 - Forks: 11

iris0329/SeeGround

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Language: Python - Size: 97.9 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 104 - Forks: 2

UMass-Embodied-AGI/3D-Mem

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

Language: Python - Size: 275 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 102 - Forks: 5

Lumina-EAI/Embodied-AI-Paper-List

[Lumina Embodied AI Community] A paper list for Embodied AI / Robotics

Size: 317 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 97 - Forks: 2

allenai/manipulathor

ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

Language: Jupyter Notebook - Size: 113 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 86 - Forks: 13

wadeKeith/Awesome-Embodied-AI

An Introduction to Embodied Intelligence (A Quick Guide of Embodied-AI) (Updating)

Size: 2.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 85 - Forks: 6

zd11024/NaviLLM

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

Language: Python - Size: 69.3 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 83 - Forks: 6

3dlg-hcvc/hssd

Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.

Language: Python - Size: 118 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 83 - Forks: 6

LAMDASZ-ML/Awesome-LLM-Reasoning-with-NeSy

✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models

Size: 1.29 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 82 - Forks: 4

BraveGroup/SheetCopilot

We release a general framework for prompting LLMs to manipulate software in a closed-loop manner.

Language: Python - Size: 61.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 82 - Forks: 7

minnie-lin/Awesome-Physics-Cognition-based-Video-Generation

A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.

Size: 295 KB - Last synced at: 14 days ago - Pushed at: 22 days ago - Stars: 79 - Forks: 2

Xiaoming-Zhao/PointNav-VO

[ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation"

Language: Python - Size: 4.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 78 - Forks: 11

MarSaKi/NvEM

[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"

Language: C++ - Size: 2.74 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 2

ai4ce/CityWalker

[CVPR 2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Language: Python - Size: 82.8 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 75 - Forks: 5

mazpie/genrl

[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state sequences can be decoded using the decoder of the model, allowing visualization of the expected behavior, before training the agent to execute it.

Language: Python - Size: 82.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 73 - Forks: 2

WayneMao/RoboMatrix

The Official Implementation of RoboMatrix

Language: Python - Size: 6.01 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 73 - Forks: 2

RayYoh/OCRM_survey

A Survey of Embodied Learning for Object-Centric Robotic Manipulation

Size: 1.21 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 71 - Forks: 4

declare-lab/Emma-X

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Language: Python - Size: 32.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 64 - Forks: 5

rllab-snu/Visual-Graph-Memory

Official GitHub Repository for paper "Visual Graph Memory with Unsupervised Representation for Visual Navigation", ICCV 2021

Language: Python - Size: 367 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 61 - Forks: 12

H-Freax/ThinkGrasp

[CoRL2024] ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter. https://arxiv.org/abs/2407.11298

Language: Python - Size: 225 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 60 - Forks: 5

YicongHong/Discrete-Continuous-VLN

Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation

Language: Python - Size: 25.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 60 - Forks: 6

2toinf/DecisionNCE

[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

Language: Python - Size: 24.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 56 - Forks: 1

rh20624/Awesome-IMU-Sensing

A collection of datasets, papers, and resources for IMU sensing.

Size: 286 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 55 - Forks: 3

TianxingChen/G3Flow

[CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation

Language: Python - Size: 68.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 55 - Forks: 2

yyvhang/lemon_3d

Language: Python - Size: 2.19 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 54 - Forks: 2

eric-ai-lab/VLMbench

NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

Language: Python - Size: 95 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 7

MSR3D/MSR3D

[NeurIPS 2024] Official code repository for MSR3D paper

Language: Python - Size: 75.7 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 50 - Forks: 2

csiro-robotics/Uncertainty-LPR

📣 [IEEE IROS 2023] Official Repository of IROS 23 paper "Uncertainty-Aware Lidar Place Recognition in Novel Environments"

Language: Python - Size: 45.1 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 47 - Forks: 3

FudanDISC/ReForm-Eval

An benchmark for evaluating the capabilities of large vision-language models (LVLMs)

Language: Python - Size: 10 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 46 - Forks: 4

HanqingWangAI/Active_VLN

The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`

Language: Python - Size: 2.03 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 40 - Forks: 8

opendilab/OpenPaL

Building open-ended embodied agent in battle royale FPS game

Size: 12.1 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 37 - Forks: 1

hutslib/DHP-Mapping

[IROS2024]DHP-Mapping: A Dense Panoptic Mapping System with Hierarchical World Representation and Label Optimization Techniques

Language: C++ - Size: 3.46 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 33 - Forks: 0

phlippe/BISCUIT

Official code of the paper "BISCUIT: Causal Representation Learning from Binary Interactions" (UAI 2023)

Language: Python - Size: 13.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 9

ai4ce/DeepExplorer

[RSS2023] Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space

Language: Python - Size: 359 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 30 - Forks: 2

thunlp/EmbodiedAIxLLMPapers

Papers on integrating large language models with embodied AI

Size: 46.9 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 3

CEC-Agent/CEC

Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"

Language: Python - Size: 450 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 4

allenai/ai2thor-colab

🚀 Run AI2-THOR with Google Colab

Language: Jupyter Notebook - Size: 3.61 MB - Last synced at: 27 days ago - Pushed at: almost 3 years ago - Stars: 29 - Forks: 2

LoopMind-AI/loopquest

A Production Tool for Embodied AI

Language: Python - Size: 2.89 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 28 - Forks: 0

allenai/PoliFormer

PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Language: Python - Size: 550 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 27 - Forks: 0

allenai/phone2proc

📱👉🏠 Perform conditional procedural generation to generate houses like your own!

Language: Python - Size: 84 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 26 - Forks: 1

carpit680/giraffe

A cost-effective, ROS2-compatible robotic manipulator designed to lower the barriers of entry to Embodied AI.

Language: Python - Size: 41.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 25 - Forks: 5

allenai/robustnav

Evaluating pre-trained navigation agents under corruptions

Language: Python - Size: 15.6 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 25 - Forks: 3

tsinghua-fib-lab/SmartAgent

The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".

Size: 4.68 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 1

NZ-Liam-Zhong/Awesome_AI_for_Robotics_Learning_Notes

These are my learning notes about robot learning and Embodied AI[具身智能学习笔记]. If you feel it hard to learn them, please star me!

Size: 1.31 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 22 - Forks: 3

thkkk/FCNet

Fourier Controller Networks (FCNet) for Real-Time Decision-Making in Embodied Learning, ICML 2024

Language: Python - Size: 111 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 22 - Forks: 1

SgtVincent/EMOS Fork of facebookresearch/habitat-lab

The project repository for paper EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents: https://arxiv.org/abs/2410.22662

Language: Python - Size: 779 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 18 - Forks: 4

danelpeng/Awesome-Continual-Leaning-with-PTMs

This is a curated list of "Continual Learning with Pretrained Models" research.

Size: 254 KB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 0

UMass-Embodied-AGI/CHAIC

[NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge"

Language: Python - Size: 114 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 16 - Forks: 0

airs-cuhk/airsoul

Next-gen Foundation Model for Embodied AI

Language: Python - Size: 2.58 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 15 - Forks: 6

MCG-NJU/TPM

[WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation

Language: Python - Size: 2.12 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 0

habicrowd/HabiCrowd

HabiCrowd, a new dataset and benchmark for crowd-aware visual navigation that surpasses other benchmarks in terms of human diversity and computational utilization.

Size: 491 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 1

taco-group/GenAI4AD

a comprehensive and critical synthesis of the emerging role of GenAI across the autonomous driving stack

Size: 4.65 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 13 - Forks: 1

zihao-ai/EARBench

Benchmarking Physical Risk Awareness of Foundation Model-based Embodied AI Agents

Language: Python - Size: 2.02 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

Evan-wyl/robotlearning

Papers, codes, datasets, applications, tutorials.

Size: 113 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 11 - Forks: 0

amitkparekh/CoGeLoT

A comprehensive framework to explore whether embodied multimodal models are plausibly resilient

Language: Python - Size: 39.8 MB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 1

embodied-ai-workshop/embodied-ai.org

🌎 The Website for the Embodied AI Workshop at CVPR

Language: TypeScript - Size: 648 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 8 - Forks: 3

OpenXRIF/synapse

Robot VLM and VLA (Vision-Language-Action) inference API helping you manage multimodal prompts, RAG, and location metadata

Language: Rust - Size: 218 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

michaelyuancb/egomono4d

Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"

Language: Python - Size: 6.19 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 0

intelligolabs/R2RIE-CE

Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We present the first dataset - R2R-IE-CE - to benchmark instructions errors in VLN. We then propose a method, IEDL.

Language: Python - Size: 25.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 1

joeyy5588/planning-as-inpainting

Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty

Language: Python - Size: 59.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

aminabyaneh/stable-imitation-policy

Learning globally stable dynamical systems policies through imitation

Language: Python - Size: 27 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

Ayush8120/COAT

A CommonSense Reasoning Dataset pertaining to Physical Commonsense affordance of objects.

Size: 5.14 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 7 - Forks: 3

joeyy5588/LACMA

LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following

Language: C - Size: 833 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

keskival/embodied-emulated-personas

A project space for Embodied Emulated Personas - Embodied neural networks trained by LLM chatbot teachers

Language: Python - Size: 18.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

WILLOSCAR/Awesome-HCI-LLM

Awesome-HCI (Ubiquitous, LLM, MLLM, Agent, RAG, Embodied-AI)

Size: 14.6 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 0

Related Topics
robotics 49 reinforcement-learning 20 large-language-models 16 embodied-agent 14 computer-vision 14 llm 12 deep-learning 12 robot-learning 10 imitation-learning 10 foundation-models 9 awesome 8 visual-navigation 8 multimodal 8 artificial-intelligence 8 agent 6 navigation 6 ai 6 machine-learning 6 vlm 6 robot-manipulation 5 vla 5 embodied 5 pytorch 5 ros2 5 world-models 5 embodied-artificial-intelligence 5 vision-and-language-navigation 4 vision-and-language 4 large-language-model 4 awesome-list 4 ai2-thor 4 generative-ai 4 video-generation 4 benchmark 4 llms 4 ros 4 vision-language-model 4 survey 4 robotics-simulation 3 self-supervised-learning 3 in-context-learning 3 image-goal-navigation 3 habitat-sim 3 rust 3 robotic-manipulation 3 vision-language-navigation 3 manipulation 3 spatial-intelligence 3 simulation 3 robot 3 cvpr2025 3 autonomous-driving 3 vision-language 3 foundation-model 3 robotic-arm 2 metric-semantic 2 imu 2 mllm 2 object-manipulation 2 perception 2 motion-planning 2 representation-learning 2 visual-reasoning 2 object-goal-navigation 2 matterport3d 2 3d-scene-graph 2 llmops 2 reinforcement-learning-algorithms 2 gym 2 planning-algorithms 2 learning-from-demonstration 2 large-multimodal-models 2 virtual-worlds 2 3d-visual-grounding 2 simulator 2 multi-agent-reinforcement-learning 2 planning 2 embodied-intelligence 2 objectnav 2 ai2thor-environment 2 agents 2 gpt-4 2 chatgpt 2 instruction-tuning 2 instruction-following 2 human-computer-interaction 2 deep-reinforcement-learning 2 multi-agent-systems 2 embodied-navigation 2 dataset 2 sensor 2 camera 2 logitech 2 reasoning 2 noise-cancellation 2 ptz 2 speech-recognition 2 speech-to-text 2 4d-reconstruction 2 hoi 2