An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: spatial-reasoning

damianomarsili/VADAR

Program synthesis for 3D spatial reasoning

Language: Jupyter Notebook - Size: 6.19 MB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 36 - Forks: 2

remyxai/VQASynth

Compose multimodal datasets 🎹

Language: Python - Size: 17.5 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 403 - Forks: 17

Zhoues/RoboRefer

Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"

Size: 5.7 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

haoningwu3639/SpatialScore

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

Language: Python - Size: 7.59 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 41 - Forks: 0

Pradeep9167/Spatial-MLLM

Spatial-MLLM enhances multi-language learning models by integrating visual-based spatial intelligence. This project aims to improve understanding and processing of spatial data, making it a valuable resource for researchers and developers. 🌍🚀

Language: Python - Size: 18.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

LaVi-Lab/VG-LLM

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Language: Jupyter Notebook - Size: 30.9 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 8 - Forks: 0

SafeRL-Lab/m4r

Measuring Massive Multi-Modal Understanding and Reasoning in Open Space

Language: Python - Size: 39.6 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

ai4ce/SPARE3D

[CVPR2020] A Dataset for SPAtial REasoning on Three-View Line Drawings

Language: Python - Size: 8.25 MB - Last synced at: 14 days ago - Pushed at: 11 months ago - Stars: 53 - Forks: 9

jiayuww/SpatialEval

[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs

Language: Python - Size: 3.95 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 23 - Forks: 0

ShiZhengyan/StepGame

[AAAI 2022] Dataset and pytorch codes for the paper titled "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" in AAAI 2022 (Oral)

Language: Python - Size: 357 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 6

spatial-comfort/spatial-comfort.github.io

Official website for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities"

Language: JavaScript - Size: 9.95 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

AnjieCheng/SpatialRGPT

[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"

Language: Python - Size: 7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 74 - Forks: 5

ai4ce/Self-Supervised-SPARE3D

[CVPR 2022] Self-supervised Spatial Reasoning on Multi-View Line Drawings

Language: Python - Size: 2.06 MB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 25 - Forks: 1

andrewliao11/Q-Spatial-Bench-code

Official repo of the paper "Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models"

Language: Python - Size: 184 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

sled-group/COMFORT

Repo for the paper "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities"

Language: Python - Size: 33.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 0

altsoph/PLUGH

This is a supplementary code for the paper "PLUGH: A Benchmark for Spatial Understanding and Reasoning in Large Language Models."

Language: Python - Size: 7.45 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ShiZhengyan/LearnToAsk

[NAACL 2022] Dataset and codes for the paper titled "Learning to Execute Actions or Ask Clarification Questions" in Findings of NAACL 2022

Language: Python - Size: 284 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 11

lawl2/object-detection-and-spatial-relation

Language: Python - Size: 3.17 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

alreich/qualreas

Qualitative Reasoning: Spatio-Temporal Reasoning using Relation Algebras and Constraint Networks. Documentation is under construction at ReadTheDocs. See link below.

Language: Jupyter Notebook - Size: 43.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 22 - Forks: 3

lambdamikel/DLMAPS

DLMAPS = Description Logic Maps: Ontology-Based Spatial Queries to Digital City Maps

Language: Common Lisp - Size: 38.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

lambdamikel/Common-Lisp-Tangram-Solver

A Tangram Puzzle Solver in Common Lisp that is capable of solving arbitrary geometric tiling problems. CLIM (Common Lisp Interface Manager) is used for its GUI.

Language: Common Lisp - Size: 40.2 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 25 - Forks: 6

juletx/spatial-reasoning

Grounding Language Models for Compositional and Spatial Reasoning

Language: Jupyter Notebook - Size: 297 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 0

mfcecilia/cs2104_intro-to-problem-solving

Intro to Problem Solving -- notes

Size: 8.21 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Keywords
spatial-reasoning 23 reasoning 6 dataset 5 benchmark 4 vision-language-models 4 vision-language-model 4 deep-learning 3 multimodal-deep-learning 3 gpt-4v 2 llama3 2 llm 2 machine-learning 2 multimodal 2 vision-and-language 2 lisp 2 multi-modal 2 line-drawings 2 temporal-reasoning 2 line-drawing 2 nlp 2 clim 2 common-lisp 2 natural-language-processing 2 claude 2 gpt-4o 2 spatial-queries 1 common-lisp-interface-manager 1 sparql 1 commonlisp 1 geometric-algorithms 1 spatial-query 1 query-language 1 query-answering 1 geometric-reasoning 1 lispworks 1 tangram 1 tangram-play 1 visual-analogies 1 owl 1 ontology-based 1 ontology 1 inference 1 geosparql 1 geographical-information-system 1 geographic-information-retrieval 1 geographic-data 1 framework 1 description-logics 1 temporal-reasoning-network 1 spatio-temporal 1 singleton-labellings 1 singleton-labelling 1 singleton-labelings 1 relation-algebras 1 verbal-reasoning 1 verbal-analogies 1 types 1 testing 1 recursive-algorithms 1 propositional-logic 1 problem-solving 1 number-theory 1 lateral-thinking 1 inheritance 1 heuristics 1 externalization 1 debugging 1 classes 1 blackbox 1 association 1 algorithms 1 aggregation 1 winoground 1 vsr 1 visual-spatial-reasoning 1 image-retrieval 1 image-captioning 1 grounding 1 computer-vision 1 caption-retrieval 1 tilings 1 tiling-problem 1 tangram-solver 1 tangram-puzzle-solver 1 tangram-puzzle 1 conversational-ai 1 clarification-questions 1 text-based-game 1 pathfinding 1 interactive-fiction 1 graph-reconstruction 1 contrastive-learning 1 aaai2022 1 large-language-models 1 gemini 1 foundation-models 1 pythonocc 1 open-space 1 intent-reasoning 1 mllm 1