An open API service providing repository metadata for many open source software ecosystems.

Topic: "multi-modal-llm"

liyiheng23/UniPose

[CVPR 2025] UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Language: Python - Size: 16.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

hemangjoshi37a/AIComputerInteractionLogger

Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.

Language: Python - Size: 356 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1