Topic: "multi-modal-llm"
liyiheng23/UniPose
[CVPR 2025] UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Language: Python - Size: 16.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

hemangjoshi37a/AIComputerInteractionLogger
Python tool for capturing and logging human-computer interactions. Generates rich datasets for training multi-modal LLMs to control computers autonomously. Records screenshots, mouse and keyboard input, and audio.
Language: Python - Size: 356 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1
