[CVPR 2025] UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
-
Updated
Mar 10, 2025 - Python
[CVPR 2025] UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.
Add a description, image, and links to the multi-modal-llm topic page so that developers can more easily learn about it.
To associate your repository with the multi-modal-llm topic, visit your repo's landing page and select "manage topics."