Play Mingchao with actions and voice.
It is a good idea. More importantly, it is great to make the idea work.
Key | Pose |
---|---|
A(left) | 单元格 |
W(forward) | 单元格 |
S(back) | 单元格 |
D(right) | 单元格 |
T | 单元格 |
Space(jump) | 单元格 |
Mouse Left One Click(attack) | 单元格 |
Mouse Right One Click(escape) | 单元格 |
Mouse Move Left(turn on left) | 单元格 |
Mouse Move Right(turn on right) | 单元格 |
- 走路向前
- 小幅度左转向走路
- 大幅度左转向走路
- 加速跑:手臂斜45°向前(按住鼠标右键3秒)
- 小幅度右转走路
- 大幅度右转走路
- 面向前,向左平移
- 面向前,向右平移
- 后退:身体板正后倾斜45°
- 闪避:身体左后倾 或 身体右后倾
- 普通攻击:打拳
- E技能:两手掌合并为中空三角形
- R大招:基纽特战队队长登场姿势
- 空格:跳跃
-
Use yolov8 to detect and classify human poses.
# command line to yolo pose predict model=D:/digital_human/MingchaoPlayer/models/yolo_models/yolov8n-pose.pt source='https://ultralytics.com/images/bus.jpg'
动作区别越大,识别准确率越高;闪避可以用双臂交叉挡住脸。 识别不准的动作有:跳跃、向左平移、向右平移
-
Use Damo, OP or PyAutoGUI to operate mouse and keyboard.
-
Use Whisper to convert the player's real-time audio into text.
-
Text splitting and phrases mapping to commands.
- yolov8分类器
ffmpeg进行视频切帧,结果是一帧帧的图片
ffmpeg -i VID20240528093613.mp4 -f image2 ./source/output_%03d.png
- https://github.com/CMU-Perceptual-Computing-Lab/openpose
- https://www.cnblogs.com/yuyingblogs/p/16177798.html
- https://zh.d2l.ai/chapter_convolutional-neural-networks/index.html
- https://learn.microsoft.com/zh-cn/windows/ai/windows-ml/tutorials/pytorch-train-model
- https://github.com/mmakos/HPC
- https://ai.google.dev/edge/mediapipe/solutions/vision/pose_landmarker/python?hl=zh-cn
- ASR System(Whisper): https://blog.csdn.net/lsb2002/article/details/131056566