Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
audio		audio
camera		camera
classifiers		classifiers
communication		communication
config		config
constants		constants
images		images
models		models
runs		runs
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bus.jpg		bus.jpg
data.yaml		data.yaml
main.py		main.py
requirements.txt		requirements.txt

Repository files navigation

MingchaoPlayer

Play Mingchao with actions and voice.

Author's saying

It is a good idea. More importantly, it is great to make the idea work.

Pose2action

Key	Pose
A(left)	单元格
W(forward)	单元格
S(back)	单元格
D(right)	单元格
T	单元格
Space(jump)	单元格
Mouse Left One Click(attack)	单元格
Mouse Right One Click(escape)	单元格
Mouse Move Left(turn on left)	单元格
Mouse Move Right(turn on right)	单元格

走路向前
小幅度左转向走路
大幅度左转向走路
加速跑：手臂斜45°向前（按住鼠标右键3秒）
小幅度右转走路
大幅度右转走路
面向前，向左平移
面向前，向右平移
后退：身体板正后倾斜45°
闪避：身体左后倾或身体右后倾
普通攻击：打拳
E技能：两手掌合并为中空三角形
R大招：基纽特战队队长登场姿势
空格：跳跃

Technical Planning

Use yolov8 to detect and classify human poses.
```
# command line to 
yolo pose predict model=D:/digital_human/MingchaoPlayer/models/yolo_models/yolov8n-pose.pt source='https://ultralytics.com/images/bus.jpg'
```
动作区别越大，识别准确率越高；闪避可以用双臂交叉挡住脸。识别不准的动作有：跳跃、向左平移、向右平移
Use Damo, OP or PyAutoGUI to operate mouse and keyboard.
Use Whisper to convert the player's real-time audio into text.
Text splitting and phrases mapping to commands.

Implementation Detail

yolov8分类器 ffmpeg进行视频切帧，结果是一帧帧的图片
```
ffmpeg -i VID20240528093613.mp4 -f image2 ./source/output_%03d.png
```

References

About

Play Mingchao with actions and voice.

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%