Text To Speech to Facial BlendShapes #4428

GeorgeS2019 · 2023-05-18T11:46:17Z

MediaPipe Solution (you are using)

Part: 2 => Face Blendshape: May 2023 ->?
Part: 1 => Done: ARKit 52 blendshapes support request. June 2022 to April 2023 Completed

Programming language

c#

Are you willing to contribute it

Yes:

using @srcnalt Ready Player Me Avatar
RPM-Face-Tracing in Godot
using @kaiidams TextToSpeech: Voice100Sharp
using @SpookyCorgi mediapipe motion capture
using @virtual-puppet-project speech to avatar mouth movements Virtual Puppet Project

Describe the feature and the current behaviour/state

From the Modelling part using Godot
https://github.com/srcnalt/ReadyPlayerMe-Godot-Test/issues/1#issue-1713856035

Will this change the current API? How?

YES, additional non-conflicting API to the existing current API

Who will benefit with this feature?

Anyone who use MediaPipe BlendShape. It is NEXT STEP to Deep AI (Integrating Deep Audio to MediaPipe)

Please specify the use cases for this feature

User use ChatGPT or something similar to generate replies and this new feature translate the replies to speech with corresponding Avatar Blendshapes manipulation

Any Other info

No response

GeorgeS2019 · 2023-05-18T12:35:17Z

How the API looks Like ?

Given a ChatGPT or something similar from Google reply in text, the API will receive this string and output

the corresponding facial blendshapes as Time coordinated list of Dictionary[ blendshapeName, blendshapeValueFloat]
Voice (mp3 or WAV) that aligns with the blendshapeValues

endink · 2023-05-18T15:47:39Z

I have done this feature in Unreal Engine, it is easy to implement It use PaddleLite + OvrLipSync .😄

GeorgeS2019 · 2023-05-18T15:56:19Z

@endink
This is just Part 2 of many parts ahead :-)

FishWoWater · 2023-05-19T06:22:15Z

Agreed! It would be really exciting if blendshapes could be estimated and aligned with input audio clip.

I am currently working on a pipeline: user voice->speech recognition->chatgpt->text to speech->blendshapes. There exist many mature solutions except for the last stage (speech2blendshapes). Lipsync and face good can possibly do this, but have their limitations or problems. This feature will benefit the mediapipe community.

ayushgdev · 2023-05-22T07:48:54Z

Hello @GeorgeS2019 Thanks for raising this amazing feature request. We will discuss it internally and prioritise it in our roadmap. However, just a heads up, we are working in numerous fronts as of now hence this might get delayed.

GeorgeS2019 · 2023-05-27T15:27:48Z

Now working, the BlendShape part in 8th Top Ranked Github Open source 3D game engine: Godot
@srcnalt
@kaiidams
@SpookyCorgi
@you-win
@j20001970

kuaashish · 2023-06-06T10:24:40Z

Hello @lu-wang-g,
Could you please look into this amazing feature request? Thank you!!

lu-wang-g · 2023-06-13T06:47:24Z

At I/O 2023, Google released the demo app, Talking Character (https://developers.googleblog.com/2023/05/generative-ai-talking-character.html), which IIUC fits exactly the use case described here. The Web demo is partially open sourced here. You can find useful pieces of components in the directory. There has also been a discussion of releasing the talking character pipeline through MediaPipe, but we don't have concrete plan yet.

@ayushgdev and @kuaashish, do we have ways to track user requests like this?

tiamy · 2023-09-20T06:59:15Z

+1

GeorgeS2019 · 2024-04-29T00:44:22Z

We now have C# wrapper of Godot Mediapipe

GeorgeS2019 · 2024-05-01T05:02:38Z

The Godot community will attempt Text to Face => follow here

GeorgeS2019 added the type:feature Enhancement in the New Functionality or Request for a New Solution label May 18, 2023

google-ml-butler bot assigned ayushgdev May 18, 2023

This was referenced May 18, 2023

Integrating this project with Google Mediapipe kaiidams/Voice100Sharp#32

Closed

ARKit 52 blendshapes support request #3421

Closed

ayushgdev added legacy:face mesh Issues related to Face Mesh platform:unity MediaPipe Unity issues labels May 22, 2023

ayushgdev added the stat:awaiting response Waiting for user response label May 22, 2023

google-ml-butler bot removed the stat:awaiting response Waiting for user response label May 27, 2023

kuaashish assigned lu-wang-g and unassigned ayushgdev Jun 6, 2023

kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label Jun 6, 2023

kuaashish assigned yichunk and unassigned lu-wang-g Jan 8, 2024

GeorgeS2019 mentioned this issue May 1, 2024

Use of preprocessor directives in function-like macros is undefined #5366

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text To Speech to Facial BlendShapes #4428

Text To Speech to Facial BlendShapes #4428

GeorgeS2019 commented May 18, 2023 •

edited

Loading

GeorgeS2019 commented May 18, 2023 •

edited

Loading

endink commented May 18, 2023

GeorgeS2019 commented May 18, 2023

FishWoWater commented May 19, 2023 •

edited

Loading

ayushgdev commented May 22, 2023

GeorgeS2019 commented May 27, 2023

kuaashish commented Jun 6, 2023

lu-wang-g commented Jun 13, 2023

tiamy commented Sep 20, 2023

GeorgeS2019 commented Apr 29, 2024

GeorgeS2019 commented May 1, 2024

Text To Speech to Facial BlendShapes #4428

Text To Speech to Facial BlendShapes #4428

Comments

GeorgeS2019 commented May 18, 2023 • edited Loading

MediaPipe Solution (you are using)

Programming language

Are you willing to contribute it

Describe the feature and the current behaviour/state

Will this change the current API? How?

Who will benefit with this feature?

Please specify the use cases for this feature

Any Other info

GeorgeS2019 commented May 18, 2023 • edited Loading

How the API looks Like ?

endink commented May 18, 2023

GeorgeS2019 commented May 18, 2023

FishWoWater commented May 19, 2023 • edited Loading

ayushgdev commented May 22, 2023

GeorgeS2019 commented May 27, 2023

kuaashish commented Jun 6, 2023

lu-wang-g commented Jun 13, 2023

tiamy commented Sep 20, 2023

GeorgeS2019 commented Apr 29, 2024

GeorgeS2019 commented May 1, 2024

GeorgeS2019 commented May 18, 2023 •

edited

Loading

GeorgeS2019 commented May 18, 2023 •

edited

Loading

FishWoWater commented May 19, 2023 •

edited

Loading