Gyutan

mavenized Gyutan (text -> label -> sasakama -> speech)

Install

clone this project

 $ cd $SRC_BASE
 $ git clone https://github.com/umjammer/Gyutan

download HTS voice

 $ cd $RESOURCE_BASE
 $ git clone https://github.com/icn-lab/htsvoice-tohoku-f01.git

set "fn.voice" in local.properties

 $ cd $SRC_BASE/Gyutan
 $ vi local.properties
 $ cat local.properties
 fn.voice=/Users/foo/src/resources/htsvoice-tohoku-f01/tohoku-f01-neutral.htsvoice

set "sen.home" in local.properties

 $ cd $SRC_BASE/Gyutan
 $ vi local.properties
 $ cat local.properties
 fn.voice=/Users/foo/src/resources/htsvoice-tohoku-f01/tohoku-f01-neutral.htsvoice
 sen.home=/Users/foo/src/java/Gyutan/sen
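
Before running the build, it can help to confirm that both keys are set and point at existing paths. The following is a minimal sketch, not part of Gyutan: the function names `prop_value` and `check_local_properties` are hypothetical, and the parser assumes plain `key=value` lines only (no line continuations, `:` separators, or escapes).

```shell
#!/bin/sh
# Hypothetical helpers for sanity-checking local.properties (not part of Gyutan).

# Print the value of a key from a properties file.
# Only simple "key=value" lines are handled; the last matching line wins.
prop_value() {
    # $1 = properties file, $2 = key
    # Escape dots so that "fn.voice" is matched literally by sed.
    key_re=$(printf '%s' "$2" | sed 's/\./\\./g')
    sed -n "s/^[[:space:]]*${key_re}[[:space:]]*=[[:space:]]*//p" "$1" | tail -n 1
}

# Check that fn.voice and sen.home are set and that their paths exist.
# Returns 0 when both are fine, 1 otherwise.
check_local_properties() {
    file=${1:-local.properties}
    rc=0
    for key in fn.voice sen.home; do
        value=$(prop_value "$file" "$key")
        if [ -z "$value" ]; then
            echo "missing: $key"
            rc=1
        elif [ ! -e "$value" ]; then
            echo "not found: $key=$value"
            rc=1
        else
            echo "ok: $key=$value"
        fi
    done
    return $rc
}
```

After sourcing these functions, `check_local_properties local.properties` can be run from `$SRC_BASE/Gyutan`; a non-zero exit status means a key is missing or its path does not exist.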

Usage

test

 $ mvn test

run

 $ mvn -P run antrun:run -Dtext="src/test/resources/test.txt" -Dwave="tmp/out.wav"

Reference

HTS

TODO

make njd a user-friendly library
~~mora_size == 0 -> no pronounce -> delete~~
~~github ci needs to download and build dictionary~~

Original

Copyright (c) 2015-2016 Intelligent Communication Network (Ito-Nose) Laboratory Tohoku University.
Copyright (c) 2001-2016 Nagoya Institute of Technology Department of Computer Science.
All rights reserved.

Gyutan is a port of "Open JTalk", developed by the HTS Working Group, to another programming language: it is Japanese text-to-speech software implemented in Java. Thanks to the HTS Working Group, speech synthesis has become easier to use.

How to synthesize

Gyutan needs Sasakama (Sasakama.jar). See also the Sasakama repository.

To synthesize speech from Japanese text, you must prepare an HMM speech model (an HTS voice).

You can get an HTS voice from the sources below.

from our repository : https://github.com/icn-lab/htsvoice-tohoku-f01
from HTS Working Group : http://open-jtalk.sourceforge.net/
from MMDAgent Sample : http://www.mmdagent.jp/

A Japanese morphological analyzer is needed to analyze Japanese text, and an accent dictionary is needed to determine the accent type of the synthesized speech. This software uses Sen, a Japanese morphological analyzer implemented in Java, with naist-jdic (ChaSen format) as its dictionary. Because accent information is needed to synthesize speech, it is extracted from naist-jdic (MeCab format) in the Open JTalk package (1.08) and added to naist-jdic (ChaSen format).

How to use

See bin/gyutan.sh (or bin/gyutan.bat on Windows). You need to define shell variables for each JAR and for the dictionary.

Link

"Open JTalk" developed by HTS Working Group : http://open-jtalk.sourceforge.net
Sen, a Japanese morphological analyzer implemented in Java : https://java.net/projects/sen
naist-jdic (for chasen) : https://osdn.jp/projects/naist-jdic/

What is Gyutan?

Gyutan is beef tongue, a famous dish in Sendai, Japan (where our laboratory is located).
