forked from icn-lab/Gyutan

🐮 OpenJTalk Java Text to Speech engine delegatee


Gyutan


A Mavenized build of Gyutan. Processing pipeline: text -> label -> Sasakama -> speech.

Install

  • clone this project
 $ cd $SRC_BASE
 $ git clone https://github.com/umjammer/Gyutan
  • download an HTS voice
 $ cd $RESOURCE_BASE
 $ git clone https://github.com/icn-lab/htsvoice-tohoku-f01.git
  • set "fn.voice" (path to the HTS voice file) and "sen.home" (path to the Sen home directory) in local.properties
 $ cd $SRC_BASE/Gyutan
 $ vi local.properties
 $ cat local.properties
 fn.voice=/Users/foo/src/resources/htsvoice-tohoku-f01/tohoku-f01-neutral.htsvoice
 sen.home=/Users/foo/src/java/Gyutan/sen
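
A missing or misspelled key in local.properties is easy to overlook, so it may help to check the file before running the build. The following is a minimal sketch and not part of Gyutan: the class `LocalPropertiesCheck` and its methods are hypothetical names, and it assumes only that both "fn.voice" and "sen.home" must be present and should point at existing paths.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper (not part of Gyutan): sanity-checks local.properties.
class LocalPropertiesCheck {

    // Keys the Install section asks you to set.
    static final String[] REQUIRED_KEYS = {"fn.voice", "sen.home"};

    /** Parses properties text, e.g. the contents of local.properties. */
    static Properties parse(String text) {
        Properties props = new Properties();
        try (Reader reader = new StringReader(text)) {
            props.load(reader);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    /** Returns the required keys that are absent or blank. */
    static List<String> missingKeys(Properties props) {
        List<String> missing = new ArrayList<>();
        for (String key : REQUIRED_KEYS) {
            String value = props.getProperty(key);
            if (value == null || value.trim().isEmpty()) {
                missing.add(key);
            }
        }
        return missing;
    }

    public static void main(String[] args) throws IOException {
        // Defaults to local.properties in the current directory.
        Path file = Paths.get(args.length > 0 ? args[0] : "local.properties");
        String text = new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
        Properties props = parse(text);

        for (String key : missingKeys(props)) {
            System.out.println("missing: " + key);
        }
        // Report configured paths that do not exist on disk.
        for (String key : REQUIRED_KEYS) {
            String value = props.getProperty(key);
            if (value != null && !Files.exists(Paths.get(value.trim()))) {
                System.out.println("not found: " + key + "=" + value);
            }
        }
    }
}
```

Compiled on its own with `javac`, the class can be run from `$SRC_BASE/Gyutan` as `java LocalPropertiesCheck local.properties`. It prints nothing when both keys are set and both paths exist.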

Usage

test

 $ mvn test

run

 $ mvn -P run antrun:run -Dtext="src/test/resources/test.txt" -Dwave="tmp/out.wav"

Reference

TODO

  • make njd a user-friendly library
  • mora_size == 0 -> no pronunciation -> delete
  • GitHub CI needs to download and build the dictionary
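
One possible reading of the "mora_size == 0" item is that nodes with no morae have nothing to pronounce and can be dropped before label generation. The sketch below illustrates only that reading. The `Node` class is a hypothetical stand-in with two fields, not Gyutan's actual NJD node type, and `dropUnpronounced` is an assumed name.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch (not Gyutan's API): drop nodes that have no morae.
class MoraFilterSketch {

    // Simplified stand-in for an NJD node; the real node carries more fields.
    static final class Node {
        final String surface;
        final int moraSize;

        Node(String surface, int moraSize) {
            this.surface = surface;
            this.moraSize = moraSize;
        }
    }

    /** Returns a new list containing only nodes with moraSize != 0, in order. */
    static List<Node> dropUnpronounced(List<Node> nodes) {
        List<Node> kept = new ArrayList<>();
        for (Node node : nodes) {
            if (node.moraSize != 0) {
                kept.add(node);
            }
        }
        return kept;
    }
}
```

Returning a new list instead of mutating the input keeps the original node sequence available for debugging. Whether the real fix should delete nodes in place is left open here.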

Copyright (c) 2015-2016 Intelligent Communication Network (Ito-Nose) Laboratory Tohoku University.
Copyright (c) 2001-2016 Nagoya Institute of Technology Department of Computer Science.
All rights reserved.

Gyutan is a Java port of "Open JTalk", which is developed by the HTS Working Group. It is Japanese text-to-speech software implemented in Java. Thanks to the HTS Working Group, speech synthesis has become easier to use.

How to synthesize

Gyutan requires Sasakama (Sasakama.jar). See also the Sasakama repository.

To synthesize speech from Japanese text, you must prepare an HMM speech model (HTS voice).

You can get an HTS voice from the repository listed in the Install section (https://github.com/icn-lab/htsvoice-tohoku-f01).

A Japanese morphological analyzer is needed to analyze Japanese text, and an accent dictionary is needed to determine the accent type of the synthesized speech. This software uses Sen, a Japanese morphological analyzer implemented in Java, with naist-jdic (ChaSen format) as its dictionary. The accent information is extracted from naist-jdic (MeCab format) in the Open JTalk package (1.08) and added to naist-jdic (ChaSen format).

How to use

See bin/gyutan.sh (or bin/gyutan.bat on Windows). You need to define shell variables for each JAR and for the dictionary.

Link

What is Gyutan?

Gyutan is beef tongue (cow's tongue), a famous dish in Sendai, Japan, where our laboratory is located.