open_jtalk(1) General Commands Manual open_jtalk(1)
NAME
open_jtalk -- Japanese TTS system
SYNOPSIS
open_jtalk [options] [infile]
DESCRIPTION
This manual page documents briefly the open_jtalk command.
This manual page was written for the Debian distribution because the original program does not have a manual page. Instead, it has docu-
mentation in the GNU Info format; see below.
open_jtalk is a program that synthesize speech waveform from Japanese texts. It uses HMMs trained by the HMM-based speech synthesis system
(HTS).
OPTIONS
A summary of options is included below.
-x dir dictionary directory
-td tree decision tree files for state duration
-tm tree Show version of program.
-tf tree decision tree files for Log F0
-tl tree decision tree files for low-pass filter
-md pdf model files for state duration
-mm pdf model files for spectrum
-mf pdf model files for Log F0
-ml pdf model files for low-pass filter
-dm win window files for calculation delta of spectrum
-df win window files for calculation delta of Log F0
-dl win window files for calculation delta of low-pass filter
-ow s filename of output wav audio (generated speech)
-ot s filename of output trace information
-s i sampling frequency [16000][1--48000]
-p i frame period (point) [80][1--]
-a f all-pass constant [0.42][0.0--1.0]
-g i gamma = -1 / i (if i=0 then gamma=0) [0][0--]
-b f postfiltering coefficient [0.0][-0.8--8.0]
-l regard input as log gain and output linear one (LSP)
-u f voiced/unvoiced threshold[0.5][0.0--1.0]
-em tree decision tree files for GV of spectrum
-ef tree decision tree files for GV of Log F0
-el tree decision tree files for GV of low-pass filter
-cm pdf filenames of GV for spectrum
-cf pdf filenames of GV for Log F0
-cl pdf filenames of GV for low-pass filter
-jm f weight of GV for spectrum [1.0][0.0--2.0]
-jf f weight of GV for Log F0 [1.0][0.0--2.0]
-jl f weight of GV for low-pass filter [1.0][0.0--2.0]
-k tree GV switch
-z i audio buffer size [1600][0--48000]
infile text file
option '-d' may be repeated to use multiple delta parameters. generated spectrum, log F0, and low-pass filter coefficient sequences are
saved in natural endian, binary (float) format.
EXAMPLE
If you installed hts-voice-nitech-jp-atr503-m001 in the current directory, the following command let you make a voice file from input.txt:
% open_jtalk -s 48000 -p 240 -a 0.55
-td tree-dur.inf -tm tree-mgc.inf -tf tree-lf0.inf
-tl tree-lpf.inf -md dur.pdf -mm mgc.pdf
-mf lf0.pdf -ml lpf.pdf -dm mgc.win1
-dm mgc.win2 -dm mgc.win3 -df lf0.win1
-df lf0.win2 -df lf0.win3 -dl lpf.win1
-em tree-gv-mgc.inf -ef tree-gv-lf0.inf -cm gv-mgc.pdf
-cf gv-lf0.pdf -k gv-switch.inf -ow output.wav
-x dic_dir input.txt
AUTHOR
This manual page was written by Koichi Akabe vbkaisetsu@gmail.com for the Debian system (and may be used by others). Permission is granted
to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 any later version published by
the Free Software Foundation.
On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL.
open_jtalk(1)