Add Japanese voices to KokoroTTS synthesis #116
entermask wants to merge 1 commit into echogarden-project:main from
|
Thanks, but the implementation of Kokoro used in Echogarden relies on eSpeak-NG to convert raw text to the phonemic characters (IPA or IPA-like) that the model requires, and eSpeak-NG isn't able to phonemize Japanese properly: it replaces some characters with the words "Chinese letter". This problem occurs everywhere eSpeak-NG is used, including alignment (DTW-based) and other forms of synthesis. There are multiple past issues about this. The Japanese voices could technically be added (will likely require updating the voices packages, which is currently
|
Did you try the latest version of eSpeak-NG? I saw that it lists Japanese as supported now: https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md
|
Yes, they list Japanese, with an annotation. If you go to the bottom of the documentation page:
I'm not familiar with Japanese (I know very little about the language), but I think the problem may be with Kanji characters.
Some characters are read aloud as "Chinese letter".
Ex: もしおれがただ偶然、そしてこうしようというつもりでなくここに立っているのなら、ちょっとばかり絶望するところだな」と、そんなことが彼の頭に思い浮かんだ。
I’m not sure if it’s the base model or the JS.
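One way to test the Kanji hypothesis raised above is to check which characters of an input string belong to the Han script before the text reaches the phonemizer. The following is a minimal sketch, assuming only standard ECMAScript Unicode property escapes. The helper names `findHanCharacters` and `countByScript` are hypothetical and are not part of Echogarden, Kokoro, or eSpeak-NG.

```typescript
// Hypothetical helpers (not Echogarden or eSpeak-NG APIs).
// They classify characters by Unicode script so that Kanji (Han) characters,
// the suspected cause of the "Chinese letter" output, can be spotted.

// Built with the RegExp constructor; the "u" flag enables \p{...} escapes.
const HAN = new RegExp("\\p{Script=Han}", "gu");
const HIRAGANA = new RegExp("\\p{Script=Hiragana}", "gu");
const KATAKANA = new RegExp("\\p{Script=Katakana}", "gu");

// Returns every match of a global pattern, or an empty array if none.
function matchesOf(text: string, pattern: RegExp): string[] {
  return text.match(pattern) || [];
}

// Lists the Han (Kanji) characters in the text, in order of appearance.
function findHanCharacters(text: string): string[] {
  return matchesOf(text, HAN);
}

// Counts how many characters fall into each of the three scripts.
// Punctuation and other symbols are not counted.
function countByScript(text: string): { han: number; hiragana: number; katakana: number } {
  return {
    han: matchesOf(text, HAN).length,
    hiragana: matchesOf(text, HIRAGANA).length,
    katakana: matchesOf(text, KATAKANA).length,
  };
}

// Usage with the opening of the example sentence from the thread.
const sample = "もしおれがただ偶然、そしてこうしようというつもりでなくここに立っているのなら";
console.log(findHanCharacters(sample).join(" "));
console.log(countByScript(sample));
```

If the characters reported here line up with the places where "Chinese letter" is heard, that would point to the phonemization step rather than the model or the JavaScript wrapper. This only identifies the characters; it does not convert Kanji to readings.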