一些背景:在尝试建立单位选择声音时,我按照以下步骤操作:https ://github.com/CSTR-Edinburgh/CSTR-Edinburgh.github.io/blob/master/_posts/2016-8-21-Multisyn_unit_selection .md并使用此处的语音定义:https ://raw.githubusercontent.com/CSTR-Edinburgh/merlin/master/egs/hybrid_synthesis/s1/voice_definition_files/unit_selection/cstr_us_awb_arctic_multisyn.scm 。不幸的是,wav 太吵了,所以我最终手动标记它们并跳过自动标记过程。
声音现在还可以,但仍需要一些工作。经常发生的一个错误是节日报告“缺少双音素”以暂停到电话转换,例如:
festival> (utt.relation.print (SayText "I can say anything I want.") 'Unit)
Missing diphone: #_ay
diphone still missing, backing off: #_ay
backed off: #_ay -> #_ax
diphone still missing, backing off: #_ax
backed off: #_ay -> #_#
diphone still missing, backing off: #_#
backed off: #_ay ->
Missing diphone: ey_eh
Interword so inserting silence.
diphone still missing, backing off: ey_#
backed off: ey_eh -> ax_#
diphone still missing, backing off: ax_#
backed off: ey_eh -> #_#
diphone still missing, backing off: #_#
backed off: ey_eh ->
Missing diphone: #_eh
diphone still missing, backing off: #_eh
backed off: #_eh -> #_ax
diphone still missing, backing off: #_ax
backed off: #_eh -> #_#
diphone still missing, backing off: #_#
backed off: #_eh ->
Missing diphone: t_#
diphone still missing, backing off: t_#
backed off: t_# -> #_#
diphone still missing, backing off: #_#
backed off: t_# ->
我尝试用and替换标签中的sil
and sp
(来自自动过程)(为了与festival/lib/radio_phones.scm中使用的静音相对应),我也尝试用just替换它们,但这并没有改变任何东西。源 wav/labs 肯定包含上面的转换(例如,有几个以“我可以”开头),但节日似乎从未使用这些。pau
h#
#
我怎样才能让节日在源数据中使用暂停到电话转换?
谢谢!