1

一些背景:在尝试建立单位选择声音时,我按照以下步骤操作:https ://github.com/CSTR-Edinburgh/CSTR-Edinburgh.github.io/blob/master/_posts/2016-8-21-Multisyn_unit_selection .md并使用此处的语音定义:https ://raw.githubusercontent.com/CSTR-Edinburgh/merlin/master/egs/hybrid_synthesis/s1/voice_definition_files/unit_selection/cstr_us_awb_arctic_multisyn.scm 。不幸的是,wav 太吵了,所以我最终手动标记它们并跳过自动标记过程。

声音现在还可以,但仍需要一些工作。经常发生的一个错误是节日报告“缺少双音素”以暂停到电话转换,例如:

festival> (utt.relation.print (SayText "I can say anything I want.") 'Unit)
Missing diphone: #_ay
 diphone still missing, backing off: #_ay
 backed off: #_ay -> #_ax
 diphone still missing, backing off: #_ax
 backed off: #_ay -> #_#
 diphone still missing, backing off: #_#
 backed off: #_ay ->
Missing diphone: ey_eh
 Interword so inserting silence.
 diphone still missing, backing off: ey_#
 backed off: ey_eh -> ax_#
 diphone still missing, backing off: ax_#
 backed off: ey_eh -> #_#
 diphone still missing, backing off: #_#
 backed off: ey_eh ->
Missing diphone: #_eh
 diphone still missing, backing off: #_eh
 backed off: #_eh -> #_ax
 diphone still missing, backing off: #_ax
 backed off: #_eh -> #_#
 diphone still missing, backing off: #_#
 backed off: #_eh ->
Missing diphone: t_#
 diphone still missing, backing off: t_#
 backed off: t_# -> #_#
 diphone still missing, backing off: #_#
 backed off: t_# ->

我尝试用and替换标签中的siland sp(来自自动过程)(为了与festival/lib/radio_phones.scm中使用的静音相对应),我也尝试用just替换它们,但这并没有改变任何东西。源 wav/labs 肯定包含上面的转换(例如,有几个以“我可以”开头),但节日似乎从未使用这些。pauh##

我怎样才能让节日在源数据中使用暂停到电话转换?

谢谢!

4

1 回答 1

0

发生的情况是,当我运行基于 Multisyn 单元选择的脚本时,build_utts 部分失败并跳过,因为手工标记的标签与 Festival 预测的不完全匹配。例如,如果演讲者说“极端”,eh k s ...但 Festival 会计算ih k s ...build_utts 脚本将失败并出现如下错误:

align missmatch at ih (0.000000) eh (2.810566)

我为每个话语手动运行 build_utts 脚本并相应地调整了标签。如果像我一样,你愚蠢到尝试给自己贴上一些对我有帮助的提示:

  • 考虑移除任何手机封盖,例如t_cld_cl因为这些手机在尝试匹配时真的会搞砸
  • 确保在每个话语的开头和结尾都有一个暂停(即#),因为 build_utts 脚本不会抱怨它,但是在 Festival 中运行语音时,您会收到如下错误:

            -=-=-=-=-=- EST Error -=-=-=-=-=-
            {FND} Feature end not defined
    
            -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
    

感谢@NikolayShmyrev 为我指明了正确的方向。他还建议使用Ossian而不是使用 python 而不是 Festival 相当困难的代码的 Festival。

于 2018-01-31T10:17:00.130 回答