2016 ICME: Phonetic posteriorgrams for many-to-one voice conversion without parallel data training

Date: 2021-01-11

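Author: Helen Meng

Affiliation: The Chinese University of Hong Kong (CUHK)

Abstract: Voice conversion trained on non-parallel data. First, an SI-ASR (speaker-independent automatic speech recognition) system extracts PPGs (phonetic posteriorgrams). The PPGs capture how the speech is articulated and represent the spoken content independently of who is speaking. Then a DBLSTM (deep bidirectional LSTM) is used to model the PPGs and the target ...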
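To make the term concrete, here is a minimal sketch of what a posteriorgram looks like as data: one probability distribution over phonetic classes per frame. The frame scores, the class count, and the helper names `softmax` and `posteriorgram` are all hypothetical and chosen only for illustration. In the paper's setting the per-frame scores would come from the SI-ASR acoustic model, which is not reproduced here.

```python
import math


def softmax(scores):
    """Turn one frame's raw class scores into posterior probabilities."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def posteriorgram(frame_scores):
    """Build a frames x classes matrix where every row sums to 1."""
    return [softmax(frame) for frame in frame_scores]


# Hypothetical scores for 3 frames over 4 phonetic classes (not real ASR output).
frame_scores = [
    [2.0, 0.5, 0.1, -1.0],
    [0.2, 3.0, 0.0, 0.1],
    [0.0, 0.0, 0.0, 0.0],
]

ppg = posteriorgram(frame_scores)
for t, row in enumerate(ppg):
    best = max(range(len(row)), key=row.__getitem__)
    print(f"frame {t}: most likely class {best}, p={row[best]:.3f}")
```

Because each row depends only on which phonetic class a frame resembles, the same matrix shape can be produced for any speaker, which is why the abstract can use it as an intermediate representation without parallel recordings.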
