即时检测:声门从语音信号的关闭和打开(Sound)

本文提出了一种直接从语音波形中检测声门闭合和打开瞬间(GCIs和GOIs)的新方法。这个过程分为两个连续的步骤。首先计算一个基于均值的信号,并从中提取语音事件发生的时间间隔。其次,通过在线性预测残差中定位一个不连续点来确定语音事件的精确位置。将该方法与基于CMU ARCTIC数据库的DYPSA算法进行了比较。一个显著的改善以及更好的噪声稳健性在此方法中被报道。此外,GOI识别的准确性对于声门的来源性描述是有保证的。

原文标题:Sound: Glottal Closure and Opening Instant Detection from Speech Signals

This paper proposes a new procedure to detect Glottal Closure and Opening Instants (GCIs and GOIs) directly from speech waveforms. The procedure is divided into two successive steps. First a mean-based signal is computed, and intervals where speech events are expected to occur are extracted from it. Secondly, at each interval a precise position of the speech event is assigned by locating a discontinuity in the Linear Prediction residual. The proposed method is compared to the DYPSA algorithm on the CMU ARCTIC database. A significant improvement as well as a better noise robustness are reported. Besides, results of GOI identification accuracy are promising for the glottal source characterization.

原文作者:Thomas Drugman,Thierry Dutoit

原文链接:https://arxiv.org/abs/2001.00841