# htk

Here are 22 public repositories matching this topic...

A lyrics-to-audio alignment system based on machine learning algorithms: hidden Markov models with Viterbi forced alignment. The alignment is explicitly aware of the durations of musical notes. The phonetic models are classified with an MLP deep neural network.

  • Updated Mar 9, 2020
  • Python

Speech Tester is a set of Python scripts conceived as an extension to the HTK automatic speech recognition system (Young et al., 2002). Speech Tester aims to help language researchers and engineers measure the intelligibility of transformed audio signals. Here the term “transform” may describe any process that alters a speech signal or its reception by a potential listener: for instance, any type of audio compression, background noise and/or reverberation, vocoding (to simulate cochlear implants), partial auditory loss, or any combination of these and other imaginable transformations. Intelligibility refers to the capacity of a listener to recognize a transformed signal. In the rest of this introduction we describe the motivation for developing Speech Tester to measure speech intelligibility.

  • Updated May 17, 2019
  • Python
