End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
audio
deep-learning
tensorflow
paper
end-to-end
evaluation
cnn
lstm
speech-recognition
rnn
automatic-speech-recognition
feature-vector
data-preprocessing
phonemes
timit-dataset
layer-normalization
rnn-encoder-decoder
chinese-speech-recognition
-
Updated
Feb 9, 2022 - Python
Creating CSV files manually is a lot of work. This could be automated by a script if the name of the WAV file is the same as the transcript.
The same could be done for creating a language model input text file. A script could pull the transcript from the WAV file name.