Skip to content
#

speech-recognition

Here are 1,548 public repositories matching this topic...

r9y9
r9y9 commented Jun 18, 2019

Current documentation in README explains how to install the toolkit and how to run examples. However, I don't think this is enough for users who want to make some changes to the existing recipes or make their own new recipe. In that case, one needs to understand what run.sh does step by step, but I think docs for that are missing at the moment. It would be great if we provide documentation for:

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

  • Updated Jun 11, 2020
  • Python
tkornuta-nvidia
tkornuta-nvidia commented Dec 19, 2019

For example, documentation of AudioToSpectrogramPreprocessor is missing.

We investigated that with @redoctopus and she has found that nemo_asr.rst doesn't include several modules.

https://github.com/NVIDIA/NeMo/blob/master/docs/sources/source/collections/nemo_asr.rst

So the bigger issue includes: taking a look at all implemented components and checking whether the associated rst files are

deepxuexi
deepxuexi commented Jun 24, 2019

基于anaconda3安装:
1.为masr创建一个基于python3.6的虚拟环境,环境名称为masr
conda create -n masr python=3.6 pip numpy jupyter
2.切换到masr环境
source activate masr
3.安装几个必要的模块
pip install librosa
conda install pytorch-cpu==1.0.1 torchvision-cpu -c pytorch
pip install flask
4.解压ctcdecode.zip到anaconda3/envs/masr/lib/python3.6/site-packages/路径下,这样就不用去辛苦编译ctcdecode啦
[ctcdecode.zip](https://github.com/lukhy/masr/file

MarkRx
MarkRx commented Apr 27, 2017

It seems this file was last updated for 1.0.3. It is out of sync with the actual code. For example the index.d.ts file says:

        /**
         * Set up artyom for the application. This function will set the default language used by artyom
         * or notice the user if artyom is not supported in the actual browser.
         * @see http://docs.ourcodeworld.com/projects/artyom-js/doc
xinzheshen
xinzheshen commented Dec 25, 2019

@lfchener 我看报错这行代码是您最近添加的,我是pip安装的环境, 并按照readme 一步步执行的,但执行到infer时报错,是readme过时了吗?谢谢。
Traceback (most recent call last):
File "/home/shenxz/.IdeaIC2019.2/config/plugins/python-ce/helpers/pydev/pydevd.py", line 1415, in _exec
pydev_imports.execfile(file, globals, locals) # execute the script
File "/home/shenxz/code/DeepSpeech/infer.py", line 152, in
main()
File "/home/sh

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.