(简体中文|English)
PP-ASR is a tool that provides ASR (automatic speech recognition). It offers a variety of Chinese and English models, supports model training, and supports model inference from the command line. In addition, PP-ASR supports the deployment of streaming models and customized ASR.
The basic process of ASR is shown in the figure below:
The main characteristics of PP-ASR are shown below:
Use paddlespeech asr --model xxx --input xxx.wav to run inference with a pre-trained model. For the list of supported pre-trained models, see released_model.
The best-performing models are the Ds2 Online Wenetspeech ASR0 Model and the Conformer Online Wenetspeech ASR1 Model. Both models support streaming ASR.
For more information about model design, you can refer to the aistudio tutorial:
The reference scripts for model training are stored under examples, organized as "examples/dataset/model". The supported datasets are mainly aishell and librispeech. The supported models are deepspeech2 and u2 (conformer/transformer).
The specific steps for running a script are recorded in its run.sh.
For more information, you can refer to asr1
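The "examples/dataset/model" layout described above can be sketched as a small path helper. This is only an illustration: the function name run_script_path is hypothetical, and the aishell/asr1 combination is assumed from the dataset and model directory names mentioned in this document, so check the repository for the directories that actually exist.

```python
from pathlib import Path


def run_script_path(dataset, model_dir, repo_root="."):
    """Return the expected location of run.sh for a dataset/model pair.

    Hypothetical helper: it only joins path components following the
    "examples/dataset/model" convention and does not check that the file exists.
    """
    return Path(repo_root) / "examples" / dataset / model_dir / "run.sh"


# Assumed example: the aishell dataset with the asr1 model directory.
script = run_script_path("aishell", "asr1")
print(script.as_posix())
```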
After installing paddlespeech with pip install paddlespeech, you can run inference with a pre-trained model using paddlespeech asr --model xxx --input xxx.wav.
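If you want to call the command from a script, a minimal sketch is shown here. It only wraps the paddlespeech asr --model xxx --input xxx.wav command quoted in this document; the helper names build_asr_command and run_asr are hypothetical, and the model name and input file are placeholders you must replace with a model from released_model and a real audio file.

```python
import shutil
import subprocess


def build_asr_command(model, wav_path):
    """Build the argument list for the paddlespeech asr command line."""
    return ["paddlespeech", "asr", "--model", model, "--input", wav_path]


def run_asr(model, wav_path):
    """Run the command and return what it printed to stdout.

    Returns None when the paddlespeech executable is not on PATH,
    i.e. when paddlespeech has not been installed yet.
    """
    if shutil.which("paddlespeech") is None:
        return None
    result = subprocess.run(
        build_asr_command(model, wav_path),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the command fails
    )
    return result.stdout.strip()
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.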
Specific supported functions include:
For specific usage, please refer to: speech_recognition
PP-ASR supports service deployment of streaming ASR. It also supports using speech recognition and punctuation processing together.
Demo of ASR Server: streaming_asr_server
Video of the ASR server being used from a web page: streaming_asr_demo_video
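Streaming recognition generally means that audio is sent to the server in small pieces instead of as one complete file. The sketch below illustrates only that chunking step with the Python standard library. It does not use the PP-ASR client API, and the audio format (16 kHz, 16-bit mono PCM) and the 80 ms chunk length are assumptions made for the example, not values taken from this document.

```python
def iter_pcm_chunks(pcm, sample_rate=16000, sample_width=2, chunk_ms=80):
    """Yield consecutive fixed-duration chunks of raw mono PCM bytes.

    The last chunk may be shorter than the others.
    All default values are assumptions for illustration.
    """
    bytes_per_chunk = sample_rate * sample_width * chunk_ms // 1000
    for start in range(0, len(pcm), bytes_per_chunk):
        yield pcm[start:start + bytes_per_chunk]


# One second of silence in the assumed format: 16000 samples * 2 bytes each.
one_second = bytes(32000)
chunks = list(iter_pcm_chunks(one_second))
print(len(chunks), len(chunks[0]), len(chunks[-1]))
```

A real client would send each chunk to the server as it becomes available and read back partial results. See streaming_asr_server for the actual interface.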
For more information about service deployment, you can refer to the aistudio tutorial:
For customized automatic speech recognition and deployment, PP-ASR provides feature extraction (fbank) => inference model (scoring library) => C++ program of TLG (WFST: token, lexicon, grammar). For specific usage, please refer to: speechx
To get started quickly, you can refer to custom_streaming_asr
For more information about customized automatic speech recognition and deployment, you can refer to the aistudio tutorial:
To use PP-ASR, see install. It describes three ways to install paddlespeech: Easy, Medium and Hard. If you only want to try the inference function of paddlespeech, the Easy installation method is enough.