- Supports the following decoders/losses:
  - NeMo Original Multi-blank Transducers and Token-and-Duration Transducers (TDT)
- Streaming/Buffered ASR (CTC/Transducer): chunked inference examples
- Cache-aware Streaming Conformer with multiple lookaheads
- Language modelling for ASR (CTC and RNNT): N-gram LM in fusion with beam search decoding, and neural rescoring with Transformer
- Support for long audio in Conformer with memory-efficient local attention
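To give a rough sense of why local attention helps with long audio, here is a minimal sketch in plain Python. It is not NeMo's implementation: the function names, the window sizes (128 frames on each side), and the 1000-frame input are all assumptions chosen for illustration. It only counts how many frame pairs are attended under full attention versus a sliding window.

```python
def local_attention_span(t: int, num_frames: int, left: int, right: int) -> range:
    """Indices of the frames that frame t may attend to under a sliding window.

    Hypothetical helper for illustration; the window is clipped at the
    start and end of the sequence.
    """
    start = max(0, t - left)
    stop = min(num_frames, t + right + 1)
    return range(start, stop)


def count_attention_pairs(num_frames: int, left: int, right: int) -> int:
    """Total number of (query, key) frame pairs under local attention."""
    return sum(
        len(local_attention_span(t, num_frames, left, right))
        for t in range(num_frames)
    )


# Assumed values, not taken from the source.
num_frames = 1000
full_pairs = num_frames * num_frames  # full attention grows quadratically
local_pairs = count_attention_pairs(num_frames, left=128, right=128)

print(f"full attention pairs:  {full_pairs}")
print(f"local attention pairs: {local_pairs}")
```

With a fixed window, the number of attended pairs grows roughly linearly with the number of frames instead of quadratically, which is the property that makes long recordings tractable.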