Aleksandr Laptev is a Ph.D. student at ITMO University and a senior research scientist as NVIDIA. His scientific interests are Automatic Speech Recognition, Speech Synthesis (TTS), and Natural Language Processing. He writes open-access scientific articles, contributes to open-source software, and participates in international speech recognition competitions. His current research area is differentiable Weighted Finite-State Transducers.
This talk introduces NeMo: NVIDIA's open-source toolkit for conversational AI that provides a wide collection of models for automatic speech recognition, text-to-speech, natural language processing and neural machine translation.