NeMo Canary 1B Flash model: Transcribe & Translate audio

๐ŸŽ‰ NEW: Canary 1B V2 is now available!

๐ŸŒ 25 European Languages | โฑ๏ธ Much Improved Timestamp Prediction | ๐Ÿ”„ Enhanced ASR & AST

๐Ÿ”— Model: nvidia/canary-1b-v2๐Ÿš€ Try Live Demo

Step 1: Upload an audio file or record with your microphone.

This demo supports audio files up to 30 mins long. You can transcribe longer files locally with this NeMo script.

Step 2: Choose the input and output language.

If input & output languages are the same, you can also toggle generating punctuation & capitalization and timestamps.

Input audio is spoken in:
Transcribe in language:

Step 3: Run the model.

๐Ÿค Canary 1B Flash model | ๐Ÿง‘โ€๐Ÿ’ป NeMo Repository