Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -207,7 +207,7 @@ The model is intended for users requiring speech-to-text transcription capabilit
 ### Release Date:
-Huggingface 07/15/2025 via https://huggingface.co/nvidia/canary-qwen-2.5b
 ## Model Architecture:
 Canary-Qwen is a Speech-Augmented Language Model (SALM) [9] model with FastConformer [2] Encoder and Transformer Decoder [3]. It is built using two base models: `nvidia/canary-1b-flash` [1,5] and `Qwen/Qwen3-1.7B` [4], a linear projection, and low-rank adaptation (LoRA) applied to the LLM. The audio encoder computes audio representation that is mapped to the LLM embedding space via a linear projection, and concatenated with the embeddings of text tokens. The model is prompted with "Transcribe the following: <audio>", using Qwen's chat template.

 ### Release Date:
+Huggingface 07/17/2025 via https://huggingface.co/nvidia/canary-qwen-2.5b
 ## Model Architecture:
 Canary-Qwen is a Speech-Augmented Language Model (SALM) [9] model with FastConformer [2] Encoder and Transformer Decoder [3]. It is built using two base models: `nvidia/canary-1b-flash` [1,5] and `Qwen/Qwen3-1.7B` [4], a linear projection, and low-rank adaptation (LoRA) applied to the LLM. The audio encoder computes audio representation that is mapped to the LLM embedding space via a linear projection, and concatenated with the embeddings of text tokens. The model is prompted with "Transcribe the following: <audio>", using Qwen's chat template.