parakeet-tdt-0.6b-v3 - Fine-tuned on Children's Speech
Fine-tuned from nvidia/parakeet-tdt-0.6b-v3 on children's speech data from the DrivenData ASR competition.
Training config
- Mode: full
- Epochs: 15
- Batch size: 12 (accumulate: 2)
- Learning rate: 0.0001
- Precision: bf16-mixed
- Speed perturbation: True
Usage
import nemo.collections.asr as nemo_asr
model = nemo_asr.models.ASRModel.restore_from("best_model.nemo")
hypotheses = model.transcribe(["audio.flac"])
print(hypotheses[0].text)
- Downloads last month
- 49
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for Phaedrus33/parakeet-tdt-0.6b-v3-children-v11
Base model
nvidia/parakeet-tdt-0.6b-v3