parakeet-tdt-0.6b-v3 - Fine-tuned on Children's Speech

Fine-tuned from nvidia/parakeet-tdt-0.6b-v3 on children's speech data from the DrivenData ASR competition.

Training config

Mode: full
Epochs: 15
Batch size: 12 (accumulate: 2)
Learning rate: 0.0001
Precision: bf16-mixed
Speed perturbation: True

Usage

import nemo.collections.asr as nemo_asr

model = nemo_asr.models.ASRModel.restore_from("best_model.nemo")
hypotheses = model.transcribe(["audio.flac"])
print(hypotheses[0].text)

Downloads last month: 49

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Phaedrus33/parakeet-tdt-0.6b-v3-children-v11

Base model

nvidia/parakeet-tdt-0.6b-v3

Finetuned

(21)

this model