legacy-datasets/common_voice
Updated • 1.57k • 144
How to use MatsUy/wav2vec2-common_voice-nl-demo with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="MatsUy/wav2vec2-common_voice-nl-demo") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("MatsUy/wav2vec2-common_voice-nl-demo")
model = AutoModelForCTC.from_pretrained("MatsUy/wav2vec2-common_voice-nl-demo")This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the COMMON_VOICE - NL dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 3.0536 | 1.12 | 500 | 0.5349 | 0.4338 |
| 0.2543 | 2.24 | 1000 | 0.3859 | 0.3029 |
| 0.1472 | 3.36 | 1500 | 0.3471 | 0.2818 |
| 0.1088 | 4.47 | 2000 | 0.3489 | 0.2731 |
| 0.0855 | 5.59 | 2500 | 0.3582 | 0.2558 |
| 0.0721 | 6.71 | 3000 | 0.3457 | 0.2471 |
| 0.0653 | 7.83 | 3500 | 0.3299 | 0.2357 |
| 0.0527 | 8.95 | 4000 | 0.3440 | 0.2334 |
| 0.0444 | 10.07 | 4500 | 0.3417 | 0.2289 |
| 0.0404 | 11.19 | 5000 | 0.3691 | 0.2204 |
| 0.0345 | 12.3 | 5500 | 0.3453 | 0.2102 |
| 0.0288 | 13.42 | 6000 | 0.3634 | 0.2089 |
| 0.027 | 14.54 | 6500 | 0.3532 | 0.2044 |