HuggingFaceM4/DocumentVQA
Viewer • Updated • 50k • 4.63k • 46
This model is a fine-tuned version of resnext50_32x4d.fb_swsl_ig1b_ft_in1k on an aggregated dataset of images that were classified as relevant (1.0) or irrelevant (0.0). It achieves the following results on the validation set:
The following hyperparameters were used during training:
| Training Loss | Epoch | Validation Loss | Accuracy |
|---|---|---|---|
| 0.5536 | 1 | 0.3270 | 0.9856 |
| 0.3176 | 2 | 0.1720 | 0.9922 |
| 0.1887 | 3 | 0.1332 | 0.9944 |
| 0.1280 | 4 | 0.1146 | 0.9938 |
| 0.1116 | 5 | 0.1236 | 0.9938 |
| 0.1016 | 6 | 0.1032 | 0.9936 |
Base model
timm/resnext50_32x4d.fb_swsl_ig1b_ft_in1k