The Nordic Pile: A 1.2TB Nordic Dataset for Language Modeling Paper • 2303.17183 • Published Mar 30, 2023 • 1
GPT-SW3: An Autoregressive Language Model for the Nordic Languages Paper • 2305.12987 • Published May 22, 2023
Why Not Simply Translate? A First Swedish Evaluation Benchmark for Semantic Similarity Paper • 2009.03116 • Published Sep 7, 2020
Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead? Paper • 2104.10441 • Published Apr 21, 2021
SWEb: A Large Web Dataset for the Scandinavian Languages Paper • 2410.04456 • Published Oct 6, 2024 • 1
SWEb: A Large Web Dataset for the Scandinavian Languages Paper • 2410.04456 • Published Oct 6, 2024 • 1
R-grams: Unsupervised Learning of Semantic Units in Natural Language Paper • 1808.04670 • Published Aug 14, 2018
Text Annotation Handbook: A Practical Guide for Machine Learning Projects Paper • 2310.11780 • Published Oct 18, 2023
SWEb: A Large Web Dataset for the Scandinavian Languages Paper • 2410.04456 • Published Oct 6, 2024 • 1
Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it? Paper • 2109.11321 • Published Sep 23, 2021
SWEb: A Large Web Dataset for the Scandinavian Languages Paper • 2410.04456 • Published Oct 6, 2024 • 1