GPT-SW3: The First Large Generative Language Model for the Nordic Languages
This talk gives an overview of the process of building the first large generative language model for Swedish. We cover the motivation for building the model, as well as challenges and opportunities with data and compute. We also give examples of applications of the model and discuss future directions for building and deploying large language models for smaller languages.
Magnus Sahlgren, PhD, is Head of Research for Natural Language Understanding at AI Sweden. Sahlgren's research lies at the intersection between computational linguistics, philosophy, and artificial intelligence. He is primarily known for his work on computational models of meaning.