SlatorPod

#161 Microsoft’s Christian Federmann on the Translation Quality of Large Language Models

April 12, 2023 Slator
SlatorPod
#161 Microsoft’s Christian Federmann on the Translation Quality of Large Language Models
Show Notes Chapter Markers

In this week’s SlatorPod, we are joined by Christian Federmann, Principal Research Manager at Microsoft, where he works on machine translation (MT) evaluation and language expansion.

Christian recounts his journey from working at the German Research Center for Artificial Intelligence under the guidance of AI pioneer Hans Uszkoreit to joining Microsoft and building out Microsoft Translator.

He shares how Microsoft Translator evolved from using statistical MT to neural MT and why they opted for the Marian framework.

Christian expands on Microsoft’s push into large language models (LLMs) and how his team is now experimenting with NMT and LLM machine translation systems. He then explores how LLMs translate and the role of various prompts in the process.

Christian discusses the key metrics historically and currently used to evaluate machine translation. He also unpacks the findings from a recent research paper he co-authored investigating the applicability of LLMs for automated assessment of translation quality.

Christian describes how Microsoft’s custom translator fine-tunes and improves the user’s MT model through customer-specific data, which degrades more general domain performance. 

He shares Microsoft’s approach to expanding its support for languages with the recent addition of 13 African languages. Collaboration with language communities is an integral step in improving the quality of the translation models

To round off, Christian believes that the hype around LLMs may hit a wall within the next six months, as people realize the limitations of what they can achieve. However, in a year or two, there will be better solutions available, including LLM-enhanced machine translation.

Intro and Agenda
Background and Career Milestones
Microsoft Translator in a Nutshell
GPT Use at Microsoft
GPT Technical Issues
How do LLMs translate?
LLMs and Machine Translation
Translation Quality of LLMs
Prompt Engineering
Training LLMs to Machine Translate
Microsoft Translator User Profile
Data Training Cycle
Government Usage
Language Coverage
Pivot Language
Improving Machine Translation Quality
LLMs and MT Predictions