Microsoft Research Podcast

Microsoft Research Podcast


Abstracts: July 29, 2024

July 29, 2024

A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.

Read the paper

Get the code