seminars
Detail
Publication date: 28 de April, 2026Large Language Models: From Next-Token Prediction to Reasoning Agents
In just a few years, large language models have undergone a transformation that few anticipated: from statistical text generators to systems capable of multi-step reasoning, tool use, and autonomous action. This talk traces that journey.
We begin with the fundamentals: how LLMs work and what the scaling laws tell us about their behaviour. We then examine three pivotal developments that redefined the field: inference-time computation (chain-of-thought and beyond), reinforcement learning with verifiable rewards, and the rise of agentic systems.
A central focus of the talk is on evaluation: how progress is measured, which benchmarks matter and why, and what the dramatic improvements in cost-efficiency over the past three years let us anticipate about the years ahead. We also look at the open-source landscape, where models from Alibaba and DeepSeek have altered assumptions about the resources required to reach frontier AI.
We close with coding agents: how the agentic loop works, what tools like Claude Code and similar systems are already capable of, and what this implies for the future of the software engineering profession and the broader society.
| URL | https://meet.google.com/pyy-rwtk-apq |
|---|---|
| Date | 13/05/2026 2:00 pm |
| Location | DI Seminars Room and Google Meet |
| Host Bio | José Santos holds a Licenciatura in Engenharia Informática (2004), a Masters degree in Artificial Intelligence (2006) both from Faculdade de Ciências e Tecnologia da Universidade Nova de Lisboa and a PhD in Computer Science (Logic-based Machine Learning algorithms) from Imperial College, London (2010). José Santos joined Microsoft in 2011, and became a Principal Software Engineer in 2020. José is a co-inventor of two Microsoft patents, has worked on multiple projects (Bing, Shopping, SwiftKey, Azure), and since 2025 is in the Microsoft Core AI team, assessing the accuracy, cost-effectiveness, capabilities, and limitations of Large Language Models and agents. |