News and features

Inferring rewards through interaction
| Jessica Maghakian, Akanksha Saran, Cheng Tan, and Paul Mineiro
In reinforcement learning, handcrafting reward functions is difficult and can yield algorithms that don’t generalize well. IGL-P, an interaction-grounded learning strategy, learns personalized rewards for different people in recommender system scenarios.
Award | International World Wide Web Conference
John Langford, Rob Schapire and co-authors receive the 2023 Seoul Test of Time Award
The International World Wide Web Conference Committee (IW3C2) announced today that the 2023 Seoul Test of Time Award will be presented to the authors of the paper “A Contextual-Bandit Approach to Personalized News Article Recommendation”: Wei Chu (Ant Group), Lihong…

Research Focus: Week of March 6, 2023
Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. Attack methods like Spectre exploit speculative execution, one of…

Research Focus: Week of February 20, 2023
Welcome to Research Focus, a new series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. Many real-world applications require sequential decision making, where an agent interacts with…
In the news | ACM
danah boyd featured in “People of ACM”
The August issue of People of ACM featured danah boyd, Partner Researcher in the New York City Lab. She is the author of “It’s Complicated: The Social Lives of Networked Teens” and “Participatory Culture in a Networked Era”. In 2013,…
In the news | Microsoft India Paradigm Shift Podcast
Paradigm Shift, Episode 3: Pitches and Pawns
From analyzing every cricketing shot in the book, to infusing intelligence on the chessboard, to reimagining how fans engage with the players and the game, AI is shaping the next best move in sports.

PPE: A fast and provably efficient RL algorithm for exogenous noise
| Dipendra Misra and Yonathan Efroni
Picture a person walking in a park by a pond. The surrounding environment contains a number of moving objects that change the quality of the environment: clouds moving to hide the sun, altering the quality of light; ducks gliding across…

Lecture series aims to help spur dialogue around race and technology
In November, NYU media professor Charlton McIlwain joined fellow scholars Safiya Noble, Ruha Benjamin, and André Brock for a virtual discussion on anti-Blackness and technology hosted by the University of California Santa Barbara. The conversation was…

Econ4: Uncovering how decision-making shapes individuals and society through behavioral public economics featuring Evan Rose and Hunt Allcott
In this episode, Senior Principal Researcher Hunt Allcott talks with Postdoctoral Researcher Evan Rose about Allcott’s work exploring the everyday decisions people face, like buying fuel-efficient cars or taking out payday loans, and how a clearer understanding of these decisions…