Research talk: WebQA: Multihop and multimodal
- Yonatan Bisk | Carnegie Mellon University
- Microsoft Research Summit 2021 | Deep Learning & Large-Scale AI
Web search is fundamentally multimodal and multihop. Often, even before asking a question, people go directly to image search to find answers. Further, we rarely find an answer in a single source, opting instead to aggregate information and reason through its implications. Despite how common this everyday experience is, there is currently no unified question-answering benchmark that requires a single model to answer long-form natural language questions from both text and open-ended visual sources, akin to the human experience. We propose to bridge this gap between the natural language and computer vision communities with WebQA. We show that multihop text queries are difficult for a large-scale transformer model, and that existing multimodal transformers and visual representations do not perform well on open-domain visual queries. Our challenge to the community is to create a unified multimodal reasoning model that seamlessly transitions and reasons regardless of the source modality.
Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit
Yonatan Bisk
Professor
CMU
Deep Learning & Large-Scale AI
Opening remarks: Deep Learning and Large-Scale AI
- Ahmed Awadallah
Roundtable discussion: Efficient and adaptable large-scale AI
- Ahmed Awadallah, Jianfeng Gao, Danqi Chen
Panel: Large-scale neural platform models: Opportunities, concerns, and directions
- Eric Horvitz, Miles Brundage, Yejin Choi
Research talk: WebQA: Multihop and multimodal
- Yonatan Bisk
Roundtable discussion: Beyond language models: Knowledge, multiple modalities, and more
- Yonatan Bisk, Daniel McDuff, Dragomir Radev
Closing remarks: Deep Learning and Large-Scale AI
- Jianfeng Gao