Nouvelles et reportages

The science behind semantic search: How AI from Bing is powering Azure Cognitive Search
| Rangan Majumder, Alec Berntson, Daxin Jiang (姜大昕), Jianfeng Gao, Furu Wei, et Nan Duan
Azure Cognitive Search (opens in new tab) is a cloud search service that gives developers APIs and tools to build rich search experiences over private, heterogeneous content in web, mobile, and enterprise applications. It has multiple components, including an API for indexing and querying, seamless integration through Azure data ingestion, deep…

HEXA: Self-supervised pretraining with hard examples improves visual representations
| Chunyuan Li, Lei Zhang, et Jianfeng Gao
Humans perceive the world through observing a large number of visual scenes around us and then effectively generalizing—in other words, interpreting and identifying scenes they haven’t encountered before—without heavily relying on labeled annotations for every single scene. One of the…

VinVL: Advancing the state of the art for vision-language models
| Pengchuan Zhang, Lei Zhang, et Jianfeng Gao
Humans understand the world by perceiving and fusing information from multiple channels, such as images viewed by the eyes, voices heard by the ears, and other forms of sensory input. One of the core aspirations in AI is to develop…

Microsoft DeBERTa surpasses human performance on the SuperGLUE benchmark
| Pengcheng He, Xiaodong Liu, Jianfeng Gao, et Weizhu Chen
Natural language understanding (NLU) is one of the longest running goals in AI, and SuperGLUE is currently among the most challenging benchmarks for evaluating NLU models. The benchmark consists of a wide range of NLU tasks, including question answering, natural…

Domain-specific language model pretraining for biomedical natural language processing
| Hoifung Poon et Jianfeng Gao
COVID-19 highlights a perennial problem facing scientists around the globe: how do we stay up to date with the cutting edge of scientific knowledge? In just a few months since the pandemic emerged, tens of thousands of research papers have…
Dans l’actualité | VentureBeat
Microsoft researchers claim ‘state-of-the-art’ biomedical NLP model
In a paper published on the preprint server Arxiv.org, Microsoft researchers propose an AI technique they call domain-specific language model pretraining for biomedical natural language processing (NLP). By compiling a “comprehensive” biomedical (NLP) benchmark from publicly available data sets, the coauthors claim they managed to…

Objects are the secret key to revealing the world between vision and language
| Chunyuan Li, Lei Zhang, et Jianfeng Gao
Humans perceive the world through many channels, such as images viewed by the eyes or voices heard by the ears. Though any individual channel might be incomplete or noisy, humans can naturally align and fuse the information collected from multiple…

A deep generative model trifecta: Three advances that work towards harnessing large-scale power
| Chunyuan Li et Jianfeng Gao
One of the core aspirations in artificial intelligence is to develop algorithms and techniques that endow computers with an ability to synthesize the observed data in our world. Every time researchers build a model to imitate this ability, this model…
Prix | IEEE Signal Processing Society
Hamid Palangi, Jianfeng Gao receive prestigious IEEE Signal Processing Society Best Paper Award
Hamid Palangi, Senior Researcher, Jianfeng Gao, Partner Research Manager, and their collaborators received the prestigious 2018 IEEE Signal Processing Society Best Paper award (Test of Time) for their work on deep sentence embedding for web search engines and information retrieval.…