About This Document
- sl:arxiv_author :
- sl:arxiv_firstAuthor : Omar Khattab
- sl:arxiv_num : 2007.00814
- sl:arxiv_published : 2020-07-01T23:50:58Z
- sl:arxiv_summary : Systems for Open-Domain Question Answering (OpenQA) generally depend on a
retriever for finding candidate passages in a large corpus and a reader for
extracting answers from those passages. In much recent work, the retriever is a
learned component that uses coarse-grained vector representations of questions
and passages. We argue that this modeling choice is insufficiently expressive
for dealing with the complexity of natural language questions. To address this,
we define ColBERT-QA, which adapts the scalable neural retrieval model ColBERT
to OpenQA. ColBERT creates fine-grained interactions between questions and
passages. We propose an efficient weak supervision strategy that iteratively
uses ColBERT to create its own training data. This greatly improves OpenQA
retrieval on Natural Questions, SQuAD, and TriviaQA, and the resulting system
attains state-of-the-art extractive OpenQA performance on all three datasets.@en
- sl:arxiv_title : Relevance-guided Supervision for OpenQA with ColBERT@en
- sl:arxiv_updated : 2021-08-02T17:14:01Z
- sl:bookmarkOf : https://arxiv.org/abs/2007.00814
- sl:creationDate : 2022-01-07
- sl:creationTime : 2022-01-07T18:39:10Z