1. 程式人生 > 其它 >[10-論文筆記][03] MS MARCO資料集整理

[10-論文筆記][03] MS MARCO資料集整理

MS MARCO資料集整理

論文地址:https://arxiv.org/pdf/1611.09268.pdf. NIPS2016
相關介紹:

任務1: Document Retrieval(2020/11/08-現在) 文件檢索任務

Based the questions in the Question Answering Dataset(原始MRC資料集)

and the documents which answered the questions a document ranking task was formulated. There are 3.2 million documents and the goal is to rank based on their relevance. 基於MRC任務進一步構建 query, 網頁回答排序任務,基於相關性, 320W 網頁檢索

Relevance labels are derived from what passages was marked as having the answer in the QnA dataset making this one of the largest relevance

datasets ever. 相關性標籤來源:QnA資料集; 具體見MS MARCO網站介紹;

This dataset is the focus of the 2020 and 2019 TREC Deep Learning Track and has been used as a teaching aid for ACM SIGIR/SIGKDD AFIRM Summer School on Machine Learning for Data Mining and Search. 資料集在競賽/會議中使用;

任務2: