Dataset Released
Queries Released
Deadline for submitting ranking runs.
To participate in TREC please pre-register at the following website: https://ir.nist.gov/trecsubmit.open/application.html
The Product Search Track studies information retrieval in a domain of product search.
The dataset is
Type | Filename | File size | Num Records | Format |
---|---|---|---|---|
Corpus | ||||
Train | ||||
Dev | ||||
Test (TREC test 2023) |
The document corpus is in jsonl format. Each document has:
If you unzip the corpus, you can quickly access a document using:
You are generally allowed to use external information while developing your runs. When you submit your runs, please fill in a form listing what resources you used. This could include an external corpus such as Wikipedia or a pretrained model (e.g. word embeddings, BERT). This could also include the provided set of document ranking training data, but also optionally other data such as the passage ranking task labels or external labels or pretrained models. This will allow us to analyze the runs and break they down into types.
We are sharing the following additional resources which we hope will be useful for the community.
Dataset | Filename | File size | Num Records | Format |
---|---|---|---|---|
- Daniel Campos (University of Illinois at Urbana-Champaign)
- Corby Rosset (Microsoft)
- Alessandro Magnani (Walmart)
- ChengXiang Zhai (University of Illinois at Urbana-Champaign)
- Surya Kallumadi (Lowes)