Papers
arxiv:2201.00365

Establishing Strong Baselines for TripClick Health Retrieval

Published on Jan 2, 2022
Authors:
,
,
,

Abstract

We present strong Transformer-based re-ranking and <PRE_TAG>dense retrieval</POST_TAG> baselines for the recently released TripClick health ad-hoc retrieval collection. We improve the - originally too noisy - training data with a simple negative sampling policy. We achieve large gains over BM25 in the re-ranking task of TripClick, which were not achieved with the original baselines. Furthermore, we study the impact of different domain-specific pre-trained models on TripClick. Finally, we show that <PRE_TAG>dense retrieval</POST_TAG> outperforms BM25 by considerable margins, even with simple training procedures.

Community

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2201.00365 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2201.00365 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.