MMM
YYYY
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm
预训练和微调范式下的相关性排序建模
事前訓練と微調整パラダイム下の関連性ランキングのモデリング
예비 훈련 과 마이크로 모드 에서 의 상관 성 정렬 모델 링
Modelo de orden de correlación basado en paradigmas de pre - entrenamiento y ajuste fino
Modélisation de l'ordre de corrélation basée sur le paradigme de pré - formation et de réglage fin
модель ранжирования по типу ранжирования
Lin Bo ¹, Liang Pang 庞亮 ³, Gang Wang ⁴, Jun Xu 徐君 ², XiuQiang He 何秀强 ⁴, Ji-Rong Wen 文继荣 ²
¹ School of Information, Renmin University of China, Beijing, China
中国 北京 中国人民大学信息学院
² Gaoling School of Artificial Intelligence, Renmin University of China, , Beijing, China
中国 北京 中国人民大学高瓴人工智能学院
³ Institute of Computing Technology, Chinese Academy of Sciences
中国 北京 中国科学院计算技术研究所
⁴ Huawei Noah’s Ark Lab
中国 香港 华为诺亚方舟实验室
arXiv, 12 August 2021
Abstract

Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval. These methods usually first pre-train a general language model on an unlabeled large corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Though reliminary successes have been observed in a variety of IR tasks, a lot of room still remains for further improvement.

Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. User's view judges the relevance based on the activities of “real users” while the system's view focuses on the relevance signals from the system side, e.g., from the experts or algorithms, etc. Inspired by the user-system relevance views and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both user's view and system's view into consideration, under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations based on a large-scale user activities data such as the click log. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations, are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank can model the relevance by incorporating the relevant knowledge and signals from both real search users and the IR experts.

To verify the effectiveness of Pre-Rank, we showed two implementations by using BERT and SetRank as the underlying ranking model, respectively. Experimental results base on three publicly available benchmarks showed that in both of the implementations, Pre-Rank can respectively outperform the underlying ranking models and achieved state-ofthe-art performances. The results demonstrate the effectiveness of Pre-Rank in combining the user-system views of relevance.
arXiv_1
arXiv_2
arXiv_3
arXiv_4
Reviews and Discussions
https://www.hotpaper.io/index.html
High-resolution tumor marker detection based on microwave photonics demodulated dual wavelength fiber laser sensor
High performance laser induced plasma assisted ablation by GHz burst mode femtosecond pulses
Sequential harmonic spin–orbit angular momentum generation in nonlinear optical crystals
Advanced biological imaging techniques based on metasurfaces
Orthogonal matrix of polarization combinations: concept and application to multichannel holographic recording
High-precision multi-focus laser sculpting of microstructured glass
Multi-physical field null medium: new solutions for the simultaneous control of EM waves and heat flow
Adaptive decentralized AI scheme for signal recognition of distributed sensor systems
Data-driven polarimetric approaches fuel computational imaging expansion
An externally perceivable smart leaky-wave antenna based on spoof surface plasmon polaritons
The possibilities of using a mixture of PDMS and phosphor in a wide range of industry applications
Agile cavity ringdown spectroscopy enabled by moderate optical feedback to a quantum cascade laser



Previous Article                                Next Article
About
|
Contact
|
Copyright © Hot Paper