- The objective function does not guarantee the optimization of some important measure in retrieval such as AP. These measure better capture the user experience than the pairwise ordering.
- Document pairs are not generated independently, which violate the assumption of classification methods that is applied.
- The number of document pairs, which grows in n square order, from query to query is highly unbalanced. This will lead to a biased model in training.
A listwise approach is proposed as follow:
- For each query, a list of score over all documents is given as ground truth.
- A ranking function f, which is what we want to learn, takes both query and document as input, and gives a score for each document-query pair.
- For each query, a list of score is generated using the ranking function
- The loss function is defined over the "ground truth list" and the "ranking list" of each query as input.
The ranking of the documents given a query can be viewed as a permutation, which is relevant to the list of score.
Now the loss function is defined as followed:
- Defined the probability of each permutation based on the ordering and the score of each documents.
- Given two lists of score, the probability of the permutation can be calculated, and the cross entropy of the ground truth and the score generated by ranking function is used as the loss function.
To reduce the computational cost, only the top k ranking documents are considered in the learning to rank process.