WSTA 20 - Evaluation and Re-ranking
Hard to characterise the quality of a system’s results:
- a subjective problem
- query is not the information need
Human judgements: too expensive and slow
Automatic evaluation
- Simplifying assumptions:
- retrieval is ad-hoc (no prior knowledge of the user)
- effectiveness based on relevance
- judged relevant or irrelevant (binary), or on multiple grades
- Relevance of docs is assumed independent
- Test collections:
- Relevance judgements (qrels)
- But not all docs have _qrels_ (the collection is too large to judge every document)
- Relevance vector $R = \langle 1, 0, 0, 0, 1, \ldots \rangle$: how to map it to a single number? -> precision & recall (hard)
- Precision @ k
- Average precision
- Mean Average Precision (MAP)
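A minimal sketch of the three metrics above for a single binary relevance vector ordered by rank (1 = relevant, 0 = irrelevant); the function names and toy numbers are illustrative, not from the lecture.

```python
def precision_at_k(rels, k):
    """Precision@k: fraction of the top-k retrieved documents that are relevant."""
    return sum(rels[:k]) / k

def average_precision(rels, num_relevant):
    """AP: sum of precision@k at each rank k where a relevant doc is retrieved,
    divided by the total number of relevant docs in the collection."""
    hits, total = 0, 0.0
    for k, r in enumerate(rels, start=1):
        if r:
            hits += 1
            total += hits / k
    return total / num_relevant if num_relevant else 0.0

def mean_average_precision(queries):
    """MAP: AP averaged over a set of queries; each entry is (rels, num_relevant)."""
    return sum(average_precision(r, n) for r, n in queries) / len(queries)

# R = <1,0,0,0,1>
rels = [1, 0, 0, 0, 1]
print(precision_at_k(rels, 3))     # 1/3
print(average_precision(rels, 2))  # (1/1 + 2/5) / 2 = 0.7
```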
RANK-BIASED PRECISION
RBP Formula
Patient user: p = 0.95; Impatient user: p = 0.50
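The notes only name the formula; for reference, the standard rank-biased precision definition (Moffat & Zobel), where $r_i \in \{0,1\}$ is the relevance of the document at rank $i$ and $p$ is the probability the user persists to the next result:

```latex
\mathrm{RBP} = (1 - p) \sum_{i \ge 1} r_i \, p^{\, i - 1}
```

A patient user ($p = 0.95$) gives non-trivial weight to deep ranks; an impatient user ($p = 0.50$) concentrates almost all of the weight on the top few results.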
EFFECTIVENESS IN PRACTICE:
- Also look at query logs and click logs
- Construct (learn) a similarity metric automatically from training data (queries, click data, documents)
- Machine learning
Learning to rank
Training data $\{(q_i, d_i, r_i)\}$: learn to combine features representing $x = \langle q_i, d_i \rangle$ to predict $r_i$
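A hedged sketch of the simplest (point-wise) instance of this setup using scikit-learn: each row of X is a feature vector for one (query, document) pair, the label is its relevance grade, and at query time candidate documents are ranked by predicted score. The feature columns and numbers are invented placeholders.

```python
# Point-wise learning to rank, sketched with scikit-learn.
# Each row of X represents features of one (query, document) pair,
# e.g. [BM25 score, PageRank, historical click rate] -- placeholders only.
from sklearn.ensemble import GradientBoostingRegressor

X_train = [
    [12.3, 0.40, 0.10],
    [ 3.1, 0.10, 0.01],
    [ 8.7, 0.55, 0.20],
]
r_train = [2, 0, 1]  # graded relevance judgements for each pair

model = GradientBoostingRegressor().fit(X_train, r_train)

# At query time: score each candidate document and sort by predicted relevance
X_candidates = [[10.0, 0.30, 0.05], [2.0, 0.60, 0.15]]
scores = model.predict(X_candidates)
ranking = sorted(range(len(X_candidates)), key=lambda i: scores[i], reverse=True)
```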
LEARNING TO RANK OBJECTIVES:
- Point-wise objective (Given one doc)
- Ask the user: How relevant is $d_i$?
- Pair-wise objective (Given two docs; see the sketch after this list)
- Ask the user: Which of these two documents is more relevant?
- List-wise objective (Output is a ranked list)
- Ask the user: Rearrange this list
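To make the pair-wise objective above concrete, a minimal NumPy sketch of a RankNet-style logistic loss on one document pair: the loss is small when the model scores the more relevant document above the less relevant one. The linear scorer and its weights are illustrative assumptions, not part of the lecture.

```python
import numpy as np

def pairwise_logistic_loss(score_better, score_worse):
    """RankNet-style loss for one pair: approaches 0 when the more relevant
    document is scored well above the less relevant one."""
    return np.log(1.0 + np.exp(-(score_better - score_worse)))

# Illustrative linear scorer over the same kind of feature vectors as above
w = np.array([0.8, 0.1, 0.1])
x_more_relevant = np.array([10.0, 0.30, 0.05])
x_less_relevant = np.array([2.0, 0.60, 0.15])

loss = pairwise_logistic_loss(w @ x_more_relevant, w @ x_less_relevant)
# Training would take gradient steps on this loss summed over many labelled pairs.
```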