BibSLEIGH
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Travelled to:
1 × Australia
1 × Brazil
1 × Chile
1 × France
1 × Germany
1 × Portugal
1 × Singapore
1 × Switzerland
1 × The Netherlands
2 × China
2 × Ireland
2 × United Kingdom
4 × USA
Collaborated with:
P.Chandar J.Allan E.Kanoulas E.Yilmaz M.D.Smucker P.N.Bennett W.Webber V.Pavlu D.Zhu S.L.Thota I.Soboroff D.Petkova J.A.Aslam A.Bah R.K.Sitaraman J.Lewis J.Arguello F.Díaz J.Callan P.D.Clough M.Sanderson H.Fang D.M.Chickering S.T.Dumais
Talks about:
evalu (15) retriev (11) test (10) document (6) prefer (6) user (6) system (5) inform (5) effect (5) rank (5)

Person: Ben Carterette

DBLP DBLP: Carterette:Ben

Contributed to:

SIGIR 20152015
SIGIR 20142014
SIGIR 20132013
CIKM 20122012
SIGIR 20122012
CIKM 20112011
ECIR 20112011
SIGIR 20112011
SIGIR 20102010
CIKM 20092009
ECIR 20092009
SIGIR 20092009
ECIR 20082008
SIGIR 20082008
CIKM 20072007
SIGIR 20072007
SIGIR 20062006
CIKM 20052005
SIGIR 20052005

Wrote 35 papers:

SIGIR-2015-BahCC #documentation
Document Comprehensiveness and User Preferences in Novelty Search Tasks (AB, PC, BC), pp. 735–738.
SIGIR-2015-Carterette #effectiveness #random #testing
The Best Published Result is Random: Sequential Testing and its Effect on Reported Effectiveness (BC), pp. 747–750.
SIGIR-2014-Carterette #information retrieval #statistics #testing #theory and practice
Statistical significance testing in information retrieval: theory and practice (BC), p. 1286.
SIGIR-2013-ChandarC #evaluation #metric
Preference based evaluation measures for novelty and diversity (PC, BC), pp. 413–422.
SIGIR-2013-ChandarWC #documentation #predict
Document features predicting assessor disagreement (PC, WW, BC), pp. 745–748.
SIGIR-2013-ZhuC #adaptation
An adaptive evidence weighting method for medical record search (DZ, BC), pp. 1025–1028.
CIKM-2012-CarteretteKY #behaviour #evaluation #variability
Incorporating variability in user behavior into systems based evaluation (BC, EK, EY), pp. 135–144.
CIKM-2012-WebberCC #retrieval
Alternative assessor disagreement and retrieval depth (WW, PC, BC), pp. 125–134.
SIGIR-2012-CarteretteKY #development #evaluation #metric
Advances on the development of evaluation measures (BC, EK, EY), pp. 1200–1201.
SIGIR-2012-ChandarC #documentation #novel #retrieval #using
Using preference judgments for novel document retrieval (PC, BC), pp. 861–870.
SIGIR-2012-ChandarC12a #rank #using
Using PageRank to infer user preferences (PC, BC), pp. 1167–1168.
CIKM-2011-CarteretteKY #behaviour #effectiveness #evaluation #simulation
Simulating simple user behavior for system effectiveness evaluation (BC, EK, EY), pp. 611–620.
ECIR-2011-ArguelloDCC
A Methodology for Evaluating Aggregated Search Results (JA, FD, JC, BC), pp. 141–152.
ECIR-2011-ThotaC #statistics #testing
Within-Document Term-Based Index Pruning with Statistical Hypothesis Testing (SLT, BC), pp. 543–554.
SIGIR-2011-Carterette #concept #effectiveness #framework #modelling
System effectiveness, user models, and user utility: a conceptual framework for investigation (BC), pp. 903–912.
SIGIR-2011-KanoulasCCS #multi
Evaluating multi-query sessions (EK, BC, PDC, MS), pp. 1053–1062.
SIGIR-2010-CarteretteKPF #design #reuse
Reusable test collections through experimental design (BC, EK, VP, HF), pp. 547–554.
SIGIR-2010-CarteretteKY #evaluation #information retrieval #low cost
Low cost evaluation in information retrieval (BC, EK, EY), p. 903.
SIGIR-2010-CarteretteS #evaluation #fault #information retrieval
The effect of assessor error on IR system evaluation (BC, IS), pp. 539–546.
SIGIR-2010-ChandarC #using
Diversification of search results using webgraphs (PC, BC), pp. 869–870.
CIKM-2009-CarteretteC #documentation #modelling #novel #probability #ranking #retrieval #topic
Probabilistic models of ranking novel documents for faceted topic retrieval (BC, PC), pp. 1287–1296.
ECIR-2009-CarterettePKAA #query
If I Had a Million Queries (BC, VP, EK, JAA, JA), pp. 288–300.
SIGIR-2009-Carterette #correlation #distance #on the #rank #ranking
On rank correlation and the distance between rankings (BC), pp. 436–443.
SIGIR-2009-SmuckerAC #evaluation #information retrieval #statistics #testing
Agreement among statistical significance tests for information retrieval evaluation at varying sample sizes (MDS, JA, BC), pp. 630–631.
ECIR-2008-CarteretteBCD
Here or There (BC, PNB, DMC, STD), pp. 16–27.
SIGIR-2008-CarteretteB #evaluation #metric
Evaluation measures for preference judgments (BC, PNB), pp. 685–686.
SIGIR-2008-CarterettePKAA #evaluation #query
Evaluation over thousands of queries (BC, VP, EK, JAA, JA), pp. 651–658.
CIKM-2007-CarteretteA #documentation #evaluation #retrieval #using
Semiautomatic evaluation of retrieval systems using document similarities (BC, JA), pp. 873–876.
CIKM-2007-CarteretteS #testing
Hypothesis testing with incomplete relevance judgments (BC, MDS), pp. 643–652.
CIKM-2007-SmuckerAC #comparison #evaluation #information retrieval #statistics #testing
A comparison of statistical significance tests for information retrieval evaluation (MDS, JA, BC), pp. 623–632.
SIGIR-2007-Carterette #evaluation #retrieval #robust
Robust test collections for retrieval evaluation (BC), pp. 55–62.
SIGIR-2006-CarteretteAS #evaluation #retrieval
Minimal test collections for retrieval evaluation (BC, JA, RKS), pp. 268–275.
SIGIR-2006-CarteretteP #learning #ranking
Learning a ranking from pairwise preferences (BC, DP), pp. 629–630.
CIKM-2005-CarteretteA #incremental
Incremental test collections (BC, JA), pp. 680–687.
SIGIR-2005-AllanCL #information retrieval #question
When will information retrieval be “good enough”? (JA, BC, JL), pp. 433–440.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.