Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter

Mark Sanderson, Kalervo Järvelin, James Allan, Peter Bruza
Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval
SIGIR, 2004.

KER
DBLP
Scholar
Full names Links ISxN
@proceedings{SIGIR-2004,
	address       = "Sheffield, United Kingdom",
	editor        = "Mark Sanderson and Kalervo Järvelin and James Allan and Peter Bruza",
	isbn          = "1-58113-881-4",
	publisher     = "{ACM}",
	title         = "{Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval}",
	year          = 2004,
}

Contents (141 items)

SIGIR-2004-BellGL #challenge #using
Challenges in using lifetime personal information stores (GB, JG, RL), p. 1.
SIGIR-2004-ShahC #retrieval
Evaluating high accuracy retrieval techniques (CS, WBC), pp. 2–9.
SIGIR-2004-AmitayCLS #evaluation #scalability #set #using
Scaling IR-system evaluation using term relevance sets (EA, DC, RL, AS), pp. 10–17.
SIGIR-2004-DiazJ #precise #predict #query #using
Using temporal profiles of queries for precision prediction (FD, RJ), pp. 18–24.
SIGIR-2004-BuckleyV #evaluation #retrieval
Retrieval evaluation with incomplete information (CB, EMV), pp. 25–32.
SIGIR-2004-SandersonJ
Forming test collections with no system pooling (MS, HJ), pp. 33–40.
SIGIR-2004-OardSDHMWRFGMKS #information retrieval #speech
Building an information retrieval test collection for spontaneous conversational speech (DWO, DS, DSD, XH, GCM, JW, BR, MF, SG, JM, LK, SS), pp. 41–48.
SIGIR-2004-FangTZ #formal method #heuristic #information retrieval
A formal study of information retrieval heuristics (HF, TT, CZ), pp. 49–56.
SIGIR-2004-WenLM #probability #retrieval
Probabilistic model for contextual retrieval (JRW, NL, WYM), pp. 57–63.
SIGIR-2004-Nallapati #information retrieval #modelling
Discriminative models for information retrieval (RN), pp. 64–71.
SIGIR-2004-KazaiLV #evaluation #problem #retrieval #xml
The overlap problem in content-oriented XML retrieval evaluation (GK, ML, APdV), pp. 72–79.
SIGIR-2004-KampsRS #normalisation #retrieval #xml
Length normalization in XML retrieval (JK, MdR, BS), pp. 80–87.
SIGIR-2004-LiuZC #configuration management #information retrieval #ranking #xml
Configurable indexing and ranking for XML information retrieval (SL, QZ, WWC), pp. 88–95.
SIGIR-2004-HeCLM #documentation #locality #representation
Locality preserving indexing for document representation (XH, DC, HL, WYM), pp. 96–103.
SIGIR-2004-KokiopoulouS #information retrieval #polynomial #semantics
Polynomial filtering in latent semantic indexing for information retrieval (EK, YS), pp. 104–111.
SIGIR-2004-TangDX #on the #peer-to-peer #scalability #semantics
On scaling latent semantic indexing for large peer-to-peer systems (CT, SD, ZX), pp. 112–121.
SIGIR-2004-Canny #named
GaP: a factor model for discrete data (JFC), pp. 122–129.
SIGIR-2004-LauBS #adaptation #information retrieval
Belief revision for adaptive information retrieval (RYKL, PB, DS), pp. 130–137.
SIGIR-2004-FanLWXF #feedback #ranking #retrieval #robust
Tuning before feedback: combining ranking discovery and blind feedback for robust retrieval (WF, ML, LW, WX, EAF), pp. 138–145.
SIGIR-2004-ChengTCWLC #corpus #information retrieval #query #web
Translating unknown queries with web corpora for cross-language information retrieval (PJC, JWT, RCC, JHW, WHL, LFC), pp. 146–153.
SIGIR-2004-RogatiY #information retrieval
Resource selection for domain-specific cross-lingual IR (MR, YY), pp. 154–161.
SIGIR-2004-ZhangV #automation #information retrieval #using #web
Using the web for automated translation extraction in cross-language information retrieval (YZ, PV), pp. 162–169.
SIGIR-2004-GaoNWC #dependence #information retrieval
Dependence language model for information retrieval (JG, JYN, GW, GC), pp. 170–177.
SIGIR-2004-HiemstraRZ #information retrieval #modelling
Parsimonious language models for information retrieval (DH, SER, HZ), pp. 178–185.
SIGIR-2004-LiuC #clustering #modelling #retrieval #using
Cluster-based retrieval using language models (XL, WBC), pp. 186–193.
SIGIR-2004-KurlandL #ad hoc #corpus #information retrieval #modelling
Corpus structure, language models, and ad hoc information retrieval (OK, LL), pp. 194–201.
SIGIR-2004-XuG #clustering #concept #documentation
Document clustering by concept factorization (WX, YG), pp. 202–209.
SIGIR-2004-ZengHCMM #clustering #learning #web
Learning to cluster web search results (HJZ, QCH, ZC, WYM, JM), pp. 210–217.
SIGIR-2004-LiMO #adaptation #clustering #documentation
Document clustering via adaptive subspace iteration (TL, SM, MO), pp. 218–225.
SIGIR-2004-SiersdorferS #clustering #documentation #self #strict
Restrictive clustering and metaclustering for self-organizing document collections (SS, SS), pp. 226–233.
SIGIR-2004-MladenicBGM #classification #feature model #interactive #linear #modelling #using
Feature selection using linear classifier weights: interaction with classification models (DM, JB, MG, NMF), pp. 234–241.
SIGIR-2004-ShenCYZZLM #classification #summary
Web-page classification through summarization (DS, ZC, QY, HJZ, BZ, YL, WYM), pp. 242–249.
SIGIR-2004-DavidovGM #categorisation #dataset #generative
Parameterized generation of labeled datasets for text categorization based on a hierarchical directory (DD, EG, SM), pp. 250–257.
SIGIR-2004-KimSR #approach #information retrieval #using #word
Information retrieval using word senses: root sense tagging approach (SBK, HCS, HCR), pp. 258–265.
SIGIR-2004-LiuLYM #approach #documentation #effectiveness #retrieval
An effective approach to document retrieval via utilizing WordNet and recognizing phrases (SL, FL, CTY, WM), pp. 266–272.
SIGIR-2004-AmitayHSS #named #web
Web-a-where: geotagging web content (EA, NH, RS, AS), pp. 273–280.
SIGIR-2004-ZhangPZ #machine learning #recognition #using
Focused named entity recognition using machine learning (LZ, YP, TZ), pp. 281–288.
SIGIR-2004-LamHC #learning #mining #similarity
Learning phonetic similarity for matching named entity translations and mining new translations (WL, RH, PSC), pp. 289–296.
SIGIR-2004-KumaranA #classification #detection
Text classification and named entities for new event detection (GK, JA), pp. 297–304.
SIGIR-2004-SilvestriOP #clustering #documentation #identifier
Assigning identifiers to documents to enhance the clustering property of fulltext indexes (FS, SO, RP), pp. 305–312.
SIGIR-2004-TryfonopoulosKD #algorithm #information retrieval #modelling #proximity
Filtering algorithms for information retrieval models with named attributes and proximity operators (CT, MK, YD), pp. 313–320.
SIGIR-2004-BeitzelJCGF #analysis #query #scalability #topic #web
Hourly analysis of a very large topically categorized web query log (SMB, ECJ, AC, DAG, OF), pp. 321–328.
SIGIR-2004-McLaughlinH #algorithm #collaboration #evaluation #experience #metric #user interface
A collaborative filtering algorithm and evaluation metric that accurately model the user experience (MRM, JLH), pp. 329–336.
SIGIR-2004-JinCS #automation #collaboration
An automatic weighting scheme for collaborative filtering (RJ, JYC, LS), pp. 337–344.
SIGIR-2004-Zhang #adaptation #classification #using
Using bayesian priors to combine classifiers for adaptive filtering (YZ0), pp. 345–352.
SIGIR-2004-YuTY #framework #information management #parametricity
A nonparametric hierarchical bayesian framework for information filtering (KY, VT, SY), pp. 353–360.
SIGIR-2004-FanGLX #automation #concept #image #representation #using
Automatic image annotation by using concept-sensitive salient objects for image content representation (JF, YG, HL, GX), pp. 361–368.
SIGIR-2004-RathML #image
A search engine for historical manuscript images (TMR, RM, VL), pp. 369–376.
SIGIR-2004-KellyB #comprehension #feedback
Display time as implicit feedback: understanding task effects (DK, NJB), pp. 377–384.
SIGIR-2004-WuMMTWLLB #topic
Human versus machine in the topic distillation task (MW, GM, AM, MC(T, RW, YL, HJL, NJB), pp. 385–392.
SIGIR-2004-Willett #information retrieval #named
Chemoinformatics: an application domain for information retrieval techniques (PW0), p. 393.
SIGIR-2004-XiLB #effectiveness #learning #ranking
Learning effective ranking functions for newsgroup search (WX, JL, EB), pp. 394–401.
SIGIR-2004-LarkeyFCL #modelling #multi #topic
Language-specific models in multilingual topic tracking (LSL, FF, MEC, VL), pp. 402–409.
SIGIR-2004-ZhangL #integration #taxonomy #web
Web taxonomy integration through co-bootstrapping (DZ, WSL), pp. 410–417.
SIGIR-2004-XuWL #approach #evaluation
Evaluation of an extraction-based approach to answering definitional questions (JX, RMW, AL), pp. 418–424.
SIGIR-2004-ChieuL #query #timeline
Query based event extraction along a timeline (HLC, YKL), pp. 425–432.
SIGIR-2004-GrabskiS
Sentence completion (KG, TS), pp. 433–439.
SIGIR-2004-CaiHWM #analysis
Block-level link analysis (DC, XH, JRW, WYM), pp. 440–447.
SIGIR-2004-PlachourasO #topic
Usefulness of hyperlink structure for query-biased topic distillation (VP, IO), pp. 448–455.
SIGIR-2004-CaiYWM #web
Block-based web search (DC, SY, JRW, WYM), pp. 456–463.
SIGIR-2004-DoranSNDC #generative #hybrid #statistics
A hybrid statistical/linguistic model for generating news story gists (WPD, NS, EN, JD, JC), pp. 464–465.
SIGIR-2004-SandersonP #image
Image based gisting in CLIR (MS, RP), pp. 466–467.
SIGIR-2004-GreevyS #using
Classifying racist texts using a support vector machine (EG, AFS), pp. 468–469.
SIGIR-2004-AzmanO #clustering
Discovery of aggregate usage profiles based on clustering information needs (AA, IO), pp. 470–471.
SIGIR-2004-LuC #network #peer-to-peer #retrieval
Merging retrieval results in hierarchical peer-to-peer networks (JL, JC), pp. 472–473.
SIGIR-2004-SakaiSIKK #evaluation
The effect of back-formulating questions in question answering evaluation (TS, YS, YI, TK, MK), pp. 474–475.
SIGIR-2004-MontgomerySCE #analysis #documentation #empirical #feedback
Effect of varying number of documents in blind feedback: analysis of the 2003 NRRC RIA workshop “bf_numdocs” experiment suite (JM, LS, JC, DAE), pp. 476–477.
SIGIR-2004-GrankaJG #analysis #behaviour
Eye-tracking analysis of user behavior in WWW search (LAG, TJ, GG), pp. 478–479.
SIGIR-2004-ChandrasekarCCB
Subwebs for specialized search (RC, HC, SCO, EB), pp. 480–481.
SIGIR-2004-GuL #comparison #documentation #feedback #information retrieval #using
Comparison of using passages and documents for blind relevance feedback in information retrieval (ZG, ML), pp. 482–483.
SIGIR-2004-CloughS #feedback #pseudo
Measuring pseudo relevance feedback & CLIR (PDC, MS), pp. 484–485.
SIGIR-2004-TaoZ #feedback #pseudo
A two-stage mixture model for pseudo feedback (TT, CZ), pp. 486–487.
SIGIR-2004-CrestanL #natural language
Natural language processing for browse help (EC, CdL), pp. 488–489.
SIGIR-2004-MayfieldM
Triangulation without translation (JM, PM), pp. 490–491.
SIGIR-2004-SriramSZ
A session-based search engine (SS, XS, CZ), pp. 492–493.
SIGIR-2004-BeitzelJCGF04a #evaluation
Evaluation of filtering current news search results (SMB, ECJ, AC, DAG, OF), pp. 494–495.
SIGIR-2004-HoenkampS #documentation #markov
The document as an ergodic markov chain (EH, DS), pp. 496–497.
SIGIR-2004-DAmore #community #detection
Expertise community detection (RJD), pp. 498–499.
SIGIR-2004-RoussinovR #learning #web
Learning patterns to answer open domain questions on the web (DR, JARF), pp. 500–501.
SIGIR-2004-Leuski #email #people
Email is a stage: discovering people roles from email archives (AL), pp. 502–503.
SIGIR-2004-ShahS #database
Searching databases for sematically-related schemas (GS, TFSM), pp. 504–505.
SIGIR-2004-Buckley #comparative #predict #ranking #retrieval #topic
Topic prediction based on comparative retrieval rankings (CB), pp. 506–507.
SIGIR-2004-LiddyDY #evaluation
Context-based question-answering evaluation (EDL, AD, OY), pp. 508–509.
SIGIR-2004-SunHW #comprehension #design #user interface #visualisation
Design of an e-book user interface and visualizations to support reading for comprehension (YS, DJH, SNKW), pp. 510–511.
SIGIR-2004-HawkingUC #towards
Toward better weighting of anchors (DH, TU, NC), pp. 512–513.
SIGIR-2004-YeS #clustering #retrieval
Aggregated feature retrieval for MPEG-7 via clustering (JY, AFS), pp. 514–515.
SIGIR-2004-Corrada-EmmanuelC #modelling #retrieval
Answer models for question answering passage retrieval (ACE, WBC), pp. 516–517.
SIGIR-2004-WuG #collaboration #documentation #repository
Collaborative filing in a document repository (HW, MDG), pp. 518–519.
SIGIR-2004-WhiteJ #case study #metric #similarity #topic
A study of topic similarity measures (RWW, JMJ), pp. 520–521.
SIGIR-2004-YangC #classification #effectiveness #web
Effectiveness of web page classification on finding list answers (HY, TSC), pp. 522–523.
SIGIR-2004-ZhangV04a #detection #query
Detection and translation of OOV terms prior to query time (YZ, PV), pp. 524–525.
SIGIR-2004-NemethST #automation #evaluation #interactive #query
Evaluation of the real and perceived value of automatic and interactive query expansion (YN, BS, MTM), pp. 526–527.
SIGIR-2004-HarmanB #information management #reliability
The NRRC reliable information access (RIA) workshop (DH, CB), pp. 528–529.
SIGIR-2004-Soboroff #documentation #on the #web
On evaluating web search with very few relevant documents (IS), pp. 530–531.
SIGIR-2004-LiKGO #music #recommendation
A music recommender based on audio features (QL, BMK, DG, DwO), pp. 532–533.
SIGIR-2004-MaS #information management #using
Information extraction using two-phase pattern discovery (LM, JS), pp. 534–535.
SIGIR-2004-LuZT #documentation
A search engine for imaged documents in PDF files (YL, LZ, CLT), pp. 536–537.
SIGIR-2004-LiuCKG #predict
Context sensitive vocabulary and its application in protein secondary structure prediction (YL, JGC, JKS, VG), pp. 538–539.
SIGIR-2004-MetzlerLC #modelling #multi
Formal multiple-bernoulli models for language modeling (DM, VL, WBC), pp. 540–541.
SIGIR-2004-AzzopardiGR #documentation #modelling
User biased document language modelling (LA, MG, CJvR), pp. 542–543.
SIGIR-2004-Collins-ThompsonC #information retrieval #overview
Information retrieval for language tutoring: an overview of the REAP project (KCT, JC), pp. 544–545.
SIGIR-2004-XuU #analysis #mining #ranking #web
A unified model of literal mining and link analysis for ranking web resources (YX, KU), pp. 546–547.
SIGIR-2004-LiuCOH #automation #query #recognition
Automatic recognition of reading levels from user queries (XL, WBC, PO, DMH), pp. 548–549.
SIGIR-2004-BasilicoH #collaboration #framework
A joint framework for collaborative and content filtering (JB, TH), pp. 550–551.
SIGIR-2004-KimCK #dependence #documentation #using
Refining term weights of documents using term dependencies (HSK, IC, MK), pp. 552–553.
SIGIR-2004-SigurbjornssonKR #multi #retrieval #xml
Multiple sources of evidence for XML retrieval (BS, JK, MdR), pp. 554–555.
SIGIR-2004-TsengT #categorisation #verification
Verifying a Chinese collection for text categorization (YHT, WJT), pp. 556–557.
SIGIR-2004-HedleyYJS #documentation #web
Query-related data extraction of hidden web documents (YLH, MY, AEJ, MS), pp. 558–559.
SIGIR-2004-FujiiIK #retrieval
The patent retrieval task in the fourth NTCIR workshop (AF, MI, NK), pp. 560–561.
SIGIR-2004-Voorhees #effectiveness
Measuring ineffectiveness (EMV), pp. 562–563.
SIGIR-2004-Cowans #information retrieval #process #using
Information retrieval using hierarchical dirichlet processes (PJC), pp. 564–565.
SIGIR-2004-GowederPR #detection #information retrieval
Broken plural detection for arabic information retrieval (AG, MP, ANDR), pp. 566–567.
SIGIR-2004-JinS #case study #collaboration #normalisation
A study of methods for normalizing user ratings in collaborative filtering (RJ, LS), pp. 568–569.
SIGIR-2004-WarrenL #feedback #information management #overview #reliability
A review of relevance feedback experiments at the 2003 reliable information access (RIA) workshop (RHW, TL), pp. 570–571.
SIGIR-2004-LiuHW #community #information management
Supporting federated information sharing communities (BL, DJH, SNKW), pp. 572–573.
SIGIR-2004-Collins-ThompsonCTC #documentation #performance #quality #retrieval
The effect of document retrieval quality on factoid question answering performance (KCT, JC, ELT, CLAC), pp. 574–575.
SIGIR-2004-UpstillR #recommendation #web
Exploiting hyperlink recommendation evidence in navigational web search (TU, SER), pp. 576–577.
SIGIR-2004-HunnisettT #categorisation
Context-based methods for text categorisation (DSH, WJT), pp. 578–579.
SIGIR-2004-AeryC #classification #email #named
eMailSift: mining-based approaches to email classification (MA, SC), pp. 580–581.
SIGIR-2004-ConradS #corpus #detection
Constructing a text corpus for inexact duplicate detection (JGC, CPS), pp. 582–583.
SIGIR-2004-Buckley04a #information retrieval #why
Why current IR engines fail (CB), pp. 584–585.
SIGIR-2004-Zahariev #ambiguity #automation
Automatic sense disambiguation for acronyms (MZ), pp. 586–587.
SIGIR-2004-SomloH #web
Filtering for personal web information agents (GS, AEH), pp. 588–589.
SIGIR-2004-ChristelMH #image #retrieval #video
Evaluating content-based filters for image and video retrieval (MGC, NM, CH), pp. 590–591.
SIGIR-2004-FanL #classification #semantics #video
Semantic video classification by integrating unlabeled samples for classifier training (JF, HL), pp. 592–593.
SIGIR-2004-DumaisCSH #query
Implicit queries (IQ) for contextualized search (STD, EC, RS, EH), p. 594.
SIGIR-2004-WhiteJ04a #predict
An implicit system for predicting interests (RWW, JMJ), p. 595.
SIGIR-2004-GeyCLC #documentation #multi #query
Geotemporal querying of multilingual documents (FCG, AC, RRL, KC), p. 596.
SIGIR-2004-ShenSZ #named
ACES: a contextual engine for search (XS, SS, CZ), p. 597.
SIGIR-2004-ChapmanDC #named #semantics #web
Armadillo: harvesting information for the semantic web (SC, AD, FC), p. 598.
SIGIR-2004-KruschwitzA #automation #named
UKSearch: search with automatically acquired domain knowledge (UK, HAB), p. 599.
SIGIR-2004-LarsonF #information retrieval #what
Geographic information retrieval (GIR): searching where and what (RRL, PF), p. 600.
SIGIR-2004-Bot #algorithm #documentation #feedback #representation
Improving document representation by accumulating relevance feedback (abstract only): the relevance feedback accumulation algorithm (RSB), p. 602.
SIGIR-2004-Gnasa #information management #online #question
Sharing knowledge online (abstract only): a dream or reality? (MG), p. 602.
SIGIR-2004-Leidner
Toponym resolution in text (abstract only): “which sheffield is it?” (JLL), p. 602.
SIGIR-2004-Liu #community #information management
Supporting federated information sharing communities (BL), p. 602.
SIGIR-2004-Martin #natural language #reliability #verification #web
Reliability and verification of natural language text on the world wide web (MJM), p. 603.
SIGIR-2004-Ogilvie #comprehension #generative #information retrieval #modelling #probability #using
Understanding combination of evidence using generative probabilistic models for information retrieval (PO), p. 603.
SIGIR-2004-Sun #comprehension #representation
Discovering and representing the contextual and narrative structure of e-books to support reading and comprehension (YS), p. 603.
SIGIR-2004-Trotman #approach #information retrieval
An artificial intelligence approach to information retrieval (AT), p. 603.
SIGIR-2004-Yuan #framework #multi
Supporting multiple information-seeking strategies in a single system framework (XY), p. 604.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.