Mark Sanderson, Kalervo Järvelin, James Allan, Peter Bruza
Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval
SIGIR, 2004.
@proceedings{SIGIR-2004, address = "Sheffield, United Kingdom", editor = "Mark Sanderson and Kalervo Järvelin and James Allan and Peter Bruza", isbn = "1-58113-881-4", publisher = "{ACM}", title = "{Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval}", year = 2004, }
Contents (141 items)
- SIGIR-2004-BellGL #challenge #using
- Challenges in using lifetime personal information stores (GB, JG, RL), p. 1.
- SIGIR-2004-ShahC #retrieval
- Evaluating high accuracy retrieval techniques (CS, WBC), pp. 2–9.
- SIGIR-2004-AmitayCLS #evaluation #scalability #set #using
- Scaling IR-system evaluation using term relevance sets (EA, DC, RL, AS), pp. 10–17.
- SIGIR-2004-DiazJ #precise #predict #query #using
- Using temporal profiles of queries for precision prediction (FD, RJ), pp. 18–24.
- SIGIR-2004-BuckleyV #evaluation #retrieval
- Retrieval evaluation with incomplete information (CB, EMV), pp. 25–32.
- SIGIR-2004-SandersonJ
- Forming test collections with no system pooling (MS, HJ), pp. 33–40.
- SIGIR-2004-OardSDHMWRFGMKS #information retrieval #speech
- Building an information retrieval test collection for spontaneous conversational speech (DWO, DS, DSD, XH, GCM, JW, BR, MF, SG, JM, LK, SS), pp. 41–48.
- SIGIR-2004-FangTZ #formal method #heuristic #information retrieval
- A formal study of information retrieval heuristics (HF, TT, CZ), pp. 49–56.
- SIGIR-2004-WenLM #probability #retrieval
- Probabilistic model for contextual retrieval (JRW, NL, WYM), pp. 57–63.
- SIGIR-2004-Nallapati #information retrieval #modelling
- Discriminative models for information retrieval (RN), pp. 64–71.
- SIGIR-2004-KazaiLV #evaluation #problem #retrieval #xml
- The overlap problem in content-oriented XML retrieval evaluation (GK, ML, APdV), pp. 72–79.
- SIGIR-2004-KampsRS #normalisation #retrieval #xml
- Length normalization in XML retrieval (JK, MdR, BS), pp. 80–87.
- SIGIR-2004-LiuZC #configuration management #information retrieval #ranking #xml
- Configurable indexing and ranking for XML information retrieval (SL, QZ, WWC), pp. 88–95.
- SIGIR-2004-HeCLM #documentation #locality #representation
- Locality preserving indexing for document representation (XH, DC, HL, WYM), pp. 96–103.
- SIGIR-2004-KokiopoulouS #information retrieval #polynomial #semantics
- Polynomial filtering in latent semantic indexing for information retrieval (EK, YS), pp. 104–111.
- SIGIR-2004-TangDX #on the #peer-to-peer #scalability #semantics
- On scaling latent semantic indexing for large peer-to-peer systems (CT, SD, ZX), pp. 112–121.
- SIGIR-2004-Canny #named
- GaP: a factor model for discrete data (JFC), pp. 122–129.
- SIGIR-2004-LauBS #adaptation #information retrieval
- Belief revision for adaptive information retrieval (RYKL, PB, DS), pp. 130–137.
- SIGIR-2004-FanLWXF #feedback #ranking #retrieval #robust
- Tuning before feedback: combining ranking discovery and blind feedback for robust retrieval (WF, ML, LW, WX, EAF), pp. 138–145.
- SIGIR-2004-ChengTCWLC #corpus #information retrieval #query #web
- Translating unknown queries with web corpora for cross-language information retrieval (PJC, JWT, RCC, JHW, WHL, LFC), pp. 146–153.
- SIGIR-2004-RogatiY #information retrieval
- Resource selection for domain-specific cross-lingual IR (MR, YY), pp. 154–161.
- SIGIR-2004-ZhangV #automation #information retrieval #using #web
- Using the web for automated translation extraction in cross-language information retrieval (YZ, PV), pp. 162–169.
- SIGIR-2004-GaoNWC #dependence #information retrieval
- Dependence language model for information retrieval (JG, JYN, GW, GC), pp. 170–177.
- SIGIR-2004-HiemstraRZ #information retrieval #modelling
- Parsimonious language models for information retrieval (DH, SER, HZ), pp. 178–185.
- SIGIR-2004-LiuC #clustering #modelling #retrieval #using
- Cluster-based retrieval using language models (XL, WBC), pp. 186–193.
- SIGIR-2004-KurlandL #ad hoc #corpus #information retrieval #modelling
- Corpus structure, language models, and ad hoc information retrieval (OK, LL), pp. 194–201.
- SIGIR-2004-XuG #clustering #concept #documentation
- Document clustering by concept factorization (WX, YG), pp. 202–209.
- SIGIR-2004-ZengHCMM #clustering #learning #web
- Learning to cluster web search results (HJZ, QCH, ZC, WYM, JM), pp. 210–217.
- SIGIR-2004-LiMO #adaptation #clustering #documentation
- Document clustering via adaptive subspace iteration (TL, SM, MO), pp. 218–225.
- SIGIR-2004-SiersdorferS #clustering #documentation #self #strict
- Restrictive clustering and metaclustering for self-organizing document collections (SS, SS), pp. 226–233.
- SIGIR-2004-MladenicBGM #classification #feature model #interactive #linear #modelling #using
- Feature selection using linear classifier weights: interaction with classification models (DM, JB, MG, NMF), pp. 234–241.
- SIGIR-2004-ShenCYZZLM #classification #summary
- Web-page classification through summarization (DS, ZC, QY, HJZ, BZ, YL, WYM), pp. 242–249.
- SIGIR-2004-DavidovGM #categorisation #dataset #generative
- Parameterized generation of labeled datasets for text categorization based on a hierarchical directory (DD, EG, SM), pp. 250–257.
- SIGIR-2004-KimSR #approach #information retrieval #using #word
- Information retrieval using word senses: root sense tagging approach (SBK, HCS, HCR), pp. 258–265.
- SIGIR-2004-LiuLYM #approach #documentation #effectiveness #retrieval
- An effective approach to document retrieval via utilizing WordNet and recognizing phrases (SL, FL, CTY, WM), pp. 266–272.
- SIGIR-2004-AmitayHSS #named #web
- Web-a-where: geotagging web content (EA, NH, RS, AS), pp. 273–280.
- SIGIR-2004-ZhangPZ #machine learning #recognition #using
- Focused named entity recognition using machine learning (LZ, YP, TZ), pp. 281–288.
- SIGIR-2004-LamHC #learning #mining #similarity
- Learning phonetic similarity for matching named entity translations and mining new translations (WL, RH, PSC), pp. 289–296.
- SIGIR-2004-KumaranA #classification #detection
- Text classification and named entities for new event detection (GK, JA), pp. 297–304.
- SIGIR-2004-SilvestriOP #clustering #documentation #identifier
- Assigning identifiers to documents to enhance the clustering property of fulltext indexes (FS, SO, RP), pp. 305–312.
- SIGIR-2004-TryfonopoulosKD #algorithm #information retrieval #modelling #proximity
- Filtering algorithms for information retrieval models with named attributes and proximity operators (CT, MK, YD), pp. 313–320.
- SIGIR-2004-BeitzelJCGF #analysis #query #scalability #topic #web
- Hourly analysis of a very large topically categorized web query log (SMB, ECJ, AC, DAG, OF), pp. 321–328.
- SIGIR-2004-McLaughlinH #algorithm #collaboration #evaluation #experience #metric #user interface
- A collaborative filtering algorithm and evaluation metric that accurately model the user experience (MRM, JLH), pp. 329–336.
- SIGIR-2004-JinCS #automation #collaboration
- An automatic weighting scheme for collaborative filtering (RJ, JYC, LS), pp. 337–344.
- SIGIR-2004-Zhang #adaptation #classification #using
- Using bayesian priors to combine classifiers for adaptive filtering (YZ0), pp. 345–352.
- SIGIR-2004-YuTY #framework #information management #parametricity
- A nonparametric hierarchical bayesian framework for information filtering (KY, VT, SY), pp. 353–360.
- SIGIR-2004-FanGLX #automation #concept #image #representation #using
- Automatic image annotation by using concept-sensitive salient objects for image content representation (JF, YG, HL, GX), pp. 361–368.
- SIGIR-2004-RathML #image
- A search engine for historical manuscript images (TMR, RM, VL), pp. 369–376.
- SIGIR-2004-KellyB #comprehension #feedback
- Display time as implicit feedback: understanding task effects (DK, NJB), pp. 377–384.
- SIGIR-2004-WuMMTWLLB #topic
- Human versus machine in the topic distillation task (MW, GM, AM, MC(T, RW, YL, HJL, NJB), pp. 385–392.
- SIGIR-2004-Willett #information retrieval #named
- Chemoinformatics: an application domain for information retrieval techniques (PW0), p. 393.
- SIGIR-2004-XiLB #effectiveness #learning #ranking
- Learning effective ranking functions for newsgroup search (WX, JL, EB), pp. 394–401.
- SIGIR-2004-LarkeyFCL #modelling #multi #topic
- Language-specific models in multilingual topic tracking (LSL, FF, MEC, VL), pp. 402–409.
- SIGIR-2004-ZhangL #integration #taxonomy #web
- Web taxonomy integration through co-bootstrapping (DZ, WSL), pp. 410–417.
- SIGIR-2004-XuWL #approach #evaluation
- Evaluation of an extraction-based approach to answering definitional questions (JX, RMW, AL), pp. 418–424.
- SIGIR-2004-ChieuL #query #timeline
- Query based event extraction along a timeline (HLC, YKL), pp. 425–432.
- SIGIR-2004-GrabskiS
- Sentence completion (KG, TS), pp. 433–439.
- SIGIR-2004-CaiHWM #analysis
- Block-level link analysis (DC, XH, JRW, WYM), pp. 440–447.
- SIGIR-2004-PlachourasO #topic
- Usefulness of hyperlink structure for query-biased topic distillation (VP, IO), pp. 448–455.
- SIGIR-2004-CaiYWM #web
- Block-based web search (DC, SY, JRW, WYM), pp. 456–463.
- SIGIR-2004-DoranSNDC #generative #hybrid #statistics
- A hybrid statistical/linguistic model for generating news story gists (WPD, NS, EN, JD, JC), pp. 464–465.
- SIGIR-2004-SandersonP #image
- Image based gisting in CLIR (MS, RP), pp. 466–467.
- SIGIR-2004-GreevyS #using
- Classifying racist texts using a support vector machine (EG, AFS), pp. 468–469.
- SIGIR-2004-AzmanO #clustering
- Discovery of aggregate usage profiles based on clustering information needs (AA, IO), pp. 470–471.
- SIGIR-2004-LuC #network #peer-to-peer #retrieval
- Merging retrieval results in hierarchical peer-to-peer networks (JL, JC), pp. 472–473.
- SIGIR-2004-SakaiSIKK #evaluation
- The effect of back-formulating questions in question answering evaluation (TS, YS, YI, TK, MK), pp. 474–475.
- SIGIR-2004-MontgomerySCE #analysis #documentation #empirical #feedback
- Effect of varying number of documents in blind feedback: analysis of the 2003 NRRC RIA workshop “bf_numdocs” experiment suite (JM, LS, JC, DAE), pp. 476–477.
- SIGIR-2004-GrankaJG #analysis #behaviour
- Eye-tracking analysis of user behavior in WWW search (LAG, TJ, GG), pp. 478–479.
- SIGIR-2004-ChandrasekarCCB
- Subwebs for specialized search (RC, HC, SCO, EB), pp. 480–481.
- SIGIR-2004-GuL #comparison #documentation #feedback #information retrieval #using
- Comparison of using passages and documents for blind relevance feedback in information retrieval (ZG, ML), pp. 482–483.
- SIGIR-2004-CloughS #feedback #pseudo
- Measuring pseudo relevance feedback & CLIR (PDC, MS), pp. 484–485.
- SIGIR-2004-TaoZ #feedback #pseudo
- A two-stage mixture model for pseudo feedback (TT, CZ), pp. 486–487.
- SIGIR-2004-CrestanL #natural language
- Natural language processing for browse help (EC, CdL), pp. 488–489.
- SIGIR-2004-MayfieldM
- Triangulation without translation (JM, PM), pp. 490–491.
- SIGIR-2004-SriramSZ
- A session-based search engine (SS, XS, CZ), pp. 492–493.
- SIGIR-2004-BeitzelJCGF04a #evaluation
- Evaluation of filtering current news search results (SMB, ECJ, AC, DAG, OF), pp. 494–495.
- SIGIR-2004-HoenkampS #documentation #markov
- The document as an ergodic markov chain (EH, DS), pp. 496–497.
- SIGIR-2004-DAmore #community #detection
- Expertise community detection (RJD), pp. 498–499.
- SIGIR-2004-RoussinovR #learning #web
- Learning patterns to answer open domain questions on the web (DR, JARF), pp. 500–501.
- SIGIR-2004-Leuski #email #people
- Email is a stage: discovering people roles from email archives (AL), pp. 502–503.
- SIGIR-2004-ShahS #database
- Searching databases for sematically-related schemas (GS, TFSM), pp. 504–505.
- SIGIR-2004-Buckley #comparative #predict #ranking #retrieval #topic
- Topic prediction based on comparative retrieval rankings (CB), pp. 506–507.
- SIGIR-2004-LiddyDY #evaluation
- Context-based question-answering evaluation (EDL, AD, OY), pp. 508–509.
- SIGIR-2004-SunHW #comprehension #design #user interface #visualisation
- Design of an e-book user interface and visualizations to support reading for comprehension (YS, DJH, SNKW), pp. 510–511.
- SIGIR-2004-HawkingUC #towards
- Toward better weighting of anchors (DH, TU, NC), pp. 512–513.
- SIGIR-2004-YeS #clustering #retrieval
- Aggregated feature retrieval for MPEG-7 via clustering (JY, AFS), pp. 514–515.
- SIGIR-2004-Corrada-EmmanuelC #modelling #retrieval
- Answer models for question answering passage retrieval (ACE, WBC), pp. 516–517.
- SIGIR-2004-WuG #collaboration #documentation #repository
- Collaborative filing in a document repository (HW, MDG), pp. 518–519.
- SIGIR-2004-WhiteJ #case study #metric #similarity #topic
- A study of topic similarity measures (RWW, JMJ), pp. 520–521.
- SIGIR-2004-YangC #classification #effectiveness #web
- Effectiveness of web page classification on finding list answers (HY, TSC), pp. 522–523.
- SIGIR-2004-ZhangV04a #detection #query
- Detection and translation of OOV terms prior to query time (YZ, PV), pp. 524–525.
- SIGIR-2004-NemethST #automation #evaluation #interactive #query
- Evaluation of the real and perceived value of automatic and interactive query expansion (YN, BS, MTM), pp. 526–527.
- SIGIR-2004-HarmanB #information management #reliability
- The NRRC reliable information access (RIA) workshop (DH, CB), pp. 528–529.
- SIGIR-2004-Soboroff #documentation #on the #web
- On evaluating web search with very few relevant documents (IS), pp. 530–531.
- SIGIR-2004-LiKGO #music #recommendation
- A music recommender based on audio features (QL, BMK, DG, DwO), pp. 532–533.
- SIGIR-2004-MaS #information management #using
- Information extraction using two-phase pattern discovery (LM, JS), pp. 534–535.
- SIGIR-2004-LuZT #documentation
- A search engine for imaged documents in PDF files (YL, LZ, CLT), pp. 536–537.
- SIGIR-2004-LiuCKG #predict
- Context sensitive vocabulary and its application in protein secondary structure prediction (YL, JGC, JKS, VG), pp. 538–539.
- SIGIR-2004-MetzlerLC #modelling #multi
- Formal multiple-bernoulli models for language modeling (DM, VL, WBC), pp. 540–541.
- SIGIR-2004-AzzopardiGR #documentation #modelling
- User biased document language modelling (LA, MG, CJvR), pp. 542–543.
- SIGIR-2004-Collins-ThompsonC #information retrieval #overview
- Information retrieval for language tutoring: an overview of the REAP project (KCT, JC), pp. 544–545.
- SIGIR-2004-XuU #analysis #mining #ranking #web
- A unified model of literal mining and link analysis for ranking web resources (YX, KU), pp. 546–547.
- SIGIR-2004-LiuCOH #automation #query #recognition
- Automatic recognition of reading levels from user queries (XL, WBC, PO, DMH), pp. 548–549.
- SIGIR-2004-BasilicoH #collaboration #framework
- A joint framework for collaborative and content filtering (JB, TH), pp. 550–551.
- SIGIR-2004-KimCK #dependence #documentation #using
- Refining term weights of documents using term dependencies (HSK, IC, MK), pp. 552–553.
- SIGIR-2004-SigurbjornssonKR #multi #retrieval #xml
- Multiple sources of evidence for XML retrieval (BS, JK, MdR), pp. 554–555.
- SIGIR-2004-TsengT #categorisation #verification
- Verifying a Chinese collection for text categorization (YHT, WJT), pp. 556–557.
- SIGIR-2004-HedleyYJS #documentation #web
- Query-related data extraction of hidden web documents (YLH, MY, AEJ, MS), pp. 558–559.
- SIGIR-2004-FujiiIK #retrieval
- The patent retrieval task in the fourth NTCIR workshop (AF, MI, NK), pp. 560–561.
- SIGIR-2004-Voorhees #effectiveness
- Measuring ineffectiveness (EMV), pp. 562–563.
- SIGIR-2004-Cowans #information retrieval #process #using
- Information retrieval using hierarchical dirichlet processes (PJC), pp. 564–565.
- SIGIR-2004-GowederPR #detection #information retrieval
- Broken plural detection for arabic information retrieval (AG, MP, ANDR), pp. 566–567.
- SIGIR-2004-JinS #case study #collaboration #normalisation
- A study of methods for normalizing user ratings in collaborative filtering (RJ, LS), pp. 568–569.
- SIGIR-2004-WarrenL #feedback #information management #overview #reliability
- A review of relevance feedback experiments at the 2003 reliable information access (RIA) workshop (RHW, TL), pp. 570–571.
- SIGIR-2004-LiuHW #community #information management
- Supporting federated information sharing communities (BL, DJH, SNKW), pp. 572–573.
- SIGIR-2004-Collins-ThompsonCTC #documentation #performance #quality #retrieval
- The effect of document retrieval quality on factoid question answering performance (KCT, JC, ELT, CLAC), pp. 574–575.
- SIGIR-2004-UpstillR #recommendation #web
- Exploiting hyperlink recommendation evidence in navigational web search (TU, SER), pp. 576–577.
- SIGIR-2004-HunnisettT #categorisation
- Context-based methods for text categorisation (DSH, WJT), pp. 578–579.
- SIGIR-2004-AeryC #classification #email #named
- eMailSift: mining-based approaches to email classification (MA, SC), pp. 580–581.
- SIGIR-2004-ConradS #corpus #detection
- Constructing a text corpus for inexact duplicate detection (JGC, CPS), pp. 582–583.
- SIGIR-2004-Buckley04a #information retrieval #why
- Why current IR engines fail (CB), pp. 584–585.
- SIGIR-2004-Zahariev #ambiguity #automation
- Automatic sense disambiguation for acronyms (MZ), pp. 586–587.
- SIGIR-2004-SomloH #web
- Filtering for personal web information agents (GS, AEH), pp. 588–589.
- SIGIR-2004-ChristelMH #image #retrieval #video
- Evaluating content-based filters for image and video retrieval (MGC, NM, CH), pp. 590–591.
- SIGIR-2004-FanL #classification #semantics #video
- Semantic video classification by integrating unlabeled samples for classifier training (JF, HL), pp. 592–593.
- SIGIR-2004-DumaisCSH #query
- Implicit queries (IQ) for contextualized search (STD, EC, RS, EH), p. 594.
- SIGIR-2004-WhiteJ04a #predict
- An implicit system for predicting interests (RWW, JMJ), p. 595.
- SIGIR-2004-GeyCLC #documentation #multi #query
- Geotemporal querying of multilingual documents (FCG, AC, RRL, KC), p. 596.
- SIGIR-2004-ShenSZ #named
- ACES: a contextual engine for search (XS, SS, CZ), p. 597.
- SIGIR-2004-ChapmanDC #named #semantics #web
- Armadillo: harvesting information for the semantic web (SC, AD, FC), p. 598.
- SIGIR-2004-KruschwitzA #automation #named
- UKSearch: search with automatically acquired domain knowledge (UK, HAB), p. 599.
- SIGIR-2004-LarsonF #information retrieval #what
- Geographic information retrieval (GIR): searching where and what (RRL, PF), p. 600.
- SIGIR-2004-Bot #algorithm #documentation #feedback #representation
- Improving document representation by accumulating relevance feedback (abstract only): the relevance feedback accumulation algorithm (RSB), p. 602.
- SIGIR-2004-Gnasa #information management #online #question
- Sharing knowledge online (abstract only): a dream or reality? (MG), p. 602.
- SIGIR-2004-Leidner
- Toponym resolution in text (abstract only): “which sheffield is it?” (JLL), p. 602.
- SIGIR-2004-Liu #community #information management
- Supporting federated information sharing communities (BL), p. 602.
- SIGIR-2004-Martin #natural language #reliability #verification #web
- Reliability and verification of natural language text on the world wide web (MJM), p. 603.
- SIGIR-2004-Ogilvie #comprehension #generative #information retrieval #modelling #probability #using
- Understanding combination of evidence using generative probabilistic models for information retrieval (PO), p. 603.
- SIGIR-2004-Sun #comprehension #representation
- Discovering and representing the contextual and narrative structure of e-books to support reading and comprehension (YS), p. 603.
- SIGIR-2004-Trotman #approach #information retrieval
- An artificial intelligence approach to information retrieval (AT), p. 603.
- SIGIR-2004-Yuan #framework #multi
- Supporting multiple information-seeking strategies in a single system framework (XY), p. 604.
23 ×#information retrieval
18 ×#documentation
16 ×#retrieval
16 ×#using
16 ×#web
11 ×#modelling
9 ×#evaluation
9 ×#query
8 ×#clustering
8 ×#feedback
18 ×#documentation
16 ×#retrieval
16 ×#using
16 ×#web
11 ×#modelling
9 ×#evaluation
9 ×#query
8 ×#clustering
8 ×#feedback