Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval
SIGIR, 2003.
@proceedings{SIGIR-2003,
	address = "Toronto, Canada",
	isbn = "1-58113-646-3",
	publisher = "{ACM}",
	title = "{Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval}",
	year = 2003,
}
Contents (106 items)
- SIGIR-2003-Broder #graph #modelling #using #web
- Keynote Address — exploring, modeling, and using the web graph (AZB), p. 1.
- SIGIR-2003-Croft #evolution #information retrieval
- Salton Award Lecture — Information retrieval and computer science: an evolving relationship (WBC), pp. 2–3.
- SIGIR-2003-ZaragozaHT #ad hoc #information retrieval
- Bayesian extension to the language model for ad hoc information retrieval (HZ, DH, MET), pp. 4–9.
- SIGIR-2003-ZhaiCL #evaluation #independence #metric #retrieval #topic
- Beyond independent relevance: methods and evaluation metrics for subtopic retrieval (CZ, WWC, JDL), pp. 10–17.
- SIGIR-2003-TeevanK #analysis #development #empirical #exponential #probability #retrieval #using
- Empirical development of an exponential probabilistic model for text retrieval: using textual analysis to build a better model (JT, DRK), pp. 18–25.
- SIGIR-2003-ZhangL #classification #using
- Question classification using support vector machines (DZ, WSL), pp. 26–32.
- SIGIR-2003-YangCWK #using
- Structured use of external knowledge for event-based open domain question answering (HY, TSC, SW, CKK), pp. 33–40.
- SIGIR-2003-TellexKLFM #algorithm #evaluation #retrieval
- Quantitative evaluation of passage retrieval algorithms for question answering (ST, BK, JJL, AF, GM), pp. 41–47.
- SIGIR-2003-ChenLWPM #web
- Building a web thesaurus from web link structure (ZC, SL, LW, GP, WYM), pp. 48–55.
- SIGIR-2003-XueZCMZL #analysis #web
- Implicit link analysis for small web search (GRX, HJZ, ZC, WYM, HZ, CJL), pp. 56–63.
- SIGIR-2003-KangK #classification #documentation #query #retrieval #web
- Query type classification for web document retrieval (IHK, GCK), pp. 64–71.
- SIGIR-2003-DumaisCCJSR #information retrieval
- Stuff I’ve seen: a system for personal information retrieval and re-use (STD, EC, JJC, GJ, RS, DCR), pp. 72–79.
- SIGIR-2003-McDonaldT #image #retrieval
- Search strategies in content-based image retrieval (SM, JT), pp. 80–87.
- SIGIR-2003-Anick #feedback #refinement #using #web
- Using terminological feedback for web search refinement: a log-based study (PGA), pp. 88–95.
- SIGIR-2003-YangZK #analysis #categorisation #classification #scalability
- A scalability analysis of classifiers in text categorization (YY, JZ, BK), pp. 96–103.
- SIGIR-2003-KhmelevT #categorisation #verification
- A repetition based measure for verification of text collections and for text categorization (DVK, WJT), pp. 104–110.
- SIGIR-2003-Bennett #classification #probability #symmetry #using
- Using asymmetric distributions to improve text classifier probability estimates (PNB), pp. 111–118.
- SIGIR-2003-JeonLM #automation #image #modelling #retrieval #using
- Automatic image annotation and retrieval using cross-media relevance models (JJ, VL, RM), pp. 119–126.
- SIGIR-2003-BleiJ #modelling
- Modeling annotated data (DMB, MIJ), pp. 127–134.
- SIGIR-2003-WesterveldV #analysis #generative #image #probability #retrieval
- Experimental result analysis for a generative probabilistic image retrieval model (TW, APdV), pp. 135–142.
- SIGIR-2003-OgilvieC #documentation
- Combining document representations for known-item search (PO, JPC), pp. 143–150.
- SIGIR-2003-CarmelMMMS #documentation #xml
- Searching XML documents via XML fragments (DC, YSM, MM, YM, AS), pp. 151–158.
- SIGIR-2003-StokoeOT #ambiguity #information retrieval #revisited #word
- Word sense disambiguation in information retrieval revisited (CS, MPO, JT), pp. 159–166.
- SIGIR-2003-TsuruokaT #generative #probability
- Probabilistic term variant generator for biomedical terms (YT, JT), pp. 167–173.
- SIGIR-2003-GaoWLC #approach #categorisation #learning
- A maximal figure-of-merit learning approach to text categorization (SG, WW, CHL, TSC), pp. 174–181.
- SIGIR-2003-CaiH #automation #categorisation #concept
- Text categorization by boosting automatically extracted concepts (LC, TH), pp. 182–189.
- SIGIR-2003-ZhangY #categorisation #classification #linear #robust
- Robustness of regularized linear classification methods in text categorization (JZ, YY), pp. 190–197.
- SIGIR-2003-NanasUR #concept #representation
- Building and applying a concept hierarchy representation of a user profile (NN, VSU, ANDR), pp. 198–204.
- SIGIR-2003-BelkinKKKLMTYC #information retrieval #interactive #query
- Query length in interactive information retrieval (NJB, DK, GK, JYK, HJL, GM, MCT, XJY, CC), pp. 205–212.
- SIGIR-2003-Ruthven #effectiveness #interactive #query
- Re-examining the potential effectiveness of interactive query expansion (IR), pp. 213–220.
- SIGIR-2003-Dupret #analysis #concept #orthogonal #semantics
- Latent concepts and the number orthogonal factors in latent semantic analysis (GD), pp. 221–226.
- SIGIR-2003-Roelleke #probability
- A frequency-based and a poisson-based definition of the probability of being informative (TR), pp. 227–234.
- SIGIR-2003-PintoMWC #random #using
- Table extraction using conditional random fields (DP, AM, XW, WBC), pp. 235–242.
- SIGIR-2003-SoboroffR
- Building a filtering test collection for TREC 2002 (IS, SER), pp. 243–250.
- SIGIR-2003-IwayamaFKM #documentation #empirical #modelling #retrieval
- An empirical study on retrieval models for different document genres: patents and newspaper articles (MI, AF, NK, YM), pp. 251–258.
- SIGIR-2003-Hofmann #analysis #collaboration #probability #semantics
- Collaborative filtering via gaussian probabilistic latent semantic analysis (TH), pp. 259–266.
- SIGIR-2003-XuLG #clustering #documentation #matrix
- Document clustering based on non-negative matrix factorization (WX, XL, YG), pp. 267–273.
- SIGIR-2003-WangZCLTM #clustering #multi #named
- ReCoM: reinforcement clustering of multi-type interrelated data objects (JW, HJZ, ZC, HL, LT, WYM), pp. 274–281.
- SIGIR-2003-LiOL #case study #classification #comparative #music
- A comparative study on content-based music genre classification (TL, MO, QL), pp. 282–289.
- SIGIR-2003-NottelmannF #quality #retrieval
- Evaluating different methods of estimating retrieval quality for resource selection (HN, NF), pp. 290–297.
- SIGIR-2003-SiC #documentation #estimation
- Relevant document distribution estimation method for resource selection (LS, JPC), pp. 298–305.
- SIGIR-2003-BawaMR #named #segmentation #set #topic
- SETS: search enhanced by topic segmentation (MB, GSM, PR), pp. 306–313.
- SIGIR-2003-AllanWB #detection #retrieval
- Retrieval and novelty detection at the sentence level (JA, CW, AB), pp. 314–321.
- SIGIR-2003-JiZ #independence #programming #segmentation #using
- Domain-independent text segmentation using anisotropic diffusion and dynamic programming (XJ, HZ), pp. 322–329.
- SIGIR-2003-BrantsC #detection
- A system for new event detection (TB, FC), pp. 330–337.
- SIGIR-2003-DarwishO #probability #query
- Probabilistic structured query methods (KD, DWO), pp. 338–344.
- SIGIR-2003-PirkolaTKVJ #fuzzy
- Fuzzy translation of cross-lingual spelling variants (AP, JT, HK, KV, KJ), pp. 345–352.
- SIGIR-2003-QuGE #automation #retrieval
- Automatic transliteration for Japanese-to-English text retrieval (YQ, GG, DAE), pp. 353–360.
- SIGIR-2003-AslamS #effectiveness #on the #retrieval
- On the effectiveness of evaluating retrieval systems in the absence of relevance judgments (JAA, RS), pp. 361–362.
- SIGIR-2003-CallanCNPS #data fusion #distributed #library #multi
- Resource selection and data fusion in multimedia distributed digital libraries (JPC, FC, HN, PP, XMS), pp. 363–364.
- SIGIR-2003-VirgaK
- Transliteration of proper names in cross-language applications (PV, SK), pp. 365–366.
- SIGIR-2003-Davison #analysis #towards #unification
- Toward a unification of text and link analysis (BDD), pp. 367–368.
- SIGIR-2003-AzzopardiGR #information retrieval #metric
- Investigating the relationship between language model perplexity and IR precision-recall measures (LA, MG, KvR), pp. 369–370.
- SIGIR-2003-ChoiK #concept #topic #using
- Topic distillation using hierarchy concept tree (IC, MK), pp. 371–372.
- SIGIR-2003-BeitzelJCGF #automation #evaluation #retrieval #using #web
- Using manually-built web directories for automatic evaluation of known-item retrieval (SMB, ECJ, AC, DAG, OF), pp. 373–374.
- SIGIR-2003-FengZP #detection #music #retrieval
- Popular music retrieval by detecting mood (YF, YZ, YP), pp. 375–376.
- SIGIR-2003-ShenZ #documentation #information retrieval #interactive #query #ranking
- Exploiting query history for document ranking in interactive information retrieval (XS, CZ), pp. 377–378.
- SIGIR-2003-NurayC #automation #ranking #retrieval
- Automatic ranking of retrieval systems in imperfect environments (RN, FC), pp. 379–380.
- SIGIR-2003-EdensGJL #automation #information retrieval
- An investigation of broad coverage automatic pronoun resolution for information retrieval (RJE, HLG, GJFJ, AMLA), pp. 381–382.
- SIGIR-2003-Li
- Syntactic features in question answering (XL), pp. 383–384.
- SIGIR-2003-TombrosRJ #web
- Searchers’ criteria for assessing web pages (AT, IR, JMJ), pp. 385–386.
- SIGIR-2003-BillerbeckZ #query
- When query expansion fails (BB, JZ), pp. 387–388.
- SIGIR-2003-LavrenkoP #modelling #music #random
- Music modeling with random fields (VL, JP), pp. 389–390.
- SIGIR-2003-YangW #summary
- Fractal summarization: summarization based on fractal theory (CCY, FLW), pp. 391–392.
- SIGIR-2003-AslamPS #algorithm #evaluation #performance #retrieval
- A unified model for metasearch and the efficient evaluation of retrieval systems via the hedge algorithm (JAA, VP, RS), pp. 393–394.
- SIGIR-2003-MuM #retrieval #statistics #video #visual notation
- Statistical visual feature indexes in video retrieval (XM, GM), pp. 395–396.
- SIGIR-2003-SadatYU #automation #corpus #information retrieval
- Enhancing cross-language information retrieval by an automatic acquisition of bilingual terminology from comparable corpora (FS, MY, SU), pp. 397–398.
- SIGIR-2003-TsengJ #categorisation
- Document-self expansion for text categorization (YHT, DWJ), pp. 399–400.
- SIGIR-2003-KlampanosJ #architecture #information retrieval #peer-to-peer
- An architecture for peer-to-peer information retrieval (IAK, JMJ), pp. 401–402.
- SIGIR-2003-LinNNNSTNA #multimodal #using #video
- User-trainable video annotation using multimodal cues (CYL, MRN, AN, CN, JRS, BLT, HJN, WHA), pp. 403–404.
- SIGIR-2003-SrikanthS #dependence #documentation #modelling #query #retrieval
- Incorporating query term dependencies in language models for document retrieval (MS, RKS), pp. 405–406.
- SIGIR-2003-HuBZ #analysis #fault #topic
- Error analysis of difficult TREC topics (XH, SB, CZ), pp. 407–408.
- SIGIR-2003-KampsMRS #question #retrieval #what #xml
- XML retrieval: what to retrieve? (JK, MM, MdR, BS), pp. 409–410.
- SIGIR-2003-BartlettT #data flow
- Discovering and structuring information flow among bioinformatics resources (JCB, EGT), pp. 411–412.
- SIGIR-2003-GilesPTHLRP #named
- eBizSearch: a niche search engine for e-business (CLG, YP, PBT, HH, SL, AR, NP), pp. 413–414.
- SIGIR-2003-MayfieldM #n-gram
- Single n-gram stemming (JM, PM), pp. 415–416.
- SIGIR-2003-Sakai #evaluation #multi #performance #retrieval
- Average gain ratio: a simple retrieval performance measure for evaluation with multiple relevance levels (TS), pp. 417–418.
- SIGIR-2003-BruzaS #comparison #dependence #modelling #probability #using
- A comparison of various approaches for using probabilistic dependencies in language modeling (PB, DS), pp. 419–420.
- SIGIR-2003-LiZO #generative #linear #topic
- Topic hierarchy generation via linear discriminant projection (TL, SZ, MO), pp. 421–422.
- SIGIR-2003-MartinJ #information retrieval #personalisation
- A personalised information retrieval tool (IM, JMJ), pp. 423–424.
- SIGIR-2003-KrovetzUG #classification #source code
- Classification of source code archives (RK, SU, CLG), pp. 425–426.
- SIGIR-2003-ClarkeT #documentation #retrieval
- Passage retrieval vs. document retrieval for factoid question answering (CLAC, ELT), pp. 427–428.
- SIGIR-2003-SakaiK #performance #question #retrieval #what
- Evaluating retrieval performance for Japanese question answering: what are best passages? (TS, TK), pp. 429–430.
- SIGIR-2003-TsaiMT #classification #hybrid #image #network #using
- Image classification using hybrid neural networks (CFT, KM, JT), pp. 431–432.
- SIGIR-2003-GirolamiK #equivalence #on the
- On an equivalence between PLSI and LDA (MG, AK), pp. 433–434.
- SIGIR-2003-JonesF #predict #query #word
- Query word deletion prediction (RJ, DCF), pp. 435–436.
- SIGIR-2003-LevinCS #effectiveness #query
- Assessing the effectiveness of pen-based input queries (SL, PDC, MS), pp. 437–438.
- SIGIR-2003-AntoniukN
- A light weight PDA-friendly collection fusion technique (JA, MAN), pp. 439–440.
- SIGIR-2003-HayashiOBMMMHHI #multi #speech
- Speech-based and video-supported indexing of multimedia broadcast news (YH, KO, KB, OM, YM, SM, MH, TH, NI), pp. 441–442.
- SIGIR-2003-AhmadVO #categorisation #evaluation #summary
- Summary evaluation and text categorization (KA, BV, PCFdO), pp. 443–444.
- SIGIR-2003-HanMGZ #classification #clustering #rule-based #word
- Rule-based word clustering for text classification (HH, EM, CLG, HZ), pp. 445–446.
- SIGIR-2003-AgunF #component #hardware #named
- HAT: a hardware assisted TOP-DOC inverted index component (SKA, OF), pp. 447–448.
- SIGIR-2003-AslamF #documentation #similarity
- An information-theoretic measure for document similarity (JAA, MF), pp. 449–450.
- SIGIR-2003-EvansBH #optimisation #performance #robust
- Optimizing term vectors for efficient and robust filtering (DAE, JB, DAH), pp. 451–452.
- SIGIR-2003-Downie #evaluation #information retrieval #music
- The TREC-like evaluation of music IR systems (JSD), pp. 453–454.
- SIGIR-2003-AllanK #framework #modelling
- Stemming in the language modeling framework (JA, GK), pp. 455–456.
- SIGIR-2003-LawrieC #generative #summary #web
- Generating hierarchical summaries for web searches (DJL, WBC), pp. 457–458.
- SIGIR-2003-EironM #analysis #web
- Analysis of anchor text for web search (NE, KSM), pp. 459–460.
- SIGIR-2003-HeWON #interactive #query
- User-assisted query translation for interactive CLIR (DH, JW, DWO, MN), p. 461.
- SIGIR-2003-Blair-GoldensohnMS #hybrid #named
- DefScriber: a hybrid system for definitional QA (SBG, KM, AHS), p. 462.
- SIGIR-2003-YuJR #keyword #query #using #xml
- Querying XML using structures and keywords in timber (CY, HVJ, DRR), p. 463.
- SIGIR-2003-WuRDCMHY #named
- SE-LEGO: creating metasearch engines on demand (ZW, VVR, CD, KSC, WM, HH, CTY), p. 464.
- SIGIR-2003-BerrettiCNSW #data fusion #distributed #library #multi #named
- MIND: resource selection and data fusion in multimedia distributed digital libraries (SB, JPC, HN, XMS, SW), p. 465.
- SIGIR-2003-Koster
- Head/modifier pairs for everyone (CHAK), p. 466.
- SIGIR-2003-BohnackerR #documentation #retrieval #web
- Document retrieval from user-selected web sites (UB, IR), p. 467.
- SIGIR-2003-LeuskiOB #email #named
- eArchivarius: accessing collections of electronic mail (AL, DWO, RB), p. 468.
23 ×#retrieval
14 ×#using
12 ×#information retrieval
11 ×#documentation
11 ×#query
10 ×#web
9 ×#analysis
9 ×#classification
8 ×#modelling
8 ×#named