83 papers:
- HT-2015-Bayomi #adaptation #corpus #framework #reuse
- A Framework to Provide Customized Reuse of Open Corpus Content for Adaptive Systems (MB), pp. 315–318.
- ICSME-2015-WangPV #corpus #mining #scalability
- Developing a model of loop actions by mining loop characteristics from a large code corpus (XW, LLP, KVS), pp. 51–60.
- MSR-2015-BarikLSSM #corpus #named #spreadsheet
- Fuse: A Reproducible, Extendable, Internet-Scale Corpus of Spreadsheets (TB, KL, JS, JS, ERMH), pp. 486–489.
- CHI-2015-WoltersKMDM #corpus #design #interface
- The CADENCE Corpus: A New Resource for Inclusive Voice Interface Design (MKW, JK, SEM, MD, JDM), pp. 3963–3966.
- ECIR-2015-CarrascoMMSRE #corpus
- Linguistically-Enhanced Search over an Open Diachronic Corpus (RCC, IMS, EMG, FSM, GCR, MPEE), pp. 801–804.
- ECIR-2015-HagenWS #corpus #topic #web
- A Corpus of Realistic Known-Item Topics with Associated Web Pages in the ClueWeb09 (MH, DW, BS), pp. 513–525.
- SIGIR-2015-ChakrabortyGP #corpus #retrieval
- Retrieval from Noisy E-Discovery Corpus in the Absence of Training Data (AC, KG, SKP), pp. 755–758.
- SIGIR-2015-HeindorfPSE #analysis #corpus #detection #knowledge base #towards
- Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis (SH, MP, BS, GE), pp. 831–834.
- ICSME-2014-LandmanSV #analysis #corpus #empirical #java #scalability
- Empirical Analysis of the Relationship between CC and SLOC in a Large Corpus of Java Methods (DL, AS, JJV), pp. 221–230.
- HCI-AS-2014-JiaNBBT #corpus #framework #named #online #research
- CORPUS: Next-Generation Online Platform for Research Collaborations in Humanities (YJ, XN, RB, DB, ADT), pp. 3–12.
- HIMI-DE-2014-KobayashiS #corpus #topic
- Finding Division Points for Time-Series Corpus Based on Topic Changes (HK, RS), pp. 364–372.
- CIKM-2014-MukherjeeAJ #corpus #framework #ontology
- Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construction from Corpus (SM, JA, SJ), pp. 929–938.
- ECIR-2014-BauerCRG #corpus #formal method #learning #web
- Learning a Theory of Marriage (and Other Relations) from a Web Corpus (SB, SC, LR, TG), pp. 591–597.
- SIGIR-2014-TongWZ #corpus #taxonomy
- Principled dictionary pruning for low-memory corpus compression (JT, AW, JZ), pp. 283–292.
- OOPSLA-2014-HsiaoCN #corpus #program analysis #statistics #using #web
- Using web corpus statistics for program analysis (CHH, MJC, SN), pp. 49–65.
- FSE-2014-Nguyen0NR #api #corpus #mining #scalability
- Mining preconditions of APIs in large-scale code corpus (HAN, RD, TNN, HR), pp. 166–177.
- CIKM-2013-McMinnMJ #corpus #detection #scalability #twitter
- Building a large-scale corpus for evaluating event detection on twitter (AJM, YM, JMJ), pp. 409–418.
- CIKM-2013-Zhang0D #corpus #mining #query
- Mining a search engine’s corpus without a query pool (MZ, NZ, GD), pp. 29–38.
- DocEng-2012-WidlocherM #corpus #framework #mining
- The Glozz platform: a corpus annotation and mining tool (AW, YM), pp. 171–180.
- HT-2012-OKeeffeOCLW #adaptation #corpus #hypermedia #modelling #semantics #web
- Linked open corpus models, leveraging the semantic web for adaptive hypermedia (IO, AO, PC, SL, VW), pp. 321–322.
- ITiCSE-2012-PoonSTK #corpus #detection #source code
- Instructor-centric source code plagiarism detection and plagiarism corpus (JYHP, KS, YFT, MYK), pp. 122–127.
- CIKM-2012-KoopmanZBSL #concept #evaluation #information retrieval #metric #similarity
- An evaluation of corpus-driven measures of medical concept similarity for information retrieval (BK, GZ, PB, LS, ML), pp. 2439–2442.
- CIKM-2012-SiposSSJ #corpus #summary #using #word
- Temporal corpus summarization using submodular word coverage (RS, AS, PS, TJ), pp. 754–763.
- CIKM-2012-XiangFWHR #corpus #detection #scalability #topic #twitter
- Detecting offensive tweets via topical feature discovery over a large scale twitter corpus (GX, BF, LW, JIH, CPR), pp. 1980–1984.
- KEOD-2012-SuzukiF #bibliography #segmentation #similarity #using #word
- Segmentation of Review Texts by using Thesaurus and Corpus-based Word Similarity (YS, FF), pp. 381–384.
- MLDM-2012-WangYL #corpus
- Measuring the Dynamic Relatedness between Chinese Entities Orienting to News Corpus (ZW, JY, XL), pp. 631–644.
- SIGIR-2012-CastelliRFHLR #corpus
- Distilling and exploring nuggets from a corpus (VC, HR, RF, DJH, XL, SR), p. 1006.
- SIGIR-2012-McCreadieSLMOM #corpus #on the #reuse #twitter
- On building a reusable Twitter corpus (RM, IS, JL, CM, IO, DM), pp. 1113–1114.
- SIGIR-2012-PotthastHSGMTW #corpus #named
- ChatNoir: a search engine for the ClueWeb09 corpus (MP, MH, BS, JG, MM, MT, CW), p. 1004.
- SAC-2012-Hosokawa #generative
- Corpus-based place metadatabase generation for geocoding (YH), pp. 965–967.
- DRR-2011-EynardME #corpus #framework #navigation
- A framework to improve digital corpus uses: image-mode navigation (LE, VM, HE), pp. 1–10.
- ICDAR-2011-ChattopadhyaySSR #analysis #corpus
- Creation and Analysis of a Corpus of Text Rich Indian TV Videos (TC, SS, AS, NR), pp. 849–853.
- SIGMOD-2011-ZhangZD #corpus #estimation #mining #performance
- Mining a search engine’s corpus: efficient yet unbiased sampling and aggregate estimation (MZ, NZ, GD), pp. 793–804.
- CIKM-2011-YamamotoNT11a #community #corpus
- Extracting adjective facets from community Q&A corpus (TY, SN, KT), pp. 2021–2024.
- CIKM-2011-YeungI #corpus #generative #multi
- Extracting multi-dimensional relations: a generative model of groups of entities in a corpus (CmAY, TI), pp. 1203–1208.
- SIGIR-2011-AsadiML
- Cross-corpus relevance projection (NA, DM, JJL), pp. 1163–1164.
- SIGIR-2011-HoobinPZ #corpus
- Sample selection for dictionary-based corpus compression (CH, SJP, JZ), pp. 1137–1138.
- SIGIR-2011-PaikPP #algorithm #novel #statistics #using
- A novel corpus-based stemming algorithm using co-occurrence statistics (JHP, DP, SKP), pp. 863–872.
- SIGMOD-2010-SiferLWB #corpus #keyword #multi #summary
- Integrating keyword search with multiple dimension tree views over a summary corpus data cube (MS, JL, YW, SB), pp. 1167–1170.
- ICEIS-HCI-2010-KralC #automation #corpus #web
- Automatic Dialog Act Corpus Creation from Web Pages (PK, CC), pp. 198–203.
- ICPR-2010-ElnakibECS #analysis #corpus
- Dyslexia Diagnostics by Centerline-Based Shape Analysis of the Corpus Callosum (AE, AEB, MC, AES), pp. 261–264.
- ICPR-2010-PastorTCV #corpus
- A Bi-modal Handwritten Text Corpus: Baseline Results (MP, AHT, FC, EV), pp. 1933–1936.
- ICPR-2010-RomeroTV #analysis #corpus #image
- Computer Assisted Transcription of Text Images: Results on the GERMANA Corpus and Analysis of Improvements Needed for Practical Use (VR, AHT, EV), pp. 2017–2020.
- SIGIR-2010-Potthast #corpus #crowdsourcing #wiki
- Crowdsourcing a wikipedia vandalism corpus (MP), pp. 789–790.
- HT-2009-SteichenLOW #corpus #generative #hypermedia #reuse
- Dynamic hypertext generation for reusing open corpus content (BS, SL, AO, VW), pp. 119–128.
- ICDAR-2009-Martin-ToralAP #corpus #detection #documentation
- Detection of Incoherences in a Document Corpus Based on the Application of a Neuro-Fuzzy System (SMT, VA, GISP), pp. 1101–1105.
- DHM-2009-ClavelM #approach #modelling #multimodal #named #permutation
- PERMUTATION: A Corpus-Based Approach for Modeling Personality and Multimodal Expression of Affects in Virtual Characters (CC, JCM), pp. 211–220.
- HCD-2009-NakanoR #analysis #corpus #multimodal #usability
- Multimodal Corpus Analysis as a Method for Ensuring Cultural Usability of Embodied Conversational Agents (YIN, MR), pp. 521–530.
- HT-2008-LawlessHW #corpus #education #learning
- Enhancing access to open corpus educational content: learning in the wild (SL, LH, VW), pp. 167–174.
- ICEIS-AIDSS-2008-Martin-ToralSD #corpus #detection #documentation
- Detection of Incoherences in a Technical and Normative Document Corpus (SMT, GISP, YAD), pp. 282–287.
- CIKM-2008-CustisA #corpus #query #statistics
- Investigating external corpus and clickthrough statistics for query expansion in the legal domain (TC, KAK), pp. 1363–1364.
- CIKM-2008-RogatiYC #corpus #information retrieval #optimisation
- Corpus microsurgery: criteria optimization for medical cross-language ir (MR, YY, JGC), pp. 1365–1366.
- ECIR-2008-AyacheQ #corpus #learning #using #video
- Video Corpus Annotation Using Active Learning (SA, GQ), pp. 187–198.
- ECIR-2008-Talvensaari #corpus #quality
- Effects of Aligned Corpus Quality and Size in Corpus-Based CLIR (TT), pp. 114–125.
- KDD-2008-ChowGS #detection #privacy #using
- Detecting privacy leaks using corpus-based association rules (RC, PG, JS), pp. 893–901.
- SIGIR-2008-Banerjee #classification #corpus #modelling #topic #using
- Improving text classification accuracy using topic modeling over an additional corpus (SB), pp. 867–868.
- DRR-2007-HeD #adaptation #clustering #corpus #retrieval
- Combining text clustering and retrieval for corpus adaptation (FH, XD).
- ICDAR-2007-VargasFTA #corpus
- Off-line Handwritten Signature GPDS-960 Corpus (FV, MAF, CMT, JBA), pp. 764–768.
- ITiCSE-2007-TremblayMSZ #corpus #maintenance #student
- Introducing students to professional software construction: a “software construction and maintenance” course and its maintenance corpus (GT, BM, AS, PZ), pp. 176–180.
- LATA-2007-YoonSK #corpus #rule-based #word
- Rule-based Word Spacing in Korean Based on Lexical Information Extracted from a Corpus (JY, GYS, SK), pp. 589–599.
- DHM-2007-ZhengLODK #corpus #simulation
- Human Motion Simulation and Action Corpus (GZ, WL, PO, LD, IK), pp. 314–322.
- KDD-2007-BhagwatEM #clustering #corpus #documentation #scalability #similarity
- Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus (DB, KE, PM), pp. 105–112.
- SIGIR-2007-LiLHC #ad hoc #corpus #query #using #wiki
- Improving weak ad-hoc queries using wikipedia as external corpus (YL, RWPL, EKSH, KFLC), pp. 797–798.
- HT-2006-JensenM #corpus #multi #retrieval #web
- Different indexing strategies for multilingual web retrieval: experiments with the EuroGOV corpus (NJ, TM), pp. 169–170.
- CIKM-2006-BroderFJKMNPTX #corpus #query
- Estimating corpus size via queries (AZB, MF, VJ, RK, RM, SUN, RP, AT, YX), pp. 594–603.
- ECIR-2006-ZhangWGV #automation #corpus #parallel #web
- Automatic Acquisition of Chinese-English Parallel Corpus from the Web (YZ, KW, JG, PV), pp. 420–431.
- ICDAR-2005-MihovSRDN #comparative #corpus #evaluation
- A Corpus for Comparative Evaluation of OCR Software and Postcorrection Techniques (SM, KUS, CR, VD, VN), pp. 162–166.
- SIGIR-2005-ZhouG #categorisation #corpus #geometry #on the
- On redundancy of training corpus for text categorization: a perspective of geometry (SZ, JG), pp. 671–672.
- ECIR-2004-ChenTH #corpus #identification #novel #using
- Identification of Relevant and Novel Sentences Using Reference Corpus (HHC, MFT, MHH), pp. 85–98.
- SIGIR-2004-ConradS #corpus #detection
- Constructing a text corpus for inexact duplicate detection (JGC, CPS), pp. 582–583.
- SIGIR-2004-KurlandL #ad hoc #corpus #information retrieval #modelling
- Corpus structure, language models, and ad hoc information retrieval (OK, LL), pp. 194–201.
- ICEIS-v2-2003-MengYLCCH #corpus
- Act E-Service Question Answering Systems Based on Faq Corpus (IHM, WPY, HYL, YLC, BC, SLH), pp. 286–293.
- ECIR-2003-AhmadTVH #image #retrieval
- Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains (KA, MT, BV, CH), pp. 502–510.
- ICLP-2003-MoonMHK #ambiguity #integration #network #semantics #word
- Integration of Semantic Networks for Corpus-Based Word Sense Disambiguation (YJM, KM, YH, PK), pp. 492–493.
- SIGIR-2002-ClarkeCLLT #corpus #performance
- The impact of corpus size on question answering performance (CLAC, GVC, ML, TRL, ELT), pp. 369–370.
- HCI-CCAD-1999-MayburyBL #user interface
- Corpus-based user interfaces (MTM, SB, FL), pp. 922–926.
- ICML-1998-LittmanJK #corpus #independence #learning #representation
- Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus (MLL, FJ, GAK), pp. 314–322.
- SIGIR-1998-Jean-David #automation #corpus #query
- Automatic Acquisition of Terminological Relations from a Corpus for Query Expansion (SJD), pp. 371–372.
- SIGIR-1998-RagasK #algorithm #classification #corpus
- Four Text Classification Algorithms Compared on a Dutch Corpus (HR, CHAK), pp. 369–370.
- CIKM-1997-GauchW #analysis #approach #automation #corpus #query
- A Corpus Analysis Approach for Automatic Query Expansion (SG, JW), pp. 278–284.
- SIGIR-1997-SilversteinP #clustering #corpus #set
- Almost-Constant-Time Clustering of Arbitrary Corpus Subsets (CS, JOP), pp. 60–66.
- PLDI-1995-CalderGLMMZ #branch #predict
- Corpus-Based Static Branch Prediction (BC, DG, DCL, JHM, MM, BGZ), pp. 79–92.
- SIGIR-1992-Krovetz #corpus #information retrieval
- Corpus Linguistics and Information Retrieval (RK), pp. 348–351.