83 papers:
HT-2015-Bayomi #adaptation #corpus #framework #reuse- A Framework to Provide Customized Reuse of Open Corpus Content for Adaptive Systems (MB), pp. 315–318.
ICSME-2015-WangPV #corpus #mining #scalability- Developing a model of loop actions by mining loop characteristics from a large code corpus (XW, LLP, KVS), pp. 51–60.
MSR-2015-BarikLSSM #corpus #named #spreadsheet- Fuse: A Reproducible, Extendable, Internet-Scale Corpus of Spreadsheets (TB, KL, JS, JS, ERMH), pp. 486–489.
CHI-2015-WoltersKMDM #corpus #design #interface- The CADENCE Corpus: A New Resource for Inclusive Voice Interface Design (MKW, JK, SEM, MD, JDM), pp. 3963–3966.
ECIR-2015-CarrascoMMSRE #corpus- Linguistically-Enhanced Search over an Open Diachronic Corpus (RCC, IMS, EMG, FSM, GCR, MPEE), pp. 801–804.
ECIR-2015-HagenWS #corpus #topic #web- A Corpus of Realistic Known-Item Topics with Associated Web Pages in the ClueWeb09 (MH, DW, BS), pp. 513–525.
SIGIR-2015-ChakrabortyGP #corpus #retrieval- Retrieval from Noisy E-Discovery Corpus in the Absence of Training Data (AC, KG, SKP), pp. 755–758.
SIGIR-2015-HeindorfPSE #analysis #corpus #detection #knowledge base #towards- Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis (SH, MP, BS, GE), pp. 831–834.
ICSME-2014-LandmanSV #analysis #corpus #empirical #java #scalability- Empirical Analysis of the Relationship between CC and SLOC in a Large Corpus of Java Methods (DL, AS, JJV), pp. 221–230.
HCI-AS-2014-JiaNBBT #corpus #framework #named #online #research- CORPUS: Next-Generation Online Platform for Research Collaborations in Humanities (YJ, XN, RB, DB, ADT), pp. 3–12.
HIMI-DE-2014-KobayashiS #corpus #topic- Finding Division Points for Time-Series Corpus Based on Topic Changes (HK, RS), pp. 364–372.
CIKM-2014-MukherjeeAJ #corpus #framework #ontology- Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construction from Corpus (SM, JA, SJ), pp. 929–938.
ECIR-2014-BauerCRG #corpus #formal method #learning #web- Learning a Theory of Marriage (and Other Relations) from a Web Corpus (SB, SC, LR, TG), pp. 591–597.
SIGIR-2014-TongWZ #corpus #taxonomy- Principled dictionary pruning for low-memory corpus compression (JT, AW, JZ), pp. 283–292.
OOPSLA-2014-HsiaoCN #corpus #program analysis #statistics #using #web- Using web corpus statistics for program analysis (CHH, MJC, SN), pp. 49–65.
FSE-2014-Nguyen0NR #api #corpus #mining #scalability- Mining preconditions of APIs in large-scale code corpus (HAN, RD, TNN, HR), pp. 166–177.
CIKM-2013-McMinnMJ #corpus #detection #scalability #twitter- Building a large-scale corpus for evaluating event detection on twitter (AJM, YM, JMJ), pp. 409–418.
CIKM-2013-Zhang0D #corpus #mining #query- Mining a search engine’s corpus without a query pool (MZ, NZ, GD), pp. 29–38.
DocEng-2012-WidlocherM #corpus #framework #mining- The Glozz platform: a corpus annotation and mining tool (AW, YM), pp. 171–180.
HT-2012-OKeeffeOCLW #adaptation #corpus #hypermedia #modelling #semantics #web- Linked open corpus models, leveraging the semantic web for adaptive hypermedia (IO, AO, PC, SL, VW), pp. 321–322.
ITiCSE-2012-PoonSTK #corpus #detection #source code- Instructor-centric source code plagiarism detection and plagiarism corpus (JYHP, KS, YFT, MYK), pp. 122–127.
CIKM-2012-KoopmanZBSL #concept #evaluation #information retrieval #metric #similarity- An evaluation of corpus-driven measures of medical concept similarity for information retrieval (BK, GZ, PB, LS, ML), pp. 2439–2442.
CIKM-2012-SiposSSJ #corpus #summary #using #word- Temporal corpus summarization using submodular word coverage (RS, AS, PS, TJ), pp. 754–763.
CIKM-2012-XiangFWHR #corpus #detection #scalability #topic #twitter- Detecting offensive tweets via topical feature discovery over a large scale twitter corpus (GX, BF, LW, JIH, CPR), pp. 1980–1984.
KEOD-2012-SuzukiF #bibliography #segmentation #similarity #using #word- Segmentation of Review Texts by using Thesaurus and Corpus-based Word Similarity (YS, FF), pp. 381–384.
MLDM-2012-WangYL #corpus- Measuring the Dynamic Relatedness between Chinese Entities Orienting to News Corpus (ZW, JY, XL), pp. 631–644.
SIGIR-2012-CastelliRFHLR #corpus- Distilling and exploring nuggets from a corpus (VC, HR, RF, DJH, XL, SR), p. 1006.
SIGIR-2012-McCreadieSLMOM #corpus #on the #reuse #twitter- On building a reusable Twitter corpus (RM, IS, JL, CM, IO, DM), pp. 1113–1114.
SIGIR-2012-PotthastHSGMTW #corpus #named- ChatNoir: a search engine for the ClueWeb09 corpus (MP, MH, BS, JG, MM, MT, CW), p. 1004.
SAC-2012-Hosokawa #generative- Corpus-based place metadatabase generation for geocoding (YH), pp. 965–967.
DRR-2011-EynardME #corpus #framework #navigation- A framework to improve digital corpus uses: image-mode navigation (LE, VM, HE), pp. 1–10.
ICDAR-2011-ChattopadhyaySSR #analysis #corpus- Creation and Analysis of a Corpus of Text Rich Indian TV Videos (TC, SS, AS, NR), pp. 849–853.
SIGMOD-2011-ZhangZD #corpus #estimation #mining #performance- Mining a search engine’s corpus: efficient yet unbiased sampling and aggregate estimation (MZ, NZ, GD), pp. 793–804.
CIKM-2011-YamamotoNT11a #community #corpus- Extracting adjective facets from community Q&A corpus (TY, SN, KT), pp. 2021–2024.
CIKM-2011-YeungI #corpus #generative #multi- Extracting multi-dimensional relations: a generative model of groups of entities in a corpus (CmAY, TI), pp. 1203–1208.
SIGIR-2011-AsadiML- Cross-corpus relevance projection (NA, DM, JJL), pp. 1163–1164.
SIGIR-2011-HoobinPZ #corpus- Sample selection for dictionary-based corpus compression (CH, SJP, JZ), pp. 1137–1138.
SIGIR-2011-PaikPP #algorithm #novel #statistics #using- A novel corpus-based stemming algorithm using co-occurrence statistics (JHP, DP, SKP), pp. 863–872.
SIGMOD-2010-SiferLWB #corpus #keyword #multi #summary- Integrating keyword search with multiple dimension tree views over a summary corpus data cube (MS, JL, YW, SB), pp. 1167–1170.
ICEIS-HCI-2010-KralC #automation #corpus #web- Automatic Dialog Act Corpus Creation from Web Pages (PK, CC), pp. 198–203.
ICPR-2010-ElnakibECS #analysis #corpus- Dyslexia Diagnostics by Centerline-Based Shape Analysis of the Corpus Callosum (AE, AEB, MC, AES), pp. 261–264.
ICPR-2010-PastorTCV #corpus- A Bi-modal Handwritten Text Corpus: Baseline Results (MP, AHT, FC, EV), pp. 1933–1936.
ICPR-2010-RomeroTV #analysis #corpus #image- Computer Assisted Transcription of Text Images: Results on the GERMANA Corpus and Analysis of Improvements Needed for Practical Use (VR, AHT, EV), pp. 2017–2020.
SIGIR-2010-Potthast #corpus #crowdsourcing #wiki- Crowdsourcing a wikipedia vandalism corpus (MP), pp. 789–790.
HT-2009-SteichenLOW #corpus #generative #hypermedia #reuse- Dynamic hypertext generation for reusing open corpus content (BS, SL, AO, VW), pp. 119–128.
ICDAR-2009-Martin-ToralAP #corpus #detection #documentation- Detection of Incoherences in a Document Corpus Based on the Application of a Neuro-Fuzzy System (SMT, VA, GISP), pp. 1101–1105.
DHM-2009-ClavelM #approach #modelling #multimodal #named #permutation- PERMUTATION: A Corpus-Based Approach for Modeling Personality and Multimodal Expression of Affects in Virtual Characters (CC, JCM), pp. 211–220.
HCD-2009-NakanoR #analysis #corpus #multimodal #usability- Multimodal Corpus Analysis as a Method for Ensuring Cultural Usability of Embodied Conversational Agents (YIN, MR), pp. 521–530.
HT-2008-LawlessHW #corpus #education #learning- Enhancing access to open corpus educational content: learning in the wild (SL, LH, VW), pp. 167–174.
ICEIS-AIDSS-2008-Martin-ToralSD #corpus #detection #documentation- Detection of Incoherences in a Technical and Normative Document Corpus (SMT, GISP, YAD), pp. 282–287.
CIKM-2008-CustisA #corpus #query #statistics- Investigating external corpus and clickthrough statistics for query expansion in the legal domain (TC, KAK), pp. 1363–1364.
CIKM-2008-RogatiYC #corpus #information retrieval #optimisation- Corpus microsurgery: criteria optimization for medical cross-language ir (MR, YY, JGC), pp. 1365–1366.
ECIR-2008-AyacheQ #corpus #learning #using #video- Video Corpus Annotation Using Active Learning (SA, GQ), pp. 187–198.
ECIR-2008-Talvensaari #corpus #quality- Effects of Aligned Corpus Quality and Size in Corpus-Based CLIR (TT), pp. 114–125.
KDD-2008-ChowGS #detection #privacy #using- Detecting privacy leaks using corpus-based association rules (RC, PG, JS), pp. 893–901.
SIGIR-2008-Banerjee #classification #corpus #modelling #topic #using- Improving text classification accuracy using topic modeling over an additional corpus (SB), pp. 867–868.
DRR-2007-HeD #adaptation #clustering #corpus #retrieval- Combining text clustering and retrieval for corpus adaptation (FH, XD).
ICDAR-2007-VargasFTA #corpus- Off-line Handwritten Signature GPDS-960 Corpus (FV, MAF, CMT, JBA), pp. 764–768.
ITiCSE-2007-TremblayMSZ #corpus #maintenance #student- Introducing students to professional software construction: a “software construction and maintenance” course and its maintenance corpus (GT, BM, AS, PZ), pp. 176–180.
LATA-2007-YoonSK #corpus #rule-based #word- Rule-based Word Spacing in Korean Based on Lexical Information Extracted from a Corpus (JY, GYS, SK), pp. 589–599.
DHM-2007-ZhengLODK #corpus #simulation- Human Motion Simulation and Action Corpus (GZ, WL, PO, LD, IK), pp. 314–322.
KDD-2007-BhagwatEM #clustering #corpus #documentation #scalability #similarity- Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus (DB, KE, PM), pp. 105–112.
SIGIR-2007-LiLHC #ad hoc #corpus #query #using #wiki- Improving weak ad-hoc queries using wikipedia as external corpus (YL, RWPL, EKSH, KFLC), pp. 797–798.
HT-2006-JensenM #corpus #multi #retrieval #web- Different indexing strategies for multilingual web retrieval: experiments with the EuroGOV corpus (NJ, TM), pp. 169–170.
CIKM-2006-BroderFJKMNPTX #corpus #query- Estimating corpus size via queries (AZB, MF, VJ, RK, RM, SUN, RP, AT, YX), pp. 594–603.
ECIR-2006-ZhangWGV #automation #corpus #parallel #web- Automatic Acquisition of Chinese-English Parallel Corpus from the Web (YZ, KW, JG, PV), pp. 420–431.
ICDAR-2005-MihovSRDN #comparative #corpus #evaluation- A Corpus for Comparative Evaluation of OCR Software and Postcorrection Techniques (SM, KUS, CR, VD, VN), pp. 162–166.
SIGIR-2005-ZhouG #categorisation #corpus #geometry #on the- On redundancy of training corpus for text categorization: a perspective of geometry (SZ, JG), pp. 671–672.
ECIR-2004-ChenTH #corpus #identification #novel #using- Identification of Relevant and Novel Sentences Using Reference Corpus (HHC, MFT, MHH), pp. 85–98.
SIGIR-2004-ConradS #corpus #detection- Constructing a text corpus for inexact duplicate detection (JGC, CPS), pp. 582–583.
SIGIR-2004-KurlandL #ad hoc #corpus #information retrieval #modelling- Corpus structure, language models, and ad hoc information retrieval (OK, LL), pp. 194–201.
ICEIS-v2-2003-MengYLCCH #corpus- Act E-Service Question Answering Systems Based on Faq Corpus (IHM, WPY, HYL, YLC, BC, SLH), pp. 286–293.
ECIR-2003-AhmadTVH #image #retrieval- Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains (KA, MT, BV, CH), pp. 502–510.
ICLP-2003-MoonMHK #ambiguity #integration #network #semantics #word- Integration of Semantic Networks for Corpus-Based Word Sense Disambiguation (YJM, KM, YH, PK), pp. 492–493.
SIGIR-2002-ClarkeCLLT #corpus #performance- The impact of corpus size on question answering performance (CLAC, GVC, ML, TRL, ELT), pp. 369–370.
HCI-CCAD-1999-MayburyBL #user interface- Corpus-based user interfaces (MTM, SB, FL), pp. 922–926.
ICML-1998-LittmanJK #corpus #independence #learning #representation- Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus (MLL, FJ, GAK), pp. 314–322.
SIGIR-1998-Jean-David #automation #corpus #query- Automatic Acquisition of Terminological Relations from a Corpus for Query Expansion (SJD), pp. 371–372.
SIGIR-1998-RagasK #algorithm #classification #corpus- Four Text Classification Algorithms Compared on a Dutch Corpus (HR, CHAK), pp. 369–370.
CIKM-1997-GauchW #analysis #approach #automation #corpus #query- A Corpus Analysis Approach for Automatic Query Expansion (SG, JW), pp. 278–284.
SIGIR-1997-SilversteinP #clustering #corpus #set- Almost-Constant-Time Clustering of Arbitrary Corpus Subsets (CS, JOP), pp. 60–66.
PLDI-1995-CalderGLMMZ #branch #predict- Corpus-Based Static Branch Prediction (BC, DG, DCL, JHM, MM, BGZ), pp. 79–92.
SIGIR-1992-Krovetz #corpus #information retrieval- Corpus Linguistics and Information Retrieval (RK), pp. 348–351.