BibSLEIGH
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Used together with:
base (20)
use (11)
web (9)
text (9)
analysi (8)

Stem corpus$ (all stems)

83 papers:

HTHT-2015-Bayomi #adaptation #corpus #framework #reuse
A Framework to Provide Customized Reuse of Open Corpus Content for Adaptive Systems (MB), pp. 315–318.
ICSMEICSME-2015-WangPV #corpus #mining #scalability
Developing a model of loop actions by mining loop characteristics from a large code corpus (XW, LLP, KVS), pp. 51–60.
MSRMSR-2015-BarikLSSM #corpus #named #spreadsheet
Fuse: A Reproducible, Extendable, Internet-Scale Corpus of Spreadsheets (TB, KL, JS, JS, ERMH), pp. 486–489.
CHICHI-2015-WoltersKMDM #corpus #design #interface
The CADENCE Corpus: A New Resource for Inclusive Voice Interface Design (MKW, JK, SEM, MD, JDM), pp. 3963–3966.
ECIRECIR-2015-CarrascoMMSRE #corpus
Linguistically-Enhanced Search over an Open Diachronic Corpus (RCC, IMS, EMG, FSM, GCR, MPEE), pp. 801–804.
ECIRECIR-2015-HagenWS #corpus #topic #web
A Corpus of Realistic Known-Item Topics with Associated Web Pages in the ClueWeb09 (MH, DW, BS), pp. 513–525.
SIGIRSIGIR-2015-ChakrabortyGP #corpus #retrieval
Retrieval from Noisy E-Discovery Corpus in the Absence of Training Data (AC, KG, SKP), pp. 755–758.
SIGIRSIGIR-2015-HeindorfPSE #analysis #corpus #detection #knowledge base #towards
Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis (SH, MP, BS, GE), pp. 831–834.
ICSMEICSME-2014-LandmanSV #analysis #corpus #empirical #java #scalability
Empirical Analysis of the Relationship between CC and SLOC in a Large Corpus of Java Methods (DL, AS, JJV), pp. 221–230.
HCIHCI-AS-2014-JiaNBBT #corpus #framework #named #online #research
CORPUS: Next-Generation Online Platform for Research Collaborations in Humanities (YJ, XN, RB, DB, ADT), pp. 3–12.
HCIHIMI-DE-2014-KobayashiS #corpus #topic
Finding Division Points for Time-Series Corpus Based on Topic Changes (HK, RS), pp. 364–372.
CIKMCIKM-2014-MukherjeeAJ #corpus #framework #ontology
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construction from Corpus (SM, JA, SJ), pp. 929–938.
ECIRECIR-2014-BauerCRG #corpus #formal method #learning #web
Learning a Theory of Marriage (and Other Relations) from a Web Corpus (SB, SC, LR, TG), pp. 591–597.
SIGIRSIGIR-2014-TongWZ #corpus #taxonomy
Principled dictionary pruning for low-memory corpus compression (JT, AW, JZ), pp. 283–292.
OOPSLAOOPSLA-2014-HsiaoCN #corpus #program analysis #statistics #using #web
Using web corpus statistics for program analysis (CHH, MJC, SN), pp. 49–65.
FSEFSE-2014-Nguyen0NR #api #corpus #mining #scalability
Mining preconditions of APIs in large-scale code corpus (HAN, RD, TNN, HR), pp. 166–177.
CIKMCIKM-2013-McMinnMJ #corpus #detection #scalability #twitter
Building a large-scale corpus for evaluating event detection on twitter (AJM, YM, JMJ), pp. 409–418.
CIKMCIKM-2013-Zhang0D #corpus #mining #query
Mining a search engine’s corpus without a query pool (MZ, NZ, GD), pp. 29–38.
DocEngDocEng-2012-WidlocherM #corpus #framework #mining
The Glozz platform: a corpus annotation and mining tool (AW, YM), pp. 171–180.
HTHT-2012-OKeeffeOCLW #adaptation #corpus #hypermedia #modelling #semantics #web
Linked open corpus models, leveraging the semantic web for adaptive hypermedia (IO, AO, PC, SL, VW), pp. 321–322.
ITiCSEITiCSE-2012-PoonSTK #corpus #detection #source code
Instructor-centric source code plagiarism detection and plagiarism corpus (JYHP, KS, YFT, MYK), pp. 122–127.
CIKMCIKM-2012-KoopmanZBSL #concept #evaluation #information retrieval #metric #similarity
An evaluation of corpus-driven measures of medical concept similarity for information retrieval (BK, GZ, PB, LS, ML), pp. 2439–2442.
CIKMCIKM-2012-SiposSSJ #corpus #summary #using #word
Temporal corpus summarization using submodular word coverage (RS, AS, PS, TJ), pp. 754–763.
CIKMCIKM-2012-XiangFWHR #corpus #detection #scalability #topic #twitter
Detecting offensive tweets via topical feature discovery over a large scale twitter corpus (GX, BF, LW, JIH, CPR), pp. 1980–1984.
KEODKEOD-2012-SuzukiF #bibliography #segmentation #similarity #using #word
Segmentation of Review Texts by using Thesaurus and Corpus-based Word Similarity (YS, FF), pp. 381–384.
MLDMMLDM-2012-WangYL #corpus
Measuring the Dynamic Relatedness between Chinese Entities Orienting to News Corpus (ZW, JY, XL), pp. 631–644.
SIGIRSIGIR-2012-CastelliRFHLR #corpus
Distilling and exploring nuggets from a corpus (VC, HR, RF, DJH, XL, SR), p. 1006.
SIGIRSIGIR-2012-McCreadieSLMOM #corpus #on the #reuse #twitter
On building a reusable Twitter corpus (RM, IS, JL, CM, IO, DM), pp. 1113–1114.
SIGIRSIGIR-2012-PotthastHSGMTW #corpus #named
ChatNoir: a search engine for the ClueWeb09 corpus (MP, MH, BS, JG, MM, MT, CW), p. 1004.
SACSAC-2012-Hosokawa #generative
Corpus-based place metadatabase generation for geocoding (YH), pp. 965–967.
DRRDRR-2011-EynardME #corpus #framework #navigation
A framework to improve digital corpus uses: image-mode navigation (LE, VM, HE), pp. 1–10.
ICDARICDAR-2011-ChattopadhyaySSR #analysis #corpus
Creation and Analysis of a Corpus of Text Rich Indian TV Videos (TC, SS, AS, NR), pp. 849–853.
SIGMODSIGMOD-2011-ZhangZD #corpus #estimation #mining #performance
Mining a search engine’s corpus: efficient yet unbiased sampling and aggregate estimation (MZ, NZ, GD), pp. 793–804.
CIKMCIKM-2011-YamamotoNT11a #community #corpus
Extracting adjective facets from community Q&A corpus (TY, SN, KT), pp. 2021–2024.
CIKMCIKM-2011-YeungI #corpus #generative #multi
Extracting multi-dimensional relations: a generative model of groups of entities in a corpus (CmAY, TI), pp. 1203–1208.
SIGIRSIGIR-2011-AsadiML
Cross-corpus relevance projection (NA, DM, JJL), pp. 1163–1164.
SIGIRSIGIR-2011-HoobinPZ #corpus
Sample selection for dictionary-based corpus compression (CH, SJP, JZ), pp. 1137–1138.
SIGIRSIGIR-2011-PaikPP #algorithm #novel #statistics #using
A novel corpus-based stemming algorithm using co-occurrence statistics (JHP, DP, SKP), pp. 863–872.
SIGMODSIGMOD-2010-SiferLWB #corpus #keyword #multi #summary
Integrating keyword search with multiple dimension tree views over a summary corpus data cube (MS, JL, YW, SB), pp. 1167–1170.
ICEISICEIS-HCI-2010-KralC #automation #corpus #web
Automatic Dialog Act Corpus Creation from Web Pages (PK, CC), pp. 198–203.
ICPRICPR-2010-ElnakibECS #analysis #corpus
Dyslexia Diagnostics by Centerline-Based Shape Analysis of the Corpus Callosum (AE, AEB, MC, AES), pp. 261–264.
ICPRICPR-2010-PastorTCV #corpus
A Bi-modal Handwritten Text Corpus: Baseline Results (MP, AHT, FC, EV), pp. 1933–1936.
ICPRICPR-2010-RomeroTV #analysis #corpus #image
Computer Assisted Transcription of Text Images: Results on the GERMANA Corpus and Analysis of Improvements Needed for Practical Use (VR, AHT, EV), pp. 2017–2020.
SIGIRSIGIR-2010-Potthast #corpus #crowdsourcing #wiki
Crowdsourcing a wikipedia vandalism corpus (MP), pp. 789–790.
HTHT-2009-SteichenLOW #corpus #generative #hypermedia #reuse
Dynamic hypertext generation for reusing open corpus content (BS, SL, AO, VW), pp. 119–128.
ICDARICDAR-2009-Martin-ToralAP #corpus #detection #documentation
Detection of Incoherences in a Document Corpus Based on the Application of a Neuro-Fuzzy System (SMT, VA, GISP), pp. 1101–1105.
HCIDHM-2009-ClavelM #approach #modelling #multimodal #named #permutation
PERMUTATION: A Corpus-Based Approach for Modeling Personality and Multimodal Expression of Affects in Virtual Characters (CC, JCM), pp. 211–220.
HCIHCD-2009-NakanoR #analysis #corpus #multimodal #usability
Multimodal Corpus Analysis as a Method for Ensuring Cultural Usability of Embodied Conversational Agents (YIN, MR), pp. 521–530.
HTHT-2008-LawlessHW #corpus #education #learning
Enhancing access to open corpus educational content: learning in the wild (SL, LH, VW), pp. 167–174.
ICEISICEIS-AIDSS-2008-Martin-ToralSD #corpus #detection #documentation
Detection of Incoherences in a Technical and Normative Document Corpus (SMT, GISP, YAD), pp. 282–287.
CIKMCIKM-2008-CustisA #corpus #query #statistics
Investigating external corpus and clickthrough statistics for query expansion in the legal domain (TC, KAK), pp. 1363–1364.
CIKMCIKM-2008-RogatiYC #corpus #information retrieval #optimisation
Corpus microsurgery: criteria optimization for medical cross-language ir (MR, YY, JGC), pp. 1365–1366.
ECIRECIR-2008-AyacheQ #corpus #learning #using #video
Video Corpus Annotation Using Active Learning (SA, GQ), pp. 187–198.
ECIRECIR-2008-Talvensaari #corpus #quality
Effects of Aligned Corpus Quality and Size in Corpus-Based CLIR (TT), pp. 114–125.
KDDKDD-2008-ChowGS #detection #privacy #using
Detecting privacy leaks using corpus-based association rules (RC, PG, JS), pp. 893–901.
SIGIRSIGIR-2008-Banerjee #classification #corpus #modelling #topic #using
Improving text classification accuracy using topic modeling over an additional corpus (SB), pp. 867–868.
DRRDRR-2007-HeD #adaptation #clustering #corpus #retrieval
Combining text clustering and retrieval for corpus adaptation (FH, XD).
ICDARICDAR-2007-VargasFTA #corpus
Off-line Handwritten Signature GPDS-960 Corpus (FV, MAF, CMT, JBA), pp. 764–768.
ITiCSEITiCSE-2007-TremblayMSZ #corpus #maintenance #student
Introducing students to professional software construction: a “software construction and maintenance” course and its maintenance corpus (GT, BM, AS, PZ), pp. 176–180.
LATALATA-2007-YoonSK #corpus #rule-based #word
Rule-based Word Spacing in Korean Based on Lexical Information Extracted from a Corpus (JY, GYS, SK), pp. 589–599.
HCIDHM-2007-ZhengLODK #corpus #simulation
Human Motion Simulation and Action Corpus (GZ, WL, PO, LD, IK), pp. 314–322.
KDDKDD-2007-BhagwatEM #clustering #corpus #documentation #scalability #similarity
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus (DB, KE, PM), pp. 105–112.
SIGIRSIGIR-2007-LiLHC #ad hoc #corpus #query #using #wiki
Improving weak ad-hoc queries using wikipedia as external corpus (YL, RWPL, EKSH, KFLC), pp. 797–798.
HTHT-2006-JensenM #corpus #multi #retrieval #web
Different indexing strategies for multilingual web retrieval: experiments with the EuroGOV corpus (NJ, TM), pp. 169–170.
CIKMCIKM-2006-BroderFJKMNPTX #corpus #query
Estimating corpus size via queries (AZB, MF, VJ, RK, RM, SUN, RP, AT, YX), pp. 594–603.
ECIRECIR-2006-ZhangWGV #automation #corpus #parallel #web
Automatic Acquisition of Chinese-English Parallel Corpus from the Web (YZ, KW, JG, PV), pp. 420–431.
ICDARICDAR-2005-MihovSRDN #comparative #corpus #evaluation
A Corpus for Comparative Evaluation of OCR Software and Postcorrection Techniques (SM, KUS, CR, VD, VN), pp. 162–166.
SIGIRSIGIR-2005-ZhouG #categorisation #corpus #geometry #on the
On redundancy of training corpus for text categorization: a perspective of geometry (SZ, JG), pp. 671–672.
ECIRECIR-2004-ChenTH #corpus #identification #novel #using
Identification of Relevant and Novel Sentences Using Reference Corpus (HHC, MFT, MHH), pp. 85–98.
SIGIRSIGIR-2004-ConradS #corpus #detection
Constructing a text corpus for inexact duplicate detection (JGC, CPS), pp. 582–583.
SIGIRSIGIR-2004-KurlandL #ad hoc #corpus #information retrieval #modelling
Corpus structure, language models, and ad hoc information retrieval (OK, LL), pp. 194–201.
ICEISICEIS-v2-2003-MengYLCCH #corpus
Act E-Service Question Answering Systems Based on Faq Corpus (IHM, WPY, HYL, YLC, BC, SLH), pp. 286–293.
ECIRECIR-2003-AhmadTVH #image #retrieval
Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains (KA, MT, BV, CH), pp. 502–510.
ICLPICLP-2003-MoonMHK #ambiguity #integration #network #semantics #word
Integration of Semantic Networks for Corpus-Based Word Sense Disambiguation (YJM, KM, YH, PK), pp. 492–493.
SIGIRSIGIR-2002-ClarkeCLLT #corpus #performance
The impact of corpus size on question answering performance (CLAC, GVC, ML, TRL, ELT), pp. 369–370.
HCIHCI-CCAD-1999-MayburyBL #user interface
Corpus-based user interfaces (MTM, SB, FL), pp. 922–926.
ICMLICML-1998-LittmanJK #corpus #independence #learning #representation
Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus (MLL, FJ, GAK), pp. 314–322.
SIGIRSIGIR-1998-Jean-David #automation #corpus #query
Automatic Acquisition of Terminological Relations from a Corpus for Query Expansion (SJD), pp. 371–372.
SIGIRSIGIR-1998-RagasK #algorithm #classification #corpus
Four Text Classification Algorithms Compared on a Dutch Corpus (HR, CHAK), pp. 369–370.
CIKMCIKM-1997-GauchW #analysis #approach #automation #corpus #query
A Corpus Analysis Approach for Automatic Query Expansion (SG, JW), pp. 278–284.
SIGIRSIGIR-1997-SilversteinP #clustering #corpus #set
Almost-Constant-Time Clustering of Arbitrary Corpus Subsets (CS, JOP), pp. 60–66.
PLDIPLDI-1995-CalderGLMMZ #branch #predict
Corpus-Based Static Branch Prediction (BC, DG, DCL, JHM, MM, BGZ), pp. 79–92.
SIGIRSIGIR-1992-Krovetz #corpus #information retrieval
Corpus Linguistics and Information Retrieval (RK), pp. 348–351.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.