Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter

Ying Li, Bing Liu, Sunita Sarawagi
Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining
KDD, 2008.

KER
DBLP
Scholar
Full names Links ISxN
@proceedings{KDD-2008,
	address       = "Las Vegas, Nevada, USA",
	editor        = "Ying Li and Bing Liu and Sunita Sarawagi",
	isbn          = "978-1-60558-193-4",
	publisher     = "{ACM}",
	title         = "{Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining}",
	year          = 2008,
}

Contents (134 items)

KDD-2008-EdelmanS #design #internet
Internet advertising and optimal auction design (BE, MS), p. 1.
KDD-2008-GraepelH #data analysis #modelling #online #scalability
Large scale data analysis and modelling in online services and advertising (TG, RH), p. 2.
KDD-2008-HastieFT #coordination
Regularization paths and coordinate descent (TH, JF, RT), p. 3.
KDD-2008-Malik #future of #image
The future of image search (JM), p. 4.
KDD-2008-Miletzki #overview #pattern matching #pattern recognition #recognition
Genesis of postal address reading, current state and future prospects: thirty years of pattern recognition on duty of postal services (UM), pp. 5–6.
KDD-2008-AnagnostopoulosKM #correlation #network #social
Influence and correlation in social networks (AA, RK, MM), pp. 7–15.
KDD-2008-BecchettiBCG #algorithm #graph #performance
Efficient semi-streaming algorithms for local triangle counting in massive graphs (LB, PB, CC, AG), pp. 16–24.
KDD-2008-BhattacharyaGJ #categorisation #documentation #identification
Structured entity identification and document categorization: two tasks with one joint model (IB, SG, SJ), pp. 25–33.
KDD-2008-BifetG #adaptation #data type #mining
Mining adaptively frequent closed unlabeled rooted trees in data streams (AB, RG), pp. 34–42.
KDD-2008-BilgicG #classification #effectiveness
Effective label acquisition for collective classification (MB, LG), pp. 43–51.
KDD-2008-BonchiCDG #composition #query #topic
Topical query decomposition (FB, CC, DD, AG), pp. 52–60.
KDD-2008-BoutsidisMD #analysis #component #feature model
Unsupervised feature selection for principal components analysis (CB, MWM, PD), pp. 61–69.
KDD-2008-BrickellS #cost analysis #privacy
The cost of privacy: destruction of data-mining utility in anonymized data publishing (JB, VS), pp. 70–78.
KDD-2008-ChakrabartiKP #generative #web
Generating succinct titles for web URLs (DC, RK, KP), pp. 79–87.
KDD-2008-ChakrabartiKSB #learning #ranking
Structured learning for non-smooth ranking losses (SC, RK, US, CB), pp. 88–96.
KDD-2008-ChangYM
Partitioned logistic regression for spam filtering (MWC, WtY, CM), pp. 97–105.
KDD-2008-ChenJCLWY #classification #kernel #learning
Learning subspace kernels for classification (JC, SJ, BC, QL, MW, JY), pp. 106–114.
KDD-2008-ChenZC #collaboration #community #personalisation #recommendation
Combinational collaborative filtering for personalized community recommendation (WC, DZ, EYC), pp. 115–123.
KDD-2008-ChenW #classification #feature model #metric #named #performance #problem
FAST: a roc-based feature selection metric for small samples and imbalanced data classification problems (XwC, MW), pp. 124–132.
KDD-2008-ChengT #learning
Semi-supervised learning with data calibration for long-term time series forecasting (HC, PNT), pp. 133–141.
KDD-2008-ChoRC #data mining #identification #mining #network
Reconstructing chemical reaction networks: data mining meets system identification (YJC, NR, YC), pp. 142–150.
KDD-2008-Christen #automation #classification #nearest neighbour #using
Automatic record linkage using seeded nearest neighbour and support vector machine classification (PC), pp. 151–159.
KDD-2008-CrandallCHKS #community #feedback #online #similarity #social
Feedback effects between similarity and social influence in online communities (DJC, DC, DPH, JMK, SS), pp. 160–168.
KDD-2008-DasSN #category theory #dataset #detection
Anomaly pattern detection in categorical datasets (KD, JGS, DBN), pp. 169–176.
KDD-2008-SarmaGI #query #using
Bypass rates: reducing query abandonment using negative inferences (ADS, SG, SI), pp. 177–185.
KDD-2008-DasguptaKS
De-duping URLs via rewrite rules (AD, RK, AS), pp. 186–194.
KDD-2008-DavisD #learning #metric #problem
Structured metric learning for high dimensional problems (JVD, ISD), pp. 195–203.
KDD-2008-RaedtGN #constraints #mining #programming
Constraint programming for itemset mining (LDR, TG, SN), pp. 204–212.
KDD-2008-ElkanN #classification #learning
Learning classifiers from only positive and unlabeled data (CE, KN), pp. 213–220.
KDD-2008-EshghiR #locality #order #rank #statistics
Locality sensitive hash functions based on concomitant rank order statistics (KE, SR), pp. 221–229.
KDD-2008-FanZCGYHYV #mining #modelling
Direct mining of discriminative and essential frequent patterns via model-based search tree (WF, KZ, HC, JG, XY, JH, PSY, OV), pp. 230–238.
KDD-2008-FormanR #classification #file system #scalability
Scaling up text classification for large file systems (GF, SR), pp. 239–246.
KDD-2008-FujiwaraSY #identification #markov #modelling #named #performance
SPIRAL: efficient and exact model identification for hidden Markov models (YF, YS, MY), pp. 247–255.
KDD-2008-GallagherTEF #classification #network #using
Using ghost edges for classification in sparsely labeled networks (BG, HT, TER, CF), pp. 256–264.
KDD-2008-GantaKS #composition #privacy
Composition attacks and auxiliary information in data privacy (SRG, SPK, AS), pp. 265–273.
KDD-2008-GantiKV #categorisation #documentation #scalability
Entity categorization over large document collections (VG, ACK, RV), pp. 274–282.
KDD-2008-GaoFJH #information management #multi
Knowledge transfer via multiple model local structure mapping (JG, WF, JJ, JH), pp. 283–291.
KDD-2008-GarrigaJM #matrix
Banded structure in binary matrices (GCG, EJ, HM), pp. 292–300.
KDD-2008-GuptaFFSK #algorithm #approximate #evaluation #mining
Quantitative evaluation of approximate frequent pattern mining algorithms (RG, GF, BF, MS, VK), pp. 301–309.
KDD-2008-HallSM #dependence #using
Unsupervised deduplication using cross-field dependencies (RH, CAS, AM), pp. 310–317.
KDD-2008-HuYS #constraints #named #permutation #proximity
Permu-pattern: discovery of mutable permutation patterns with proximity constraint (MH, JY, WS), pp. 318–326.
KDD-2008-HuangDLL #clustering #equivalence #higher-order
Simultaneous tensor subspace selection and clustering: the equivalence of high order svd and k-means clustering (HH, CHQD, DL, TL), pp. 327–335.
KDD-2008-HwangKRZ #graph #mining
Bridging centrality: graph mining from element level to group level (WH, TK, MR, AZ), pp. 336–344.
KDD-2008-HyvonenMT #matrix
Interpretable nonnegative matrix decompositions (SH, PM, ET), pp. 345–353.
KDD-2008-IfrimBW #categorisation #n-gram #performance
Fast logistic regression for text categorization with variable-length n-grams (GI, GHB, GW), pp. 354–362.
KDD-2008-IwataYU #documentation #probability #semantics #topic #visualisation
Probabilistic latent semantic visualization: topic model for visualizing documents (TI, TY, NU), pp. 363–371.
KDD-2008-JensenFTM #automation #design #identification
Automatic identification of quasi-experimental designs for discovering causal knowledge (DDJ, ASF, BJT, MEM), pp. 372–380.
KDD-2008-JiTYY #classification #multi
Extracting shared subspace for multi-label classification (SJ, LT, SY, JY), pp. 381–389.
KDD-2008-JiangPLCH #mining
Mining preferences from superior and inferior examples (BJ, JP, XL, DWC, JH), pp. 390–398.
KDD-2008-JinAXR #effectiveness #performance #summary
Effective and efficient itemset pattern summarization: regression-based approaches (RJ, MAA, YX, NR), pp. 399–407.
KDD-2008-KeerthiSCHL #linear #multi #scalability
A sequential dual method for large scale multi-class linear svms (SSK, SS, KWC, CJH, CJL), pp. 408–416.
KDD-2008-KiernanT #scalability #sequence #summary
Constructing comprehensive summaries of large event sequences (JK, ET), pp. 417–425.
KDD-2008-Koren #collaboration #multi
Factorization meets the neighborhood: a multifaceted collaborative filtering model (YK), pp. 426–434.
KDD-2008-KossinetsKW #communication #network #social
The structure of information pathways in a social communication network (GK, JMK, DJW), pp. 435–443.
KDD-2008-KriegelSZ #detection
Angle-based outlier detection in high-dimensional data (HPK, MS, AZ), pp. 444–452.
KDD-2008-LaxmanTW #generative #modelling #predict #sequence #using
Stream prediction using a generative model based on frequent episodes in event sequences (SL, VT, RWW), pp. 453–461.
KDD-2008-LeskovecBKT #evolution #network #social
Microscopic evolution of social networks (JL, LB, RK, AT), pp. 462–470.
KDD-2008-LiFGMF #learning #linear #named #parallel #performance
Cut-and-stitch: efficient parallel learning of linear dynamical systems on smps (LL, WF, FG, TCM, CF), pp. 471–479.
KDD-2008-LingD #learning #query
Active learning with direct query construction (CXL, JD), pp. 480–487.
KDD-2008-LingDXYY #learning
Spectral domain-transfer learning (XL, WD, GRX, QY, YY), pp. 488–496.
KDD-2008-LingMZS #mining #multi #topic
Mining multi-faceted overviews of arbitrary topics in a text collection (XL, QM, CZ, BRS), pp. 497–505.
KDD-2008-LozanoA #multi
Multi-class cost-sensitive boosting with p-norm loss functions (ACL, NA), pp. 506–514.
KDD-2008-MadaniH #learning #on the
On updates that constrain the features’ connections during learning (OM, JH), pp. 515–523.
KDD-2008-McGlohonAF #component #generative #graph
Weighted graphs and disconnected components: patterns and a generator (MM, LA, CF), pp. 524–532.
KDD-2008-MoiseS #approach #clustering #novel #statistics
Finding non-redundant, statistically significant regions in high dimensional data: a novel approach to projected and subspace clustering (GM, JS), pp. 533–541.
KDD-2008-NallapatiAXC #modelling #topic
Joint latent topic models for text and citations (RN, AA, EPX, WWC), pp. 542–550.
KDD-2008-NguyenC #classification
Classification with partial labels (NN, RC), pp. 551–559.
KDD-2008-PedreschiRT #data mining #mining
Discrimination-aware data mining (DP, SR, FT), pp. 560–568.
KDD-2008-PorteousNIASW #performance
Fast collapsed gibbs sampling for latent dirichlet allocation (IP, DN, ATI, AUA, PS, MW), pp. 569–577.
KDD-2008-SaigoKT #graph #mining
Partial least squares regression for graph mining (HS, NK, KT), pp. 578–586.
KDD-2008-SatoYN #graph #information management #parametricity #semantics #using #word
Knowledge discovery of semantic relationships between words using nonparametric bayesian graph model (IS, MY, HN), pp. 587–595.
KDD-2008-SeshadriMSBFL #graph #mobile
Mobile call graphs: beyond power-law and lognormal distributions (MS, SM, AS, JB, CF, JL), pp. 596–604.
KDD-2008-ShaoCTYA #mining #performance #sequence
Efficient ticket routing by resolution sequence mining (QS, YC, ST, XY, NA), pp. 605–613.
KDD-2008-ShengPI #data mining #mining #multi #quality #using
Get another label? improving data quality and data mining using multiple, noisy labelers (VSS, FJP, PGI), pp. 614–622.
KDD-2008-ShiehK #mining #named
iSAX: indexing and mining terabyte sized time series (JS, EJK), pp. 623–631.
KDD-2008-SiaCCT #performance #query
Efficient computation of personal aggregate queries on blogs (KCS, JC, YC, BLT), pp. 632–640.
KDD-2008-SimonKZ #agile #approach #reliability #scalability #set
Semi-supervised approach to rapid and reliable labeling of large data sets (GJS, VK, ZLZ), pp. 641–649.
KDD-2008-SinghG #learning #matrix #relational
Relational learning via collective matrix factorization (APS, GJG), pp. 650–658.
KDD-2008-SongJRG #linear
A bayesian mixture model with linear regression mixing proportions (XS, CJ, SR, JG), pp. 659–667.
KDD-2008-SunJY #classification #learning #multi
Hypergraph spectral learning for multi-label classification (LS, SJ, JY), pp. 668–676.
KDD-2008-TangLZN #community #evolution #multi #network
Community evolution in dynamic multi-mode networks (LT, HL, JZ, ZN), pp. 677–685.
KDD-2008-TongPSYF #graph #mining #named #performance #scalability
Colibri: fast mining of large static and dynamic graphs (HT, SP, JS, PSY, CF), pp. 686–694.
KDD-2008-MeloAL #behaviour #metric #network #predict #question
Can complex network metrics predict the behavior of NBA teams? (POSVdM, VAFA, AAFL), pp. 695–703.
KDD-2008-WalkerR #clustering #documentation #modelling
Model-based document clustering with a collapsed gibbs sampler (DDW, EKR), pp. 704–712.
KDD-2008-WangD #classification #kernel #semantics #using #wiki
Building semantic kernels for text classification using wikipedia (PW, CD), pp. 713–721.
KDD-2008-WickRSM #approach
A unified approach for schema matching, coreference and canonicalization (MLW, KR, KS, AM), pp. 722–730.
KDD-2008-WuHW #information management #wiki
Information extraction from Wikipedia: moving down the long tail (FW, RH, DSW), pp. 731–739.
KDD-2008-WuXC #clustering #incremental #learning #named
SAIL: summation-based incremental learning for information-theoretic clustering (JW, HX, JC), pp. 740–748.
KDD-2008-WuLCC #learning #symmetry
Asymmetric support vector machines: low false-positive learning under the user tolerance (SHW, KPL, CMC, MSC), pp. 749–757.
KDD-2008-XiangJFD #database #summary #transaction
Succinct summarization of transactional databases: an overlapped hyperrectangle scheme (YX, RJ, DF, FFD), pp. 758–766.
KDD-2008-XuWFY #database #transaction
Anonymizing transaction databases for publication (YX, KW, AWCF, PSY), pp. 767–775.
KDD-2008-YangZYW #detection
Local peculiarity factor and its application in outlier detection (JY, NZ, YY, JW), pp. 776–784.
KDD-2008-YenSMS #difference #metric #product line
A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances (LY, MS, AM, MS), pp. 785–793.
KDD-2008-YuJ #kernel #using
Training structural svms with kernels using sampled cuts (CNJY, TJ), pp. 794–802.
KDD-2008-YuDL #feature model
Stable feature selection via dense feature groups (LY, CHQD, SL), pp. 803–811.
KDD-2008-ZhangZS #categorisation #concept #data type #mining
Categorizing and mining concept drifting data streams (PZ, XZ, YS), pp. 812–820.
KDD-2008-ZhangZW #algorithm #named #performance
Fastanova: an efficient algorithm for genome-wide association study (XZ, FZ, WW), pp. 821–829.
KDD-2008-ZhaoWZ #algorithm #named #performance #virtual machine
Cuts3vm: a fast semi-supervised svm algorithm (BZ, FW, CZ), pp. 830–838.
KDD-2008-ZhaoWLYC #data flow #identification #multi #semistructured data
Identifying biologically relevant genes via multiple heterogeneous data sources (ZZ, JW, HL, JY, YC), pp. 839–847.
KDD-2008-ZhouX #correlation #perspective
Volatile correlation computation: a checkpoint view (WZ, HX), pp. 848–856.
KDD-2008-BoriahKSPK #case study #detection
Land cover change detection: a case study (SB, VK, MS, CP, SAK), pp. 857–865.
KDD-2008-BouguessaDW #exclamation #identification
Identifying authoritative actors in question-answering forums: the case of Yahoo! answers (MB, BD, SW), pp. 866–874.
KDD-2008-CaoJPHLCL #mining #query
Context-aware query suggestion by mining click-through and session data (HC, DJ, JP, QH, ZL, EC, HL), pp. 875–883.
KDD-2008-ChihP #persuasion #visualisation
The persuasive phase of visualization (CHC, DSPJ), pp. 884–892.
KDD-2008-ChowGS #detection #privacy #using
Detecting privacy leaks using corpus-based association rules (RC, PG, JS), pp. 893–901.
KDD-2008-CuiDSAJ #learning
Learning methods for lung tumor markerless gating in image-guided radiotherapy (YC, JGD, GCS, BMA, SBJ), pp. 902–910.
KDD-2008-GodboleR #analysis #automation #classification #industrial
Text classification, business intelligence, and interactivity: automating C-Sat analysis for services industry (SG, SR), pp. 911–919.
KDD-2008-GrossmanG #data mining #mining #performance #using
Data mining using high performance data clouds: experimental studies using sector and sphere (RLG, YG), pp. 920–927.
KDD-2008-HoT #automation #multi #using
Automated cyclone discovery and tracking using knowledge sharing in multiple heterogeneous satellite data (SSH, AT), pp. 928–936.
KDD-2008-KoenigsteinST #analysis #query #string #using
Spotting out emerging artists using geo-aware analysis of P2P query strings (NK, YS, TT), pp. 937–945.
KDD-2008-MelvilleRL #modelling #using #web
Customer targeting models using actively-selected web content (PM, SR, RDL), pp. 946–953.
KDD-2008-MorchenDFEWB #roadmap
Anticipating annotations and emerging trends in biomedical literature (FM, MD, DF, JE, BW, MB), pp. 954–962.
KDD-2008-NorenBHSE #roadmap
Temporal pattern discovery for trends and transient effects: its application to patient records (GNN, AB, JH, KS, IRE), pp. 963–971.
KDD-2008-ParikhS #detection #query #realtime #scalability
Scalable and near real-time burst detection from eCommerce queries (NP, NS), pp. 972–980.
KDD-2008-Sindhgatta #developer #identification #source code
Identifying domain expertise of developers from source code (RS), pp. 981–989.
KDD-2008-TangZYLZS #mining #named #network #social
ArnetMiner: extraction and mining of academic social networks (JT, JZ, LY, JL, LZ, ZS), pp. 990–998.
KDD-2008-ChavesBB #named #process #reliability
Tagmark: reliable estimations of RFID tags for business processes (LWFC, EB, KB), pp. 999–1007.
KDD-2008-WuK #comparison #online #scalability
Experimental comparison of scalable online ad serving (GW, BK), pp. 1008–1015.
KDD-2008-YangAPM #graph #interactive #tool support
A visual-analytic toolkit for dynamic interaction graphs (XY, SA, SP, SM), pp. 1016–1024.
KDD-2008-YeCWLZPBJLAR #data fusion #semistructured data
Heterogeneous data fusion for alzheimer’s disease study (JY, KC, TW, JL, ZZ, RP, MB, RJ, HL, GEA, ER), pp. 1025–1033.
KDD-2008-YuFRKRDL #analysis #privacy
Privacy-preserving cox regression for survival analysis (SY, GF, RR, SK, RBR, CDO, PL), pp. 1034–1042.
KDD-2008-ZengMLBM #analysis #predict #using
Using predictive analysis to improve invoice-to-cash collection (SZ, PM, CAL, IMBM, CM), pp. 1043–1050.
KDD-2008-ZhangSPN #documentation #learning #multi #topic #web
Learning from multi-topic web documents for contextual advertisement (YZ, ACS, JCP, MN), pp. 1051–1059.
KDD-2008-KumarTFJKLT #network #social
Social networks: looking ahead (RK, AT, CF, DJ, GK, JL, AT), p. 1060.
KDD-2008-BlockeelCFGPR #database #induction #mining #prototype
An inductive database prototype based on virtual mining views (HB, TC, ÉF, BG, AP, CR), pp. 1061–1064.
KDD-2008-Christen08a #open source #user interface #visual notation
Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface (PC), pp. 1065–1068.
KDD-2008-CaroCS #using
Using tagflake for condensing navigable tag hierarchies from tag clouds (LDC, KSC, MLS), pp. 1069–1072.
KDD-2008-GodboleR08a #analysis #automation #industrial
An integrated system for automatic customer satisfaction analysis in the services industry (SG, SR), pp. 1073–1076.
KDD-2008-HuaP #named
DiMaC: a disguised missing data cleaning tool (MH, JP), pp. 1077–1080.
KDD-2008-KotsifakosNVT #data mining #mining #modelling #named
Pattern-Miner: integrated management and mining over data mining models (EEK, IN, YV, YT), pp. 1081–1084.
KDD-2008-LiuYLWHD #named #online #overview
CRO: a system for online review structurization (HL, HY, WL, WW, JH, XD), pp. 1085–1088.
KDD-2008-MullerAKJS #clustering #interactive #named
Morpheus: interactive exploration of subspace clustering (EM, IA, RK, TJ, TS), pp. 1089–1092.
KDD-2008-NguyenPS #recommendation
A software system for buzz-based recommendations (HN, NP, NS), pp. 1093–1096.
KDD-2008-ZhengSSW #interactive #named
Pictor: an interactive system for importing data from a website (SZ, MRS, RS, JRW), pp. 1097–1100.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.