Ying Li, Bing Liu, Sunita Sarawagi
Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining
KDD, 2008.
@proceedings{KDD-2008, address = "Las Vegas, Nevada, USA", editor = "Ying Li and Bing Liu and Sunita Sarawagi", isbn = "978-1-60558-193-4", publisher = "{ACM}", title = "{Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining}", year = 2008, }
Contents (134 items)
- KDD-2008-EdelmanS #design #internet
- Internet advertising and optimal auction design (BE, MS), p. 1.
- KDD-2008-GraepelH #data analysis #modelling #online #scalability
- Large scale data analysis and modelling in online services and advertising (TG, RH), p. 2.
- KDD-2008-HastieFT #coordination
- Regularization paths and coordinate descent (TH, JF, RT), p. 3.
- KDD-2008-Malik #future of #image
- The future of image search (JM), p. 4.
- KDD-2008-Miletzki #overview #pattern matching #pattern recognition #recognition
- Genesis of postal address reading, current state and future prospects: thirty years of pattern recognition on duty of postal services (UM), pp. 5–6.
- KDD-2008-AnagnostopoulosKM #correlation #network #social
- Influence and correlation in social networks (AA, RK, MM), pp. 7–15.
- KDD-2008-BecchettiBCG #algorithm #graph #performance
- Efficient semi-streaming algorithms for local triangle counting in massive graphs (LB, PB, CC, AG), pp. 16–24.
- KDD-2008-BhattacharyaGJ #categorisation #documentation #identification
- Structured entity identification and document categorization: two tasks with one joint model (IB, SG, SJ), pp. 25–33.
- KDD-2008-BifetG #adaptation #data type #mining
- Mining adaptively frequent closed unlabeled rooted trees in data streams (AB, RG), pp. 34–42.
- KDD-2008-BilgicG #classification #effectiveness
- Effective label acquisition for collective classification (MB, LG), pp. 43–51.
- KDD-2008-BonchiCDG #composition #query #topic
- Topical query decomposition (FB, CC, DD, AG), pp. 52–60.
- KDD-2008-BoutsidisMD #analysis #component #feature model
- Unsupervised feature selection for principal components analysis (CB, MWM, PD), pp. 61–69.
- KDD-2008-BrickellS #cost analysis #privacy
- The cost of privacy: destruction of data-mining utility in anonymized data publishing (JB, VS), pp. 70–78.
- KDD-2008-ChakrabartiKP #generative #web
- Generating succinct titles for web URLs (DC, RK, KP), pp. 79–87.
- KDD-2008-ChakrabartiKSB #learning #ranking
- Structured learning for non-smooth ranking losses (SC, RK, US, CB), pp. 88–96.
- KDD-2008-ChangYM
- Partitioned logistic regression for spam filtering (MWC, WtY, CM), pp. 97–105.
- KDD-2008-ChenJCLWY #classification #kernel #learning
- Learning subspace kernels for classification (JC, SJ, BC, QL, MW, JY), pp. 106–114.
- KDD-2008-ChenZC #collaboration #community #personalisation #recommendation
- Combinational collaborative filtering for personalized community recommendation (WC, DZ, EYC), pp. 115–123.
- KDD-2008-ChenW #classification #feature model #metric #named #performance #problem
- FAST: a roc-based feature selection metric for small samples and imbalanced data classification problems (XwC, MW), pp. 124–132.
- KDD-2008-ChengT #learning
- Semi-supervised learning with data calibration for long-term time series forecasting (HC, PNT), pp. 133–141.
- KDD-2008-ChoRC #data mining #identification #mining #network
- Reconstructing chemical reaction networks: data mining meets system identification (YJC, NR, YC), pp. 142–150.
- KDD-2008-Christen #automation #classification #nearest neighbour #using
- Automatic record linkage using seeded nearest neighbour and support vector machine classification (PC), pp. 151–159.
- KDD-2008-CrandallCHKS #community #feedback #online #similarity #social
- Feedback effects between similarity and social influence in online communities (DJC, DC, DPH, JMK, SS), pp. 160–168.
- KDD-2008-DasSN #category theory #dataset #detection
- Anomaly pattern detection in categorical datasets (KD, JGS, DBN), pp. 169–176.
- KDD-2008-SarmaGI #query #using
- Bypass rates: reducing query abandonment using negative inferences (ADS, SG, SI), pp. 177–185.
- KDD-2008-DasguptaKS
- De-duping URLs via rewrite rules (AD, RK, AS), pp. 186–194.
- KDD-2008-DavisD #learning #metric #problem
- Structured metric learning for high dimensional problems (JVD, ISD), pp. 195–203.
- KDD-2008-RaedtGN #constraints #mining #programming
- Constraint programming for itemset mining (LDR, TG, SN), pp. 204–212.
- KDD-2008-ElkanN #classification #learning
- Learning classifiers from only positive and unlabeled data (CE, KN), pp. 213–220.
- KDD-2008-EshghiR #locality #order #rank #statistics
- Locality sensitive hash functions based on concomitant rank order statistics (KE, SR), pp. 221–229.
- KDD-2008-FanZCGYHYV #mining #modelling
- Direct mining of discriminative and essential frequent patterns via model-based search tree (WF, KZ, HC, JG, XY, JH, PSY, OV), pp. 230–238.
- KDD-2008-FormanR #classification #file system #scalability
- Scaling up text classification for large file systems (GF, SR), pp. 239–246.
- KDD-2008-FujiwaraSY #identification #markov #modelling #named #performance
- SPIRAL: efficient and exact model identification for hidden Markov models (YF, YS, MY), pp. 247–255.
- KDD-2008-GallagherTEF #classification #network #using
- Using ghost edges for classification in sparsely labeled networks (BG, HT, TER, CF), pp. 256–264.
- KDD-2008-GantaKS #composition #privacy
- Composition attacks and auxiliary information in data privacy (SRG, SPK, AS), pp. 265–273.
- KDD-2008-GantiKV #categorisation #documentation #scalability
- Entity categorization over large document collections (VG, ACK, RV), pp. 274–282.
- KDD-2008-GaoFJH #information management #multi
- Knowledge transfer via multiple model local structure mapping (JG, WF, JJ, JH), pp. 283–291.
- KDD-2008-GarrigaJM #matrix
- Banded structure in binary matrices (GCG, EJ, HM), pp. 292–300.
- KDD-2008-GuptaFFSK #algorithm #approximate #evaluation #mining
- Quantitative evaluation of approximate frequent pattern mining algorithms (RG, GF, BF, MS, VK), pp. 301–309.
- KDD-2008-HallSM #dependence #using
- Unsupervised deduplication using cross-field dependencies (RH, CAS, AM), pp. 310–317.
- KDD-2008-HuYS #constraints #named #permutation #proximity
- Permu-pattern: discovery of mutable permutation patterns with proximity constraint (MH, JY, WS), pp. 318–326.
- KDD-2008-HuangDLL #clustering #equivalence #higher-order
- Simultaneous tensor subspace selection and clustering: the equivalence of high order svd and k-means clustering (HH, CHQD, DL, TL), pp. 327–335.
- KDD-2008-HwangKRZ #graph #mining
- Bridging centrality: graph mining from element level to group level (WH, TK, MR, AZ), pp. 336–344.
- KDD-2008-HyvonenMT #matrix
- Interpretable nonnegative matrix decompositions (SH, PM, ET), pp. 345–353.
- KDD-2008-IfrimBW #categorisation #n-gram #performance
- Fast logistic regression for text categorization with variable-length n-grams (GI, GHB, GW), pp. 354–362.
- KDD-2008-IwataYU #documentation #probability #semantics #topic #visualisation
- Probabilistic latent semantic visualization: topic model for visualizing documents (TI, TY, NU), pp. 363–371.
- KDD-2008-JensenFTM #automation #design #identification
- Automatic identification of quasi-experimental designs for discovering causal knowledge (DDJ, ASF, BJT, MEM), pp. 372–380.
- KDD-2008-JiTYY #classification #multi
- Extracting shared subspace for multi-label classification (SJ, LT, SY, JY), pp. 381–389.
- KDD-2008-JiangPLCH #mining
- Mining preferences from superior and inferior examples (BJ, JP, XL, DWC, JH), pp. 390–398.
- KDD-2008-JinAXR #effectiveness #performance #summary
- Effective and efficient itemset pattern summarization: regression-based approaches (RJ, MAA, YX, NR), pp. 399–407.
- KDD-2008-KeerthiSCHL #linear #multi #scalability
- A sequential dual method for large scale multi-class linear svms (SSK, SS, KWC, CJH, CJL), pp. 408–416.
- KDD-2008-KiernanT #scalability #sequence #summary
- Constructing comprehensive summaries of large event sequences (JK, ET), pp. 417–425.
- KDD-2008-Koren #collaboration #multi
- Factorization meets the neighborhood: a multifaceted collaborative filtering model (YK), pp. 426–434.
- KDD-2008-KossinetsKW #communication #network #social
- The structure of information pathways in a social communication network (GK, JMK, DJW), pp. 435–443.
- KDD-2008-KriegelSZ #detection
- Angle-based outlier detection in high-dimensional data (HPK, MS, AZ), pp. 444–452.
- KDD-2008-LaxmanTW #generative #modelling #predict #sequence #using
- Stream prediction using a generative model based on frequent episodes in event sequences (SL, VT, RWW), pp. 453–461.
- KDD-2008-LeskovecBKT #evolution #network #social
- Microscopic evolution of social networks (JL, LB, RK, AT), pp. 462–470.
- KDD-2008-LiFGMF #learning #linear #named #parallel #performance
- Cut-and-stitch: efficient parallel learning of linear dynamical systems on smps (LL, WF, FG, TCM, CF), pp. 471–479.
- KDD-2008-LingD #learning #query
- Active learning with direct query construction (CXL, JD), pp. 480–487.
- KDD-2008-LingDXYY #learning
- Spectral domain-transfer learning (XL, WD, GRX, QY, YY), pp. 488–496.
- KDD-2008-LingMZS #mining #multi #topic
- Mining multi-faceted overviews of arbitrary topics in a text collection (XL, QM, CZ, BRS), pp. 497–505.
- KDD-2008-LozanoA #multi
- Multi-class cost-sensitive boosting with p-norm loss functions (ACL, NA), pp. 506–514.
- KDD-2008-MadaniH #learning #on the
- On updates that constrain the features’ connections during learning (OM, JH), pp. 515–523.
- KDD-2008-McGlohonAF #component #generative #graph
- Weighted graphs and disconnected components: patterns and a generator (MM, LA, CF), pp. 524–532.
- KDD-2008-MoiseS #approach #clustering #novel #statistics
- Finding non-redundant, statistically significant regions in high dimensional data: a novel approach to projected and subspace clustering (GM, JS), pp. 533–541.
- KDD-2008-NallapatiAXC #modelling #topic
- Joint latent topic models for text and citations (RN, AA, EPX, WWC), pp. 542–550.
- KDD-2008-NguyenC #classification
- Classification with partial labels (NN, RC), pp. 551–559.
- KDD-2008-PedreschiRT #data mining #mining
- Discrimination-aware data mining (DP, SR, FT), pp. 560–568.
- KDD-2008-PorteousNIASW #performance
- Fast collapsed gibbs sampling for latent dirichlet allocation (IP, DN, ATI, AUA, PS, MW), pp. 569–577.
- KDD-2008-SaigoKT #graph #mining
- Partial least squares regression for graph mining (HS, NK, KT), pp. 578–586.
- KDD-2008-SatoYN #graph #information management #parametricity #semantics #using #word
- Knowledge discovery of semantic relationships between words using nonparametric bayesian graph model (IS, MY, HN), pp. 587–595.
- KDD-2008-SeshadriMSBFL #graph #mobile
- Mobile call graphs: beyond power-law and lognormal distributions (MS, SM, AS, JB, CF, JL), pp. 596–604.
- KDD-2008-ShaoCTYA #mining #performance #sequence
- Efficient ticket routing by resolution sequence mining (QS, YC, ST, XY, NA), pp. 605–613.
- KDD-2008-ShengPI #data mining #mining #multi #quality #using
- Get another label? improving data quality and data mining using multiple, noisy labelers (VSS, FJP, PGI), pp. 614–622.
- KDD-2008-ShiehK #mining #named
- iSAX: indexing and mining terabyte sized time series (JS, EJK), pp. 623–631.
- KDD-2008-SiaCCT #performance #query
- Efficient computation of personal aggregate queries on blogs (KCS, JC, YC, BLT), pp. 632–640.
- KDD-2008-SimonKZ #agile #approach #reliability #scalability #set
- Semi-supervised approach to rapid and reliable labeling of large data sets (GJS, VK, ZLZ), pp. 641–649.
- KDD-2008-SinghG #learning #matrix #relational
- Relational learning via collective matrix factorization (APS, GJG), pp. 650–658.
- KDD-2008-SongJRG #linear
- A bayesian mixture model with linear regression mixing proportions (XS, CJ, SR, JG), pp. 659–667.
- KDD-2008-SunJY #classification #learning #multi
- Hypergraph spectral learning for multi-label classification (LS, SJ, JY), pp. 668–676.
- KDD-2008-TangLZN #community #evolution #multi #network
- Community evolution in dynamic multi-mode networks (LT, HL, JZ, ZN), pp. 677–685.
- KDD-2008-TongPSYF #graph #mining #named #performance #scalability
- Colibri: fast mining of large static and dynamic graphs (HT, SP, JS, PSY, CF), pp. 686–694.
- KDD-2008-MeloAL #behaviour #metric #network #predict #question
- Can complex network metrics predict the behavior of NBA teams? (POSVdM, VAFA, AAFL), pp. 695–703.
- KDD-2008-WalkerR #clustering #documentation #modelling
- Model-based document clustering with a collapsed gibbs sampler (DDW, EKR), pp. 704–712.
- KDD-2008-WangD #classification #kernel #semantics #using #wiki
- Building semantic kernels for text classification using wikipedia (PW, CD), pp. 713–721.
- KDD-2008-WickRSM #approach
- A unified approach for schema matching, coreference and canonicalization (MLW, KR, KS, AM), pp. 722–730.
- KDD-2008-WuHW #information management #wiki
- Information extraction from Wikipedia: moving down the long tail (FW, RH, DSW), pp. 731–739.
- KDD-2008-WuXC #clustering #incremental #learning #named
- SAIL: summation-based incremental learning for information-theoretic clustering (JW, HX, JC), pp. 740–748.
- KDD-2008-WuLCC #learning #symmetry
- Asymmetric support vector machines: low false-positive learning under the user tolerance (SHW, KPL, CMC, MSC), pp. 749–757.
- KDD-2008-XiangJFD #database #summary #transaction
- Succinct summarization of transactional databases: an overlapped hyperrectangle scheme (YX, RJ, DF, FFD), pp. 758–766.
- KDD-2008-XuWFY #database #transaction
- Anonymizing transaction databases for publication (YX, KW, AWCF, PSY), pp. 767–775.
- KDD-2008-YangZYW #detection
- Local peculiarity factor and its application in outlier detection (JY, NZ, YY, JW), pp. 776–784.
- KDD-2008-YenSMS #difference #metric #product line
- A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances (LY, MS, AM, MS), pp. 785–793.
- KDD-2008-YuJ #kernel #using
- Training structural svms with kernels using sampled cuts (CNJY, TJ), pp. 794–802.
- KDD-2008-YuDL #feature model
- Stable feature selection via dense feature groups (LY, CHQD, SL), pp. 803–811.
- KDD-2008-ZhangZS #categorisation #concept #data type #mining
- Categorizing and mining concept drifting data streams (PZ, XZ, YS), pp. 812–820.
- KDD-2008-ZhangZW #algorithm #named #performance
- Fastanova: an efficient algorithm for genome-wide association study (XZ, FZ, WW), pp. 821–829.
- KDD-2008-ZhaoWZ #algorithm #named #performance #virtual machine
- Cuts3vm: a fast semi-supervised svm algorithm (BZ, FW, CZ), pp. 830–838.
- KDD-2008-ZhaoWLYC #data flow #identification #multi #semistructured data
- Identifying biologically relevant genes via multiple heterogeneous data sources (ZZ, JW, HL, JY, YC), pp. 839–847.
- KDD-2008-ZhouX #correlation #perspective
- Volatile correlation computation: a checkpoint view (WZ, HX), pp. 848–856.
- KDD-2008-BoriahKSPK #case study #detection
- Land cover change detection: a case study (SB, VK, MS, CP, SAK), pp. 857–865.
- KDD-2008-BouguessaDW #exclamation #identification
- Identifying authoritative actors in question-answering forums: the case of Yahoo! answers (MB, BD, SW), pp. 866–874.
- KDD-2008-CaoJPHLCL #mining #query
- Context-aware query suggestion by mining click-through and session data (HC, DJ, JP, QH, ZL, EC, HL), pp. 875–883.
- KDD-2008-ChihP #persuasion #visualisation
- The persuasive phase of visualization (CHC, DSPJ), pp. 884–892.
- KDD-2008-ChowGS #detection #privacy #using
- Detecting privacy leaks using corpus-based association rules (RC, PG, JS), pp. 893–901.
- KDD-2008-CuiDSAJ #learning
- Learning methods for lung tumor markerless gating in image-guided radiotherapy (YC, JGD, GCS, BMA, SBJ), pp. 902–910.
- KDD-2008-GodboleR #analysis #automation #classification #industrial
- Text classification, business intelligence, and interactivity: automating C-Sat analysis for services industry (SG, SR), pp. 911–919.
- KDD-2008-GrossmanG #data mining #mining #performance #using
- Data mining using high performance data clouds: experimental studies using sector and sphere (RLG, YG), pp. 920–927.
- KDD-2008-HoT #automation #multi #using
- Automated cyclone discovery and tracking using knowledge sharing in multiple heterogeneous satellite data (SSH, AT), pp. 928–936.
- KDD-2008-KoenigsteinST #analysis #query #string #using
- Spotting out emerging artists using geo-aware analysis of P2P query strings (NK, YS, TT), pp. 937–945.
- KDD-2008-MelvilleRL #modelling #using #web
- Customer targeting models using actively-selected web content (PM, SR, RDL), pp. 946–953.
- KDD-2008-MorchenDFEWB #roadmap
- Anticipating annotations and emerging trends in biomedical literature (FM, MD, DF, JE, BW, MB), pp. 954–962.
- KDD-2008-NorenBHSE #roadmap
- Temporal pattern discovery for trends and transient effects: its application to patient records (GNN, AB, JH, KS, IRE), pp. 963–971.
- KDD-2008-ParikhS #detection #query #realtime #scalability
- Scalable and near real-time burst detection from eCommerce queries (NP, NS), pp. 972–980.
- KDD-2008-Sindhgatta #developer #identification #source code
- Identifying domain expertise of developers from source code (RS), pp. 981–989.
- KDD-2008-TangZYLZS #mining #named #network #social
- ArnetMiner: extraction and mining of academic social networks (JT, JZ, LY, JL, LZ, ZS), pp. 990–998.
- KDD-2008-ChavesBB #named #process #reliability
- Tagmark: reliable estimations of RFID tags for business processes (LWFC, EB, KB), pp. 999–1007.
- KDD-2008-WuK #comparison #online #scalability
- Experimental comparison of scalable online ad serving (GW, BK), pp. 1008–1015.
- KDD-2008-YangAPM #graph #interactive #tool support
- A visual-analytic toolkit for dynamic interaction graphs (XY, SA, SP, SM), pp. 1016–1024.
- KDD-2008-YeCWLZPBJLAR #data fusion #semistructured data
- Heterogeneous data fusion for alzheimer’s disease study (JY, KC, TW, JL, ZZ, RP, MB, RJ, HL, GEA, ER), pp. 1025–1033.
- KDD-2008-YuFRKRDL #analysis #privacy
- Privacy-preserving cox regression for survival analysis (SY, GF, RR, SK, RBR, CDO, PL), pp. 1034–1042.
- KDD-2008-ZengMLBM #analysis #predict #using
- Using predictive analysis to improve invoice-to-cash collection (SZ, PM, CAL, IMBM, CM), pp. 1043–1050.
- KDD-2008-ZhangSPN #documentation #learning #multi #topic #web
- Learning from multi-topic web documents for contextual advertisement (YZ, ACS, JCP, MN), pp. 1051–1059.
- KDD-2008-KumarTFJKLT #network #social
- Social networks: looking ahead (RK, AT, CF, DJ, GK, JL, AT), p. 1060.
- KDD-2008-BlockeelCFGPR #database #induction #mining #prototype
- An inductive database prototype based on virtual mining views (HB, TC, ÉF, BG, AP, CR), pp. 1061–1064.
- KDD-2008-Christen08a #open source #user interface #visual notation
- Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface (PC), pp. 1065–1068.
- KDD-2008-CaroCS #using
- Using tagflake for condensing navigable tag hierarchies from tag clouds (LDC, KSC, MLS), pp. 1069–1072.
- KDD-2008-GodboleR08a #analysis #automation #industrial
- An integrated system for automatic customer satisfaction analysis in the services industry (SG, SR), pp. 1073–1076.
- KDD-2008-HuaP #named
- DiMaC: a disguised missing data cleaning tool (MH, JP), pp. 1077–1080.
- KDD-2008-KotsifakosNVT #data mining #mining #modelling #named
- Pattern-Miner: integrated management and mining over data mining models (EEK, IN, YV, YT), pp. 1081–1084.
- KDD-2008-LiuYLWHD #named #online #overview
- CRO: a system for online review structurization (HL, HY, WL, WW, JH, XD), pp. 1085–1088.
- KDD-2008-MullerAKJS #clustering #interactive #named
- Morpheus: interactive exploration of subspace clustering (EM, IA, RK, TJ, TS), pp. 1089–1092.
- KDD-2008-NguyenPS #recommendation
- A software system for buzz-based recommendations (HN, NP, NS), pp. 1093–1096.
- KDD-2008-ZhengSSW #interactive #named
- Pictor: an interactive system for importing data from a website (SZ, MRS, RS, JRW), pp. 1097–1100.
20 ×#mining
16 ×#named
16 ×#using
15 ×#learning
13 ×#performance
12 ×#classification
12 ×#multi
9 ×#network
9 ×#scalability
8 ×#graph
16 ×#named
16 ×#using
15 ×#learning
13 ×#performance
12 ×#classification
12 ×#multi
9 ×#network
9 ×#scalability
8 ×#graph