Proceedings of the 13th International Conference on Knowledge Discovery and Data Mining
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter

Pavel Berkhin, Rich Caruana, Xindong Wu
Proceedings of the 13th International Conference on Knowledge Discovery and Data Mining
KDD, 2007.

KER
DBLP
Scholar
Full names Links ISxN
@proceedings{KDD-2007,
	address       = "San Jose, California, USA",
	editor        = "Pavel Berkhin and Rich Caruana and Xindong Wu",
	isbn          = "978-1-59593-609-7",
	publisher     = "{ACM}",
	title         = "{Proceedings of the 13th International Conference on Knowledge Discovery and Data Mining}",
	year          = 2007,
}

Contents (114 items)

KDD-2007-Anderson
Calculating latent demand in the long tail (CA), p. 1.
KDD-2007-Fayyad #internet #mining #web
From mining the web to inventing the new sciences underlying the internet (UMF), pp. 2–3.
KDD-2007-Kleinberg #challenge #mining #network #privacy #process #social
Challenges in mining social network data: processes, privacy, and paradoxes (JMK), pp. 4–5.
KDD-2007-AgarwalBGYKS #effectiveness #performance #summary
Efficient and effective explanation of change in hierarchical summaries (DA, DB, DG, NEY, FK, DS), pp. 6–15.
KDD-2007-AgarwalBCDJS #multi
Estimating rates of rare events at multiple resolutions (DA, AZB, DC, DD, VJ, MS), pp. 16–25.
KDD-2007-AgarwalM #modelling #predict #scalability
Predictive discrete latent factor models for large scale dyadic data (DA, SM), pp. 26–35.
KDD-2007-AggarwalY #classification #data type #on the #string
On string classification in data streams (CCA, PSY), pp. 36–45.
KDD-2007-AggarwalTWFZ #clustering #documentation #framework #named #xml
Xproj: a framework for projected structural clustering of xml documents (CCA, NT, JW, JF, MJZ), pp. 46–55.
KDD-2007-ArchakGI #exclamation #mining #power of
Show me the money!: deriving the pricing power of product features by mining consumer reviews (NA, AG, PGI), pp. 56–65.
KDD-2007-ArnoldLA #modelling #visual notation
Temporal causal modeling with graphical granger methods (AA, YL, NA), pp. 66–75.
KDD-2007-Baeza-YatesT #query #semantics
Extracting semantic relations from query logs (RABY, AT), pp. 76–85.
KDD-2007-BeckerA #concept #ranking #realtime #using
Real-time ranking with concept drift using expert advice (HB, MA), pp. 86–94.
KDD-2007-BellKV #modelling #multi #recommendation #scalability
Modeling relationships at multiple scales to improve accuracy of large recommender systems (RMB, YK, CV), pp. 95–104.
KDD-2007-BhagwatEM #clustering #corpus #documentation #scalability #similarity
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus (DB, KE, PM), pp. 105–112.
KDD-2007-ChaovalitwongseFS #classification #process
Support feature machine for classification of abnormal brain activity (WAC, YJF, RCS), pp. 113–122.
KDD-2007-ChenZYL #adaptation #clustering #distance #learning #metric
Nonlinear adaptive distance metric learning for clustering (JC, ZZ, JY, HL), pp. 123–132.
KDD-2007-ChenT #clustering #realtime
Density-based clustering for real-time stream data (YC, LT), pp. 133–142.
KDD-2007-ChewBKA #information retrieval #using
Cross-language information retrieval using PARAFAC2 (PAC, BWB, TGK, AA), pp. 143–152.
KDD-2007-ChiSZHT #clustering
Evolutionary spectral clustering by incorporating temporal smoothness (YC, XS, DZ, KH, BLT), pp. 153–162.
KDD-2007-ChiZSTT #analysis #community
Structural and temporal analysis of the blogosphere through community factorization (YC, SZ, XS, JT, BLT), pp. 163–172.
KDD-2007-ChopraTLCL #parametricity
Discovering the hidden structure of house prices with a non-parametric latent manifold model (SC, TT, JL, AC, YL), pp. 173–182.
KDD-2007-CotofreiS #data mining #mining #probability #process
Stochastic processes and temporal data mining (PC, KS), pp. 183–190.
KDD-2007-CrabtreeAG #aspect-oriented #automation #query
Exploiting underrepresented query aspects for automatic query expansion (DC, PA, XG), pp. 191–200.
KDD-2007-CulottaWHMM #adaptation #database #metric #similarity #using
Canonicalization of database records using adaptive similarity measures (AC, MLW, RH, MM, AM), pp. 201–209.
KDD-2007-DaiXYY #classification #clustering #documentation
Co-clustering based classification for out-of-domain documents (WD, GRX, QY, YY), pp. 210–219.
KDD-2007-DasS #category theory #dataset #detection
Detecting anomalous records in categorical datasets (KD, JGS), pp. 220–229.
KDD-2007-DasguptaDHJM #classification #feature model
Feature selection methods for text classification (AD, PD, BH, VJ, MWM), pp. 230–239.
KDD-2007-DavidsonRE #clustering #incremental #performance
Efficient incremental constrained clustering (ID, SSR, ME), pp. 240–249.
KDD-2007-DeodharG #clustering #framework #learning
A framework for simultaneous co-clustering and learning from complex data (MD, JG), pp. 250–259.
KDD-2007-DingSJL #framework #kernel #learning #recommendation #using
A learning framework using Green’s function and kernel regularization with application to recommender system (CHQD, RJ, TL, HDS), pp. 260–269.
KDD-2007-DouFRFMT #development #framework #mining #ontology
Development of NeuroElectroMagnetic ontologies(NEMO): a framework for mining brainwave ontologies (DD, GAF, JR, RMF, ADM, DMT), pp. 270–279.
KDD-2007-DruckPMZ #classification #generative #hybrid
Semi-supervised classification with hybrid generative/discriminative methods (GD, CP, AM, XZ), pp. 280–289.
KDD-2007-FriedlandJ #identification
Finding tribes: identifying close-knit individuals from employment patterns (LF, DJ), pp. 290–299.
KDD-2007-FungYLY
Time-dependent event hierarchy construction (GPCF, JXY, HL, PSY), pp. 300–309.
KDD-2007-GaoESCX #consistency #data mining #mining #problem #set
The minimum consistent subset cover problem and its applications in data mining (BJG, ME, JyC, OS, HX), pp. 310–319.
KDD-2007-GeEJD #clustering #constraints
Constraint-driven clustering (RG, ME, WJ, ID), pp. 320–329.
KDD-2007-GiannottiNPP #mining
Trajectory pattern mining (FG, MN, FP, DP), pp. 330–339.
KDD-2007-GuoZXF #data mining #database #learning #mining #multimodal
Enhanced max margin learning on multimodal data mining in a multimedia database (ZG, ZZ, EPX, CF), pp. 340–349.
KDD-2007-HeikinheimoSHMM #set
Finding low-entropy sets and trees from binary data (HH, JKS, EH, HM, TM), pp. 350–359.
KDD-2007-JanssensGM #analysis #clustering #hybrid #mining
Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis (FALJ, WG, BDM), pp. 360–369.
KDD-2007-JoLG #correlation #detection #graph #research #topic
Detecting research topics via the correlation between graphs and texts (YJ, CL, CLG), pp. 370–379.
KDD-2007-KarrasMS #summary
Exploiting duality in summarization with deterministic guarantees (PK, DS, NM), pp. 380–389.
KDD-2007-KeCN #correlation #database #graph
Correlation search in graph databases (YK, JC, WN), pp. 390–399.
KDD-2007-KolczY #classification
Raising the baseline for high-precision text classifiers (AK, WtY), pp. 400–409.
KDD-2007-LaxmanSU #algorithm #performance
A fast algorithm for finding frequent episodes in event streams (SL, PSS, KPU), pp. 410–419.
KDD-2007-LeskovecKGFVG #detection #effectiveness #network
Cost-effective outbreak detection in networks (JL, AK, CG, CF, JMV, NSG), pp. 420–429.
KDD-2007-LiLW #equivalence #mining #statistics
Mining statistically important equivalence classes and delta-discriminative emerging patterns (JL, GL, LW), pp. 430–439.
KDD-2007-Li #random #reduction
Very sparse stable random projections for dimension reduction in lalpha (0 &lt;alpha<=2) norm (PL0), pp. 440–449.
KDD-2007-LiuJJ #clustering #constraints #named
BoostCluster: boosting clustering by pairwise constraints (YL, RJ, AKJ), pp. 450–459.
KDD-2007-LoKL #mining #performance #specification
Efficient mining of iterative patterns for software specification discovery (DL, SCK, CL), pp. 460–469.
KDD-2007-LongZY #clustering #framework #probability #relational
A probabilistic framework for relational clustering (BL, Z(Z, PSY), pp. 470–479.
KDD-2007-MannilaT
Nestedness and segmented nestedness (HM, ET), pp. 480–489.
KDD-2007-MeiSZ #automation #modelling #multi #topic
Automatic labeling of multinomial topic models (QM, XS, CZ), pp. 490–499.
KDD-2007-MimnoM #modelling
Expertise modeling for matching papers with reviewers (DMM, AM), pp. 500–509.
KDD-2007-MoserGE #analysis #clustering #specification
Joint cluster analysis of attribute and relationship data withouta-priori specification of the number of clusters (FM, RG, ME), pp. 510–519.
KDD-2007-NallapatiDLU #multi #topic
Multiscale topic tomography (RN, SD, JDL, KU), pp. 520–529.
KDD-2007-NijssenF #mining
Mining optimal decision trees from itemset lattices (SN, ÉF), pp. 530–539.
KDD-2007-PandeySGGK #case study #interactive #network #predict
Association analysis-based transformations for protein interaction networks: a function prediction case study (GP, MS, RG, TG, VK), pp. 540–549.
KDD-2007-ParkP #collaboration #ranking
Applying collaborative filtering techniques to movie search for better ranking and browsing (STP, DMP), pp. 550–559.
KDD-2007-PonCBC #multi #topic
Tracking multiple topics for finding interesting articles (RKP, AFC, DB, TC), pp. 560–569.
KDD-2007-RadlinskiJ #learning #ranking
Active exploration for learning rankings from clickthrough data (FR, TJ), pp. 570–579.
KDD-2007-Sandler #analysis #modelling #probability
Hierarchical mixture models: a probabilistic analysis (MS), pp. 580–589.
KDD-2007-SatoN #documentation #information management #multi #parametricity #topic #using
Knowledge discovery of multiple-topic document using parametric mixture model with dirichlet prior (IS, HN), pp. 590–598.
KDD-2007-Schickel-ZuberF #clustering #learning #recommendation #using
Using hierarchical clustering for learning theontologies used in recommendation systems (VSZ, BF), pp. 599–608.
KDD-2007-Sculley #feedback #learning
Practical learning from one-sided feedback (DS), pp. 609–618.
KDD-2007-ShaparenkoJ #database #documentation
Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases (BS, TJ), pp. 619–628.
KDD-2007-ShehataKK #categorisation #concept
A concept-based model for enhancing text categorization (SS, FK, MK), pp. 629–637.
KDD-2007-ShengL #learning
Partial example acquisition in cost-sensitive learning (VSS, CXL), pp. 638–646.
KDD-2007-ShigaTM #approach #clustering #composition #network
A spectral clustering approach to optimally combining numericalvectors with a modular network (MS, IT, HM), pp. 647–656.
KDD-2007-SmithE #bias #classification #generative #robust
Making generative classifiers robust to selection bias (ATS, CE), pp. 657–666.
KDD-2007-SongWJR #detection #multi #statistics
Statistical change detection for multi-dimensional data (XS, MW, CMJ, SR), pp. 667–676.
KDD-2007-SrihariXS #documentation #generative #using
Use of ranked cross document evidence trails for hypothesis generation (RKS, LX, TS), pp. 677–686.
KDD-2007-SunFPY #graph #mining #named #scalability
GraphScope: parameter-free mining of large time-evolving graphs (JS, CF, SP, PSY), pp. 687–696.
KDD-2007-TandonC #detection #network #validation
Weighting versus pruning in rule validation for detecting network and host anomalies (GT, PKC), pp. 697–706.
KDD-2007-TangWXZ #clustering #perspective
Enhancing semi-supervised clustering: a feature projection perspective (WT, HX, SZ, JW), pp. 707–716.
KDD-2007-TantipathananandhBK #community #framework #identification #network #social
A framework for community identification in dynamic social networks (CT, TYBW, DK), pp. 717–726.
KDD-2007-TeoSVL #composition #scalability
A scalable modular convex solver for regularized risk minimization (CHT, AJS, SVNV, QVL), pp. 727–736.
KDD-2007-TongFGE #graph #pattern matching #performance #scalability
Fast best-effort pattern matching in large attributed graphs (HT, CF, BG, TER), pp. 737–746.
KDD-2007-TongFK #graph #mining #performance #proximity
Fast direction-aware proximity for graph mining (HT, CF, YK), pp. 747–756.
KDD-2007-VogelAS #linear #scalability
Scalable look-ahead linear regression trees (DSV, OA, TS), pp. 757–764.
KDD-2007-VreekenLS #difference
Characterising the difference (JV, MvL, AS), pp. 765–774.
KDD-2007-WanNHL #privacy
Privacy-preservation for gradient descent methods (LW, WKN, SH, VCSL), pp. 775–783.
KDD-2007-WangZHS #coordination #correlation #mining #topic
Mining correlated bursty topic patterns from coordinated text streams (XW, CZ, XH, RS), pp. 784–793.
KDD-2007-WangPM #analysis #component
Generalized component analysis for text with heterogeneous attributes (XW, CP, AM), pp. 794–803.
KDD-2007-WongFPW #mining
Mining favorable facets (RCWW, JP, AWCF, KW), pp. 804–813.
KDD-2007-WuWCX #analysis #composition
Local decomposition for rare class analysis (JW, HX, PW, JC), pp. 814–823.
KDD-2007-XuYFS #algorithm #clustering #named #network
SCAN: a structural clustering algorithm for networks (XX, NY, ZF, TAJS), pp. 824–833.
KDD-2007-YanTS #classification #multi
Model-shared subspace boosting for multi-label classification (RY, JT, JRS), pp. 834–843.
KDD-2007-YankovKMCZ #detection #scalability
Detecting time series motifs under uniform scaling (DY, EJK, JM, BYcC, VBZ), pp. 844–853.
KDD-2007-YeJC #analysis #kernel #learning #matrix #polynomial #programming
Learning the kernel matrix in discriminant analysis via quadratically constrained quadratic programming (JY, SJ, JC), pp. 854–863.
KDD-2007-YuanWY #semantics #visual notation
From frequent itemsets to semantically meaningful visual patterns (JY, YW, MY), pp. 864–873.
KDD-2007-ZhangHZLC #distance
Information distance from a question to an answer (XZ, YH, XZ, ML, DRC), pp. 874–883.
KDD-2007-ZhaoMY #mining
Mining templates from search result records of search engines (HZ, WM, CTY), pp. 884–893.
KDD-2007-ZhengSWW #detection #generative #optimisation
Joint optimization of wrapper generation and template detection (SZ, RS, JRW, DW), pp. 894–902.
KDD-2007-ZhuZNWH #approach #comprehension
Webpage understanding: an integrated approach (JZ, BZ, ZN, JRW, HWH), pp. 903–912.
KDD-2007-AsurPU #behaviour #framework #graph #interactive
An event-based framework for characterizing the evolutionary behavior of interaction graphs (SA, SP, DU), pp. 913–921.
KDD-2007-CastanoWCST #analysis
On-board analysis of uncalibrated data for a spacecraft at mars (RC, KW, SAC, TMS, BT), pp. 922–930.
KDD-2007-FastFMTJGK #detection #preprocessor #relational
Relational data pre-processing techniques for improved securities fraud detection (ASF, LF, MEM, BJT, DJ, HGG, JK), pp. 941–949.
KDD-2007-HuaP #approach #heuristic
Cleaning disguised missing data: a heuristic approach (MH, JP), pp. 950–958.
KDD-2007-KohaviHS #web
Practical guide to controlled experiments on the web: listen to your customers not to the hippo (RK, RMH, DS), pp. 959–967.
KDD-2007-LuoXLS #classification #distributed #network #peer-to-peer
Distributed classification in peer-to-peer networks (PL, HX, KL, ZS), pp. 968–976.
KDD-2007-PerlichRLZ #estimation #modelling
High-quantile modeling for customer wallet estimation and other applications (CP, SR, RDL, BZ), pp. 977–985.
KDD-2007-ZhaoDZ #mining #network
Mining complex power networks for blackout prevention (JHZ, ZYD, PZ), pp. 986–994.
KDD-2007-ZhaoB #web
Corroborate and learn facts from the web (SZ, JB), pp. 995–1003.
KDD-2007-ZhuBK #automation
Extracting relevant named entities for automated expense reimbursement (GZ, TJB, VK), pp. 1004–1012.
KDD-2007-Aggarwal #classification #data type #framework #segmentation
A framework for classification and segmentation of massive audio data streams (CCA), pp. 1013–1017.
KDD-2007-CurryGLVB #case study #detection #scalability #set
Detecting changes in large data sets of payment card data: a case study (CC, RLG, DL, SV, JB), pp. 1018–1022.
KDD-2007-PanZZPSPY #mining #modelling #network
Domain-constrained semi-supervised mining of tracking models in sensor networks (RP, JZ, VWZ, JJP, DS, SJP, QY), pp. 1023–1027.
KDD-2007-PengPLW #summary
Event summarization for system management (WP, CP, TL, HW), pp. 1028–1032.
KDD-2007-RaoBFSON #detection #machine learning #named
LungCAD: a clinically approved, machine learning system for lung cancer detection (RBR, JB, GF, MS, NO, DPN), pp. 1033–1037.
KDD-2007-YanL #machine learning
Machine learning for stock selection (RJY, CXL), pp. 1038–1042.
KDD-2007-YeWLY #detection #named
IMDS: intelligent malware detection system (YY, DW, TL, DY), pp. 1043–1047.
KDD-2007-YinHY #multi #web
Truth discovery with multiple conflicting information providers on the web (XY, JH, PSY), pp. 1048–1052.
KDD-2007-Parthasarathy #data mining #learning #mining
Data mining at the crossroads: successes, failures and learning from them (SP), pp. 1053–1055.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.