Pavel Berkhin, Rich Caruana, Xindong Wu
Proceedings of the 13th International Conference on Knowledge Discovery and Data Mining
KDD, 2007.
@proceedings{KDD-2007, address = "San Jose, California, USA", editor = "Pavel Berkhin and Rich Caruana and Xindong Wu", isbn = "978-1-59593-609-7", publisher = "{ACM}", title = "{Proceedings of the 13th International Conference on Knowledge Discovery and Data Mining}", year = 2007, }
Contents (114 items)
- KDD-2007-Anderson
- Calculating latent demand in the long tail (CA), p. 1.
- KDD-2007-Fayyad #internet #mining #web
- From mining the web to inventing the new sciences underlying the internet (UMF), pp. 2–3.
- KDD-2007-Kleinberg #challenge #mining #network #privacy #process #social
- Challenges in mining social network data: processes, privacy, and paradoxes (JMK), pp. 4–5.
- KDD-2007-AgarwalBGYKS #effectiveness #performance #summary
- Efficient and effective explanation of change in hierarchical summaries (DA, DB, DG, NEY, FK, DS), pp. 6–15.
- KDD-2007-AgarwalBCDJS #multi
- Estimating rates of rare events at multiple resolutions (DA, AZB, DC, DD, VJ, MS), pp. 16–25.
- KDD-2007-AgarwalM #modelling #predict #scalability
- Predictive discrete latent factor models for large scale dyadic data (DA, SM), pp. 26–35.
- KDD-2007-AggarwalY #classification #data type #on the #string
- On string classification in data streams (CCA, PSY), pp. 36–45.
- KDD-2007-AggarwalTWFZ #clustering #documentation #framework #named #xml
- Xproj: a framework for projected structural clustering of xml documents (CCA, NT, JW, JF, MJZ), pp. 46–55.
- KDD-2007-ArchakGI #exclamation #mining #power of
- Show me the money!: deriving the pricing power of product features by mining consumer reviews (NA, AG, PGI), pp. 56–65.
- KDD-2007-ArnoldLA #modelling #visual notation
- Temporal causal modeling with graphical granger methods (AA, YL, NA), pp. 66–75.
- KDD-2007-Baeza-YatesT #query #semantics
- Extracting semantic relations from query logs (RABY, AT), pp. 76–85.
- KDD-2007-BeckerA #concept #ranking #realtime #using
- Real-time ranking with concept drift using expert advice (HB, MA), pp. 86–94.
- KDD-2007-BellKV #modelling #multi #recommendation #scalability
- Modeling relationships at multiple scales to improve accuracy of large recommender systems (RMB, YK, CV), pp. 95–104.
- KDD-2007-BhagwatEM #clustering #corpus #documentation #scalability #similarity
- Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus (DB, KE, PM), pp. 105–112.
- KDD-2007-ChaovalitwongseFS #classification #process
- Support feature machine for classification of abnormal brain activity (WAC, YJF, RCS), pp. 113–122.
- KDD-2007-ChenZYL #adaptation #clustering #distance #learning #metric
- Nonlinear adaptive distance metric learning for clustering (JC, ZZ, JY, HL), pp. 123–132.
- KDD-2007-ChenT #clustering #realtime
- Density-based clustering for real-time stream data (YC, LT), pp. 133–142.
- KDD-2007-ChewBKA #information retrieval #using
- Cross-language information retrieval using PARAFAC2 (PAC, BWB, TGK, AA), pp. 143–152.
- KDD-2007-ChiSZHT #clustering
- Evolutionary spectral clustering by incorporating temporal smoothness (YC, XS, DZ, KH, BLT), pp. 153–162.
- KDD-2007-ChiZSTT #analysis #community
- Structural and temporal analysis of the blogosphere through community factorization (YC, SZ, XS, JT, BLT), pp. 163–172.
- KDD-2007-ChopraTLCL #parametricity
- Discovering the hidden structure of house prices with a non-parametric latent manifold model (SC, TT, JL, AC, YL), pp. 173–182.
- KDD-2007-CotofreiS #data mining #mining #probability #process
- Stochastic processes and temporal data mining (PC, KS), pp. 183–190.
- KDD-2007-CrabtreeAG #aspect-oriented #automation #query
- Exploiting underrepresented query aspects for automatic query expansion (DC, PA, XG), pp. 191–200.
- KDD-2007-CulottaWHMM #adaptation #database #metric #similarity #using
- Canonicalization of database records using adaptive similarity measures (AC, MLW, RH, MM, AM), pp. 201–209.
- KDD-2007-DaiXYY #classification #clustering #documentation
- Co-clustering based classification for out-of-domain documents (WD, GRX, QY, YY), pp. 210–219.
- KDD-2007-DasS #category theory #dataset #detection
- Detecting anomalous records in categorical datasets (KD, JGS), pp. 220–229.
- KDD-2007-DasguptaDHJM #classification #feature model
- Feature selection methods for text classification (AD, PD, BH, VJ, MWM), pp. 230–239.
- KDD-2007-DavidsonRE #clustering #incremental #performance
- Efficient incremental constrained clustering (ID, SSR, ME), pp. 240–249.
- KDD-2007-DeodharG #clustering #framework #learning
- A framework for simultaneous co-clustering and learning from complex data (MD, JG), pp. 250–259.
- KDD-2007-DingSJL #framework #kernel #learning #recommendation #using
- A learning framework using Green’s function and kernel regularization with application to recommender system (CHQD, RJ, TL, HDS), pp. 260–269.
- KDD-2007-DouFRFMT #development #framework #mining #ontology
- Development of NeuroElectroMagnetic ontologies(NEMO): a framework for mining brainwave ontologies (DD, GAF, JR, RMF, ADM, DMT), pp. 270–279.
- KDD-2007-DruckPMZ #classification #generative #hybrid
- Semi-supervised classification with hybrid generative/discriminative methods (GD, CP, AM, XZ), pp. 280–289.
- KDD-2007-FriedlandJ #identification
- Finding tribes: identifying close-knit individuals from employment patterns (LF, DJ), pp. 290–299.
- KDD-2007-FungYLY
- Time-dependent event hierarchy construction (GPCF, JXY, HL, PSY), pp. 300–309.
- KDD-2007-GaoESCX #consistency #data mining #mining #problem #set
- The minimum consistent subset cover problem and its applications in data mining (BJG, ME, JyC, OS, HX), pp. 310–319.
- KDD-2007-GeEJD #clustering #constraints
- Constraint-driven clustering (RG, ME, WJ, ID), pp. 320–329.
- KDD-2007-GiannottiNPP #mining
- Trajectory pattern mining (FG, MN, FP, DP), pp. 330–339.
- KDD-2007-GuoZXF #data mining #database #learning #mining #multimodal
- Enhanced max margin learning on multimodal data mining in a multimedia database (ZG, ZZ, EPX, CF), pp. 340–349.
- KDD-2007-HeikinheimoSHMM #set
- Finding low-entropy sets and trees from binary data (HH, JKS, EH, HM, TM), pp. 350–359.
- KDD-2007-JanssensGM #analysis #clustering #hybrid #mining
- Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis (FALJ, WG, BDM), pp. 360–369.
- KDD-2007-JoLG #correlation #detection #graph #research #topic
- Detecting research topics via the correlation between graphs and texts (YJ, CL, CLG), pp. 370–379.
- KDD-2007-KarrasMS #summary
- Exploiting duality in summarization with deterministic guarantees (PK, DS, NM), pp. 380–389.
- KDD-2007-KeCN #correlation #database #graph
- Correlation search in graph databases (YK, JC, WN), pp. 390–399.
- KDD-2007-KolczY #classification
- Raising the baseline for high-precision text classifiers (AK, WtY), pp. 400–409.
- KDD-2007-LaxmanSU #algorithm #performance
- A fast algorithm for finding frequent episodes in event streams (SL, PSS, KPU), pp. 410–419.
- KDD-2007-LeskovecKGFVG #detection #effectiveness #network
- Cost-effective outbreak detection in networks (JL, AK, CG, CF, JMV, NSG), pp. 420–429.
- KDD-2007-LiLW #equivalence #mining #statistics
- Mining statistically important equivalence classes and delta-discriminative emerging patterns (JL, GL, LW), pp. 430–439.
- KDD-2007-Li #random #reduction
- Very sparse stable random projections for dimension reduction in lalpha (0 <alpha<=2) norm (PL0), pp. 440–449.
- KDD-2007-LiuJJ #clustering #constraints #named
- BoostCluster: boosting clustering by pairwise constraints (YL, RJ, AKJ), pp. 450–459.
- KDD-2007-LoKL #mining #performance #specification
- Efficient mining of iterative patterns for software specification discovery (DL, SCK, CL), pp. 460–469.
- KDD-2007-LongZY #clustering #framework #probability #relational
- A probabilistic framework for relational clustering (BL, Z(Z, PSY), pp. 470–479.
- KDD-2007-MannilaT
- Nestedness and segmented nestedness (HM, ET), pp. 480–489.
- KDD-2007-MeiSZ #automation #modelling #multi #topic
- Automatic labeling of multinomial topic models (QM, XS, CZ), pp. 490–499.
- KDD-2007-MimnoM #modelling
- Expertise modeling for matching papers with reviewers (DMM, AM), pp. 500–509.
- KDD-2007-MoserGE #analysis #clustering #specification
- Joint cluster analysis of attribute and relationship data withouta-priori specification of the number of clusters (FM, RG, ME), pp. 510–519.
- KDD-2007-NallapatiDLU #multi #topic
- Multiscale topic tomography (RN, SD, JDL, KU), pp. 520–529.
- KDD-2007-NijssenF #mining
- Mining optimal decision trees from itemset lattices (SN, ÉF), pp. 530–539.
- KDD-2007-PandeySGGK #case study #interactive #network #predict
- Association analysis-based transformations for protein interaction networks: a function prediction case study (GP, MS, RG, TG, VK), pp. 540–549.
- KDD-2007-ParkP #collaboration #ranking
- Applying collaborative filtering techniques to movie search for better ranking and browsing (STP, DMP), pp. 550–559.
- KDD-2007-PonCBC #multi #topic
- Tracking multiple topics for finding interesting articles (RKP, AFC, DB, TC), pp. 560–569.
- KDD-2007-RadlinskiJ #learning #ranking
- Active exploration for learning rankings from clickthrough data (FR, TJ), pp. 570–579.
- KDD-2007-Sandler #analysis #modelling #probability
- Hierarchical mixture models: a probabilistic analysis (MS), pp. 580–589.
- KDD-2007-SatoN #documentation #information management #multi #parametricity #topic #using
- Knowledge discovery of multiple-topic document using parametric mixture model with dirichlet prior (IS, HN), pp. 590–598.
- KDD-2007-Schickel-ZuberF #clustering #learning #recommendation #using
- Using hierarchical clustering for learning theontologies used in recommendation systems (VSZ, BF), pp. 599–608.
- KDD-2007-Sculley #feedback #learning
- Practical learning from one-sided feedback (DS), pp. 609–618.
- KDD-2007-ShaparenkoJ #database #documentation
- Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases (BS, TJ), pp. 619–628.
- KDD-2007-ShehataKK #categorisation #concept
- A concept-based model for enhancing text categorization (SS, FK, MK), pp. 629–637.
- KDD-2007-ShengL #learning
- Partial example acquisition in cost-sensitive learning (VSS, CXL), pp. 638–646.
- KDD-2007-ShigaTM #approach #clustering #composition #network
- A spectral clustering approach to optimally combining numericalvectors with a modular network (MS, IT, HM), pp. 647–656.
- KDD-2007-SmithE #bias #classification #generative #robust
- Making generative classifiers robust to selection bias (ATS, CE), pp. 657–666.
- KDD-2007-SongWJR #detection #multi #statistics
- Statistical change detection for multi-dimensional data (XS, MW, CMJ, SR), pp. 667–676.
- KDD-2007-SrihariXS #documentation #generative #using
- Use of ranked cross document evidence trails for hypothesis generation (RKS, LX, TS), pp. 677–686.
- KDD-2007-SunFPY #graph #mining #named #scalability
- GraphScope: parameter-free mining of large time-evolving graphs (JS, CF, SP, PSY), pp. 687–696.
- KDD-2007-TandonC #detection #network #validation
- Weighting versus pruning in rule validation for detecting network and host anomalies (GT, PKC), pp. 697–706.
- KDD-2007-TangWXZ #clustering #perspective
- Enhancing semi-supervised clustering: a feature projection perspective (WT, HX, SZ, JW), pp. 707–716.
- KDD-2007-TantipathananandhBK #community #framework #identification #network #social
- A framework for community identification in dynamic social networks (CT, TYBW, DK), pp. 717–726.
- KDD-2007-TeoSVL #composition #scalability
- A scalable modular convex solver for regularized risk minimization (CHT, AJS, SVNV, QVL), pp. 727–736.
- KDD-2007-TongFGE #graph #pattern matching #performance #scalability
- Fast best-effort pattern matching in large attributed graphs (HT, CF, BG, TER), pp. 737–746.
- KDD-2007-TongFK #graph #mining #performance #proximity
- Fast direction-aware proximity for graph mining (HT, CF, YK), pp. 747–756.
- KDD-2007-VogelAS #linear #scalability
- Scalable look-ahead linear regression trees (DSV, OA, TS), pp. 757–764.
- KDD-2007-VreekenLS #difference
- Characterising the difference (JV, MvL, AS), pp. 765–774.
- KDD-2007-WanNHL #privacy
- Privacy-preservation for gradient descent methods (LW, WKN, SH, VCSL), pp. 775–783.
- KDD-2007-WangZHS #coordination #correlation #mining #topic
- Mining correlated bursty topic patterns from coordinated text streams (XW, CZ, XH, RS), pp. 784–793.
- KDD-2007-WangPM #analysis #component
- Generalized component analysis for text with heterogeneous attributes (XW, CP, AM), pp. 794–803.
- KDD-2007-WongFPW #mining
- Mining favorable facets (RCWW, JP, AWCF, KW), pp. 804–813.
- KDD-2007-WuWCX #analysis #composition
- Local decomposition for rare class analysis (JW, HX, PW, JC), pp. 814–823.
- KDD-2007-XuYFS #algorithm #clustering #named #network
- SCAN: a structural clustering algorithm for networks (XX, NY, ZF, TAJS), pp. 824–833.
- KDD-2007-YanTS #classification #multi
- Model-shared subspace boosting for multi-label classification (RY, JT, JRS), pp. 834–843.
- KDD-2007-YankovKMCZ #detection #scalability
- Detecting time series motifs under uniform scaling (DY, EJK, JM, BYcC, VBZ), pp. 844–853.
- KDD-2007-YeJC #analysis #kernel #learning #matrix #polynomial #programming
- Learning the kernel matrix in discriminant analysis via quadratically constrained quadratic programming (JY, SJ, JC), pp. 854–863.
- KDD-2007-YuanWY #semantics #visual notation
- From frequent itemsets to semantically meaningful visual patterns (JY, YW, MY), pp. 864–873.
- KDD-2007-ZhangHZLC #distance
- Information distance from a question to an answer (XZ, YH, XZ, ML, DRC), pp. 874–883.
- KDD-2007-ZhaoMY #mining
- Mining templates from search result records of search engines (HZ, WM, CTY), pp. 884–893.
- KDD-2007-ZhengSWW #detection #generative #optimisation
- Joint optimization of wrapper generation and template detection (SZ, RS, JRW, DW), pp. 894–902.
- KDD-2007-ZhuZNWH #approach #comprehension
- Webpage understanding: an integrated approach (JZ, BZ, ZN, JRW, HWH), pp. 903–912.
- KDD-2007-AsurPU #behaviour #framework #graph #interactive
- An event-based framework for characterizing the evolutionary behavior of interaction graphs (SA, SP, DU), pp. 913–921.
- KDD-2007-CastanoWCST #analysis
- On-board analysis of uncalibrated data for a spacecraft at mars (RC, KW, SAC, TMS, BT), pp. 922–930.
- KDD-2007-FastFMTJGK #detection #preprocessor #relational
- Relational data pre-processing techniques for improved securities fraud detection (ASF, LF, MEM, BJT, DJ, HGG, JK), pp. 941–949.
- KDD-2007-HuaP #approach #heuristic
- Cleaning disguised missing data: a heuristic approach (MH, JP), pp. 950–958.
- KDD-2007-KohaviHS #web
- Practical guide to controlled experiments on the web: listen to your customers not to the hippo (RK, RMH, DS), pp. 959–967.
- KDD-2007-LuoXLS #classification #distributed #network #peer-to-peer
- Distributed classification in peer-to-peer networks (PL, HX, KL, ZS), pp. 968–976.
- KDD-2007-PerlichRLZ #estimation #modelling
- High-quantile modeling for customer wallet estimation and other applications (CP, SR, RDL, BZ), pp. 977–985.
- KDD-2007-ZhaoDZ #mining #network
- Mining complex power networks for blackout prevention (JHZ, ZYD, PZ), pp. 986–994.
- KDD-2007-ZhaoB #web
- Corroborate and learn facts from the web (SZ, JB), pp. 995–1003.
- KDD-2007-ZhuBK #automation
- Extracting relevant named entities for automated expense reimbursement (GZ, TJB, VK), pp. 1004–1012.
- KDD-2007-Aggarwal #classification #data type #framework #segmentation
- A framework for classification and segmentation of massive audio data streams (CCA), pp. 1013–1017.
- KDD-2007-CurryGLVB #case study #detection #scalability #set
- Detecting changes in large data sets of payment card data: a case study (CC, RLG, DL, SV, JB), pp. 1018–1022.
- KDD-2007-PanZZPSPY #mining #modelling #network
- Domain-constrained semi-supervised mining of tracking models in sensor networks (RP, JZ, VWZ, JJP, DS, SJP, QY), pp. 1023–1027.
- KDD-2007-PengPLW #summary
- Event summarization for system management (WP, CP, TL, HW), pp. 1028–1032.
- KDD-2007-RaoBFSON #detection #machine learning #named
- LungCAD: a clinically approved, machine learning system for lung cancer detection (RBR, JB, GF, MS, NO, DPN), pp. 1033–1037.
- KDD-2007-YanL #machine learning
- Machine learning for stock selection (RJY, CXL), pp. 1038–1042.
- KDD-2007-YeWLY #detection #named
- IMDS: intelligent malware detection system (YY, DW, TL, DY), pp. 1043–1047.
- KDD-2007-YinHY #multi #web
- Truth discovery with multiple conflicting information providers on the web (XY, JH, PSY), pp. 1048–1052.
- KDD-2007-Parthasarathy #data mining #learning #mining
- Data mining at the crossroads: successes, failures and learning from them (SP), pp. 1053–1055.
20 ×#mining
17 ×#clustering
11 ×#detection
10 ×#classification
10 ×#learning
10 ×#network
9 ×#multi
9 ×#scalability
8 ×#analysis
8 ×#framework
17 ×#clustering
11 ×#detection
10 ×#classification
10 ×#learning
10 ×#network
9 ×#multi
9 ×#scalability
8 ×#analysis
8 ×#framework