Tag #semistructured data
147 papers:
CIKM-2019-0002LG #data type #performance- Efficient Join Processing Over Incomplete Data Streams (WR0, XL, KG), pp. 209–218.
ICML-2019-AntelmiARL #analysis #multi- Sparse Multi-Channel Variational Autoencoder for the Joint Analysis of Heterogeneous Data (LA, NA, PR, ML), pp. 302–311.
ICML-2019-MatteiF #generative #modelling #named #set- MIWAE: Deep Generative Modelling and Imputation of Incomplete Data Sets (PAM, JF), pp. 4413–4423.
CIKM-2018-0002WYTZZ #named #predict #social- vec2Link: Unifying Heterogeneous Data for Social Link Prediction (FZ0, BW, YY, GT, KZ, TZ), pp. 1843–1846.
CIKM-2018-BuenrostroTRPC #ecosystem #privacy- Single-Setup Privacy Enforcement for Heterogeneous Data Ecosystems (IB, AT, VR, EP, ZC), pp. 1943–1946.
KDD-2018-CardosoDV #learning #personalisation #recommendation #towards- Product Characterisation towards Personalisation: Learning Attributes from Unstructured Data to Recommend Fashion Products (ÂC, FD, SV), pp. 80–89.
KDD-2018-HanSSZ #collaboration #learning #multi- Multi-label Learning with Highly Incomplete Data via Collaborative Embedding (YH, GS, YS, XZ0), pp. 1494–1503.
JCDL-2017-CastroCWPF #framework #platform #using- Classifying Short Unstructured Data Using the Apache Spark Platform (EPSC, SC, EW, DAP, EAF), pp. 129–138.
CIKM-2017-ZhangGHCL #probability- Probabilistic Skyline on Incomplete Data (KZ, HG, XH, ZC, JL), pp. 427–436.
KDD-2017-BojchevskiMG #clustering #modelling #robust- Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings (AB, YM, SG), pp. 737–746.
KDD-2017-MaMXLGSZ #data flow- Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (FM, CM, HX, QL0, JG0, LS, AZ), pp. 967–976.
ICPR-2016-NguyenNVP #clustering #multi #named #parametricity- MCNC: Multi-Channel Nonparametric Clustering from heterogeneous data (TBN, VN0, SV, DQP), pp. 3633–3638.
POPL-2016-RaychevBVK #learning #source code- Learning programs from noisy data (VR, PB, MTV, AK0), pp. 761–774.
MSR-2015-PonzanelliML #development #mining- Summarizing Complex Development Artifacts by Mining Heterogeneous Data (LP, AM, ML), pp. 401–405.
CIKM-2015-BhowmickDCA #feedback #paradigm #query #visual notation- Interruption-Sensitive Empty Result Feedback: Rethinking the Visual Query Feedback Paradigm for Semistructured Data (SSB, CED, BC, MHA), pp. 723–732.
CIKM-2015-KantereOKS #data flow #query- Query Relaxation across Heterogeneous Data Sources (VK, GO, AK, TKS), pp. 473–482.
MLDM-2015-MojahedBWI #analysis #clustering #matrix #similarity #using- Applying Clustering Analysis to Heterogeneous Data Using Similarity Matrix Fusion (SMF) (AM, JHBS, WW, BdlI), pp. 251–265.
SIGIR-2015-MorenoD #adaptation #dataset #metric- Adapted B-CUBED Metrics to Unbalanced Datasets (JGM, GD), pp. 911–914.
JCDL-2014-ParkS #library #named- PerCon: A personal digital library for heterogeneous data (SIP, FS), pp. 97–106.
PODS-2014-Libkin #how #what- Incomplete data: what went wrong, and how to fix it (LL), pp. 1–13.
SIGMOD-2014-LiLGZFH #estimation #reliability- Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation (QL, YL, JG, BZ, WF, JH), pp. 1187–1198.
MSR-2014-MertenMBP #natural language- Classifying unstructured data into natural language text and technical information (TM, BM, SB, BP), pp. 300–303.
CIKM-2014-GressD #flexibility #framework- A Flexible Framework for Projecting Heterogeneous Data (AG, ID), pp. 1169–1178.
ICPR-2014-ChaudhariM #clustering #matrix #symmetry #using- Average Overlap for Clustering Incomplete Data Using Symmetric Non-negative Matrix Factorization (SC, MNM), pp. 1431–1436.
KDIR-2014-MojahedI #approach #distance- A Fusion Approach to Computing Distance for Heterogeneous Data (AM, BdlI), pp. 269–276.
JCDL-2013-ChenPJP #modelling #perspective #research- Modeling heterogeneous data resources for social-ecological research: a data-centric perspective (MC, UP, SJ, BP), pp. 309–312.
SIGMOD-2013-SongYYHS #data flow #retrieval #scalability- Inter-media hashing for large-scale retrieval from heterogeneous data sources (JS, YY, YY, ZH, HTS), pp. 785–796.
ICEIS-v1-2013-RodriguesAGSCS #case study #data flow #data transformation- Integrated Data Management — A Case Study in Heterogeneous Data Sources in Brazilian Government (SAR, MA, AFG, RTdS, MC, JMdS), pp. 316–321.
KDD-2013-DanilevskyWTNCDWH #mining #named #topic- AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data (MD, CW, FT, SN, GC, ND, LW, JH), pp. 1458–1461.
PPDP-2013-StewartBN #data flow #data type #dependent type #policy- Dependent types for enforcement of information flow and erasure policies in heterogeneous data structures (GS, AB, AN), pp. 145–156.
SAC-2013-KhaniHAB #algorithm #clustering #set- An algorithm for discovering clusters of different densities or shapes in noisy data sets (FK, MJH, AAA, HB), pp. 144–149.
SAC-2013-ZengC #data fusion #matrix #recommendation- Heterogeneous data fusion via matrix factorization for augmenting item, group and friend recommendations (WZ, LC), pp. 237–244.
ASPLOS-2013-DelimitrouK #named #scheduling- Paragon: QoS-aware scheduling for heterogeneous datacenters (CD, CK), pp. 77–88.
VLDB-2012-MurthyDDHMPRS #data transformation- Exploiting Evidence from Unstructured Data to Enhance Master Data Management (KM, PMD, AD, RH, MKM, DP, JR, SS), pp. 1862–1873.
ICML-2012-MnihH #image #learning- Learning to Label Aerial Images from Noisy Data (VM, GEH), p. 31.
ICPR-2012-BriaMMT #approach- A ranking-based cascade approach for unbalanced data (AB, CM, MM, FT), pp. 3439–3442.
KDIR-2012-MountassirBB #analysis #problem #sentiment #set- Addressing the Problem of Unbalanced Data Sets in Sentiment Analysis (AM, HB, IB), pp. 306–311.
SAC-2012-ParadiesMSKS #in the cloud- Entity matching for semistructured data in the Cloud (MP, SM, JS, SK, KUS), pp. 453–458.
ICLP-2012-BryS #query #simulation #unification- Simulation Unification: Beyond Querying Semistructured Data (FB, SS), pp. 1–13.
VLDB-2011-BeyerEGBEKOS #data analysis #named #scalability #scripting language- Jaql: A Scripting Language for Large Scale Semistructured Data Analysis (KSB, VE, RG, AB, MYE, CCK, FÖ, EJS), pp. 1272–1283.
VLDB-2011-RazniewskiN #database #query- Completeness of Queries over Incomplete Databases (SR, WN), pp. 749–760.
ICPC-2011-BettenburgAHS #approach #lightweight- A Lightweight Approach to Uncover Technical Artifacts in Unstructured Data (NB, BA, AEH, MS), pp. 185–188.
KMIS-2011-OugoutiBAB #architecture #data flow #integration- Architecture of Medpeer — A New P2P-based System for Integration of Heterogeneous Data Sources (NSO, HB, YA, ANB), pp. 351–354.
CIKM-2010-AzizR #data flow #multi #predict #robust- Robust prediction from multiple heterogeneous data sources with partial information (MSA, CKR), pp. 1857–1860.
ICPR-2010-GriptonL #kernel #using- Kernel Domain Description with Incomplete Data: Using Instance-Specific Margins to Avoid Imputation (AG, WL), pp. 2921–2924.
ICEIS-J-2009-AliPTD #data flow #distributed #framework #named #xquery- DeXIN: An Extensible Framework for Distributed XQuery over Heterogeneous Data Sources (MIA, RP, HLT, SD), pp. 172–183.
CIKM-2009-HaghaniMA #data type #query- Evaluating top-k queries over incomplete data streams (PH, SM, KA), pp. 877–886.
CIKM-2009-ZhongL #graph #named- 3se: a semi-structured search engine for heterogeneous data in graph model (MZ, ML), pp. 1405–1408.
ECIR-2009-KimXC #probability #retrieval- A Probabilistic Retrieval Model for Semistructured Data (JK, XX, WBC), pp. 228–239.
ICML-2009-DeodharGGCD #clustering #framework #scalability- A scalable framework for discovering coherent co-clusters in noisy data (MD, GG, JG, HC, ISD), pp. 241–248.
ICML-2008-DickHS #infinity #learning- Learning from incomplete data with infinite imputations (UD, PH, TS), pp. 232–239.
ICPR-2008-LiaoJ #learning #network #parametricity- Exploiting qualitative domain knowledge for learning Bayesian network parameters with incomplete data (WL, QJ), pp. 1–4.
KDD-2008-YeCWLZPBJLAR #data fusion- Heterogeneous data fusion for alzheimer’s disease study (JY, KC, TW, JL, ZZ, RP, MB, RJ, HL, GEA, ER), pp. 1025–1033.
KDD-2008-ZhaoWLYC #data flow #identification #multi- Identifying biologically relevant genes via multiple heterogeneous data sources (ZZ, JW, HL, JY, YC), pp. 839–847.
SIGMOD-2007-Resende #data flow- Handling heterogeneous data sources in a SOA environment with service data objects (SDO) (LR), pp. 895–897.
VLDB-2007-ChuBCDN #approach #incremental #query #relational- A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data (EC, AB, TC, AD, JFN), pp. 1045–1056.
CIKM-2007-RaghuveerJMDD #approach #performance #towards- Towards efficient search on unstructured data: an intelligent-storage approach (AR, MJ, MFM, BKD, DHCD), pp. 951–954.
ICML-2007-LiaoLC #classification- Quadratically gated mixture of experts for incomplete data classification (XL, HL, LC), pp. 553–560.
SOFTVIS-2006-HoskingKD #visualisation- A tool for visualizing schemas for semistructured data (JGH, NK, GD), pp. 149–150.
SIGMOD-2005-Choy #integration- Integration of structured and unstructured data in IBM content manager (DMC), pp. 811–816.
SIGMOD-2005-SinhaK #named #navigation- Magnet: Supporting Navigation in Semistructured Data Environments (VS, DRK), pp. 97–106.
KDD-2005-GaoLZCM #clustering #consistency #graph #higher-order- Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering (BG, TYL, XZ, QC, WYM), pp. 41–50.
KDD-2005-MeruguG #data flow #distributed #framework #learning- A distributed learning framework for heterogeneous data sources (SM, JG), pp. 208–217.
SAC-2005-YangEY #database #security #specification- Mediation security specification and enforcement for heterogeneous databases (LY, RKE, HY), pp. 354–358.
ECDL-2004-RavindranathanSGFFF #case study #data flow #library #prototype- Prototyping Digital Libraries Handling Heterogeneous Data Sources — The ETANA-DL Case Study (UR, RS, MAG, WF, EAF, JWF), pp. 186–197.
CAiSE-2004-BoydKLMR #data flow #integration #named- AutoMed: A BAV Data Integration System for Heterogeneous Data Sources (MB, SK, CL, PM, NR), pp. 82–97.
ICEIS-v1-2004-DelgadoM #architecture #concept #database #integration #semantics #towards- Towards Conceptual Mediation: A Semantic Architecture For Dynamic Integration of Heterogeneous Databases (IND, JFAM), pp. 169–176.
ICEIS-v1-2004-FerrandinC #consistency #database #integration #power of #using #xml- Referencial Integrity Model for XML Data Integrated from Heterogeneous Databases Systems — Using the Power of XML for Consistent Data Integration (MF, MSdC), pp. 15–20.
KDD-2004-WrightY #distributed #network #privacy- Privacy-preserving Bayesian network structure computation on distributed heterogeneous data (RNW, ZY), pp. 713–718.
SAC-2004-CombiOQ #approach #constraints #modelling #specification- Specifying temporal data models for semistructured data by a constraint-based approach (CC, BO, EQ), pp. 1103–1108.
PODS-2003-CaliLR #complexity #consistency #database #decidability #on the #query- On the decidability and complexity of query answering over inconsistent and incomplete databases (AC, DL, RR), pp. 260–271.
VLDB-2003-BergerBSW #query #visual notation #xml- Xcerpt and visXcerpt: From Pattern-Based to Visual Querying of XML and Semistructured Data (SB, FB, SS, CW), pp. 1053–1056.
VLDB-2003-Kuhn #database- The Zero-Delay Data Warehouse: Mobilizing Heterogeneous Databases (EK), pp. 1035–1040.
ICALP-2003-BleichenbacherKY - Decoding of Interleaved Reed Solomon Codes over Noisy Data (DB, AK, MY), pp. 97–108.
ICEIS-v1-2003-ClaypoolR #database #framework #modelling #named- Sangam: A Framework for Modeling Heterogeneous Database Transformations (KTC, EAR), pp. 219–224.
ICEIS-v1-2003-MimounePA #approach #database #ontology- An Ontology-Based Approach for Exchanging Data Between Heterogeneous Database Systems (MEHM, GP, YAA), pp. 512–524.
ICEIS-v2-2003-BendouM #learning #network- Learning Bayesian Networks From Noisy Data (MB, PM), pp. 26–33.
ICEIS-v3-2003-MangoldRM #named #using- Föderal: Management of Engineering Data Using a Semistructured Data Model (CM, RR, BM), pp. 382–389.
CIKM-2003-MaGCC #documentation #web- Extracting unstructured data from template generated web documents (LM, NG, AC, MC), pp. 512–515.
ECIR-2003-AunimoHKMPV - Question Answering System for Incomplete and Noisy Data (LA, OH, RK, JM, RP, OV), pp. 193–206.
ICML-2003-SebbanJ #approach #grammar inference #on the #statistics- On State Merging in Grammatical Inference: A Statistical Approach for Dealing with Noisy Data (MS, JCJ), pp. 688–695.
SAC-2003-LiZLO #classification #functional #learning- Gene Functional Classification by Semisupervised Learning from Heterogeneous Data (TL, SZ, QL, MO), pp. 78–82.
STOC-2003-CoppersmithS #higher-order- Reconstructing curves in three (and higher) dimensional space from noisy data (DC, MS), pp. 136–142.
SIGMOD-2002-PapakonstantinouPV #named #query- QURSED: querying and reporting semistructured data (YP, MP, VV), pp. 192–203.
CAiSE-2002-DomenigD #data flow #query- Query Explorativeness for Integrated Search in Heterogeneous Data Sources (RD, KRD), pp. 715–718.
CAiSE-2002-McBrienP #approach #architecture #database #evolution- Schema Evolution in Heterogeneous Database Architectures, A Schema Transformation Approach (PM, AP), pp. 484–499.
CAiSE-2002-StavrakasG #multi #representation #web- Multidimensional Semistructured Data: Representing Context-Dependent Information on the Web (YS, MG), pp. 183–199.
ICEIS-2002-AlarconGYG #approach #data flow #integration- Data Sources Server: An Approach to Heterogeneous Data Integration (PPA, JG, AY, CG), pp. 3–10.
CIKM-2002-Amer-YahiaFGS #logic #physics- Logical and physical support for heterogeneous data (SAY, MFF, RG, DS), pp. 270–281.
SAC-2002-CooperSS #parallel- A parallel index for semistructured data (BFC, NS, MS), pp. 890–896.
PDP-2002-GoldschmidtLDK #database #distributed #mobile- Mobile Agents in a Distributed Heterogeneous Database Syste (BG, ZL, MD, HK), pp. 123–128.
STOC-2002-AroraK #algebra- Fitting algebraic curves to noisy data (SA, SK), pp. 162–169.
ICLP-2002-BryS #declarative #model transformation #query #simulation #towards #transformation language #unification #xml- Towards a Declarative Query and Transformation Language for XML and Semistructured Data: Simulation Unification (FB, SS), pp. 255–270.
PODS-2001-KanzaS #flexibility #query- Flexible Queries Over Semistructured Data (YK, YS).
VLDB-2001-CooperSFHS #performance- A Fast Index for Semistructured Data (BFC, NS, MJF, GRH, MS), pp. 341–350.
VLDB-2001-ManolescuFK #data flow #query #xml- Answering XML Queries on Heterogeneous Data Sources (IM, DF, DK), pp. 241–250.
ICEIS-v1-2001-RossiterNH #modelling- A Universal Technique for Relating Heterogeneous Data Models (BNR, DAN, MAH), pp. 96–103.
CIKM-2001-SankeyW - Structural Inference for Semistructured Data (JS, RKW), pp. 159–166.
ICML-2001-Jiang #aspect-oriented- Some Theoretical Aspects of Boosting in the Presence of Noisy Data (WJ), pp. 234–241.
ICML-2001-KriegerLW - Boosting Noisy Data (AK, CL, AJW), pp. 274–281.
KDD-2001-AggarwalP #concept #mining #re-engineering #set- Mining massively incomplete data sets by conceptual reconstruction (CCA, SP), pp. 227–232.
KDD-2001-PadmanabhanZK #personalisation #what- Personalization from incomplete data: what you don’t know can hurt (BP, Z(Z, SOK), pp. 154–163.
SAC-2001-StaudtKR #data flow #execution #process- Access to heterogeneous data sources for supporting business process execution (MS, JUK, UR), pp. 197–206.
LICS-2001-Abiteboul #theory and practice- Semistructured Data: from Practice to Theory (SA), pp. 379–386.
ECDL-2000-CamposS #data flow #documentation #integration #named- ActiveXML: Compound Documents for Integration of Heterogeneous Data Sources (JPC, MJS), pp. 380–384.
CIKM-2000-DomenigD #approach #data flow #query- A Query based Approach for Integrating Heterogeneous Data Sources (RD, KRD), pp. 453–460.
ICML-2000-Eskin #detection #probability #using- Anomaly Detection over Noisy Data using Learned Probability Distributions (EE), pp. 255–262.
ICML-2000-ZupanBBD #concept #induction- Induction of Concept Hierarchies from Noisy Data (BZ, IB, MB, JD), pp. 1199–1206.
SAC-2000-OchKO #data flow #using- Integrating Heterogeneous Data Sources Using the COIL Mediator Definition Language (CO, RK, RO), pp. 991–1000.
PODS-1999-KanzaNS #query- Queries with Incomplete Answers over Semistructured Data (YK, WN, YS), pp. 227–236.
PODS-1999-MiloS #query #type inference- Type Inference for Queries on Semistructured Data (TM, DS), pp. 215–226.
SIGMOD-1999-DeutschFS - Storing Semistructured Data with STORED (AD, MFF, DS), pp. 431–442.
SIGMOD-1999-PapakonstantinouV #query- Query Rewriting for Semistructured Data (YP, VV), pp. 455–466.
VLDB-1999-BouganimCDDGS #data flow #data type #multi #web- Miro Web: Integrating Multiple Data Sources through Semistructured Data Types (LB, TCSY, TTDN, JLD, GG, FS), pp. 750–753.
VLDB-1999-DyresonBJ #aspect-oriented #multi #query- Capturing and Querying Multiple Aspects of Semistructured Data (CED, MHB, CSJ), pp. 290–301.
ICML-1999-Teng - Correcting Noisy Data (CMT), pp. 239–248.
KDD-T-1999-Feldman #mining- Mining Unstructured Data (RF), pp. 182–236.
SIGMOD-1998-Cohen #database #integration #query #similarity #using- Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity (WWC), pp. 201–212.
SIGMOD-1998-NestorovAM - Extracting Schema from Semistructured Data (SN, SA, RM), pp. 295–306.
SIGMOD-1998-ReinwaldP #data access #sql- SQL Open Heterogeneous Data Access (BR, HP), pp. 506–507.
VLDB-1998-AbiteboulMRVW #incremental #maintenance- Incremental Maintenance for Materialized Views over Semistructured Data (SA, JM, MR, VV, JLW), pp. 38–49.
VLDB-1998-MiloZ #using- Using Schema Matching to Simplify Heterogeneous Data Translation (TM, SZ), pp. 122–133.
VLDB-1998-OoiGT #database #performance- Fast High-Dimensional Data Search in Incomplete Databases (BCO, CHG, KLT), pp. 357–367.
VLDB-1998-VenkataramanZ #database #optimisation #query- Heterogeneous Database Query Optimization in DB2 Universal DataJoiner (SV, TZ), pp. 685–689.
KDD-1998-PinheiroS #database #mining- Methods for Linking and Mining Massive Heterogeneous Databases (JCP, DXS), pp. 309–313.
DAC-1998-WangBS #named- Potential-NRG: Placement with Incomplete Data (MW, PB, MS), pp. 279–282.
PODS-1997-Buneman - Semistructured Data (PB), pp. 117–121.
SIGMOD-1997-AtzeniT #database #multi #named- MDM: a Multiple-Data-Model Tool for the Management of Heterogeneous Database Schemes (PA, RT), pp. 528–531.
VLDB-1997-GoldmanW #database #named #optimisation #query- DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases (RG, JW), pp. 436–445.
KDD-1997-WangL - Schema Discovery for Semistructured Data (KW, HL), pp. 271–274.
SIGMOD-1996-BunemanDHS #optimisation #query- A Query Language and Optimization Techniques for Unstructured Data (PB, SBD, GGH, DS), pp. 505–516.
SIGMOD-1996-QuassWGHLMNRRAUW #lightweight #named #repository- LORE: A Lightweight Object REpository for Semistructured Data (DQ, JW, RG, KH, QL, JM, SN, AR, HR, SA, JDU, JLW), p. 549.
VLDB-1996-Dyreson #information retrieval- Information Retrieval from an Incomplete Data Cube (CED), pp. 532–543.
VLDB-1996-Levy #database- Obtaining Complete Answers from Incomplete Databases (AYL), pp. 402–412.
VLDB-1996-Suciu #composition #maintenance #query- Query Decomposition and View Maintenance for Query Languages for Unstructured Data (DS), pp. 227–238.
PODS-1995-Libkin #database #normalisation- Normalizing Incomplete Databases (LL), pp. 219–230.
SIGMOD-1995-LiC #database #integration #named #prototype #semantics- Semint: A System Prototype for Semantic Integration in Heterogeneous Databases (WSL, CC), p. 484.
VLDB-1995-MillinerBP #architecture #database #interactive #scalability- A Scalable Architecture for Autonomous Heterogeneous Database Interactions (SM, AB, MPP), pp. 515–526.
KDD-1995-Thiesson #network #quantifier- Accelerated Quantification of Bayesian Networks with Incomplete Data (BT), pp. 306–311.
AdaEurope-1995-Kempe #ada #classification #data type- Heterogeneous Data Structures and Cross-Classification of Objects with Ada95 (MK), pp. 71–80.
SAC-1995-FordTT #visualisation- Supporting heterogeneous data import for data visualization (RF, RT, DT), pp. 81–85.
VLDB-1994-LiC #database #integration #network #semantics #using- Semantic Integration in Heterogeneous Databases Using Neural Networks (WSL, CC), pp. 1–12.
HPDC-1994-CrandallQ #composition- A Decomposition Advisory System for Heterogeneous Data-Parallel Processing (PC, MJQ), pp. 114–121.
VLDB-1990-WangM #database #perspective- A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective (YRW, SEM), pp. 519–538.
VLDB-1985-AbiteboulG #database #semantics- Update Semantics for Incomplete Databases (SA, GG), pp. 1–12.
POPL-1984-Thiel #data type #specification- Stop Losing Sleep Over Incomplete Data Type Specifications (JJT), pp. 76–82.
PODS-1983-Gonnet #database #performance- Unstructured Data Bases or Very Efficient Text Searching (GHG), pp. 117–124.