Travelled to:
1 × Australia
1 × Austria
1 × Canada
1 × France
1 × Norway
5 × USA
Collaborated with:
J.F.Naughton R.Ramakrishnan X.Chai Y.Chiang W.Shen A.Baid S.Vaithyanathan N.Rampalli M.J.Franklin M.Sayyadian A.Rosenthal T.Chen Y.Lee F.Chen E.Chu P.DeRose J.W.Shavlik A.Rajaraman S.Das R.McCann C.Sun F.Yang D.Kossmann T.Kraska F.Niu C.Ré B.Vuong J.Huang D.Burdick W.Wu C.T.Yu W.Meng D.S.Lamba S.Subramaniam V.Harinarayan S.Amer-Yahia J.M.Kleinberg N.Koudas I.Rae J.Li B.J.Gao J.Yang L.Seligman R.Dhamankar A.Y.Halevy P.M.Domingos W.Lam L.Liu S.Prasad Z.Vacheri B.K.AlShebli Q.Le H.Nguyen L.Vu C.Gokhale X.Zhu O.Deshpande M.Tourn A.Gattani N.Garera M.Tiwari P.S.G.C. K.G.K. H.Zhang S.Prasad E.Arcaute G.Krishnan R.Deep V.Raghavendra
Talks about:
data (11) extract (9) use (5) approach (4) integr (4) inform (4) entiti (4) queri (4) match (4) crowdsourc (3)
Person: AnHai Doan
DBLP: Doan:AnHai
Facilitated 1 volumes:
Contributed to:
Wrote 27 papers:
- SIGMOD-2015-CSKZYRPAKDRD #big data #industrial #what #why
- Why Big Data Industrial Systems Need Rules and What We Can Do About It (PSGC, CS, KGK, HZ, FY, NR, SP, EA, GK, RD, VR, AD), pp. 265–276.
- SIGMOD-2014-ChiangDN #evolution #modelling
- Modeling entity evolution for temporal record matching (YHC, AD, JFN), pp. 1175–1186.
- SIGMOD-2014-GokhaleDDNRSZ #crowdsourcing #named
- Corleone: hands-off crowdsourcing for entity matching (CG, SD, AD, JFN, NR, JWS, XZ), pp. 601–612.
- VLDB-2014-ChiangDN #algorithm #performance
- Tracking Entities in the Dynamic World: A Fast Algorithm for Matching Temporal Records (YHC, AD, JFN), pp. 469–480.
- VLDB-2014-SunRYD #classification #crowdsourcing #machine learning #named #scalability #using
- Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing (CS, NR, FY, AD), pp. 1529–1540.
- SIGMOD-2013-DeshpandeLTDSRHD #knowledge base #maintenance #using
- Building, maintaining, and using knowledge bases: a report from the trenches (OD, DSL, MT, SD, SS, AR, VH, AD), pp. 1209–1220.
- VLDB-2013-GattaniLGTCDSRHD #approach #classification #social #social media
- Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach (AG, DSL, NG, MT, XC, SD, SS, AR, VH, AD), pp. 1126–1137.
- VLDB-2012-LamLPRVD #named #performance
- Muppet: MapReduce-Style Processing of Fast Data (WL, LL, SP, AR, ZV, AD), pp. 1814–1825.
- VLDB-2011-DoanFKK #crowdsourcing #data transformation #perspective #platform
- Crowdsourcing Applications and Platforms: A Data Management Perspective (AD, MJF, DK, TK), pp. 1508–1509.
- VLDB-2011-NiuRDS #logic #markov #named #network #scalability #statistics #using
- Tuffy: Scaling up Statistical Inference in Markov Logic Networks using an RDBMS (FN, CR, AD, JWS), pp. 373–384.
- SIGMOD-2010-Amer-YahiaDKKF #algorithm #big data
- Crowds, clouds, and algorithms: exploring the human side of “big data” applications (SAY, AD, JMK, NK, MJF), pp. 1259–1260.
- VLDB-2010-BaidRLDN #keyword #relational #scalability #towards
- Toward Scalable Keyword Search over Relational Data (AB, IR, JL, AD, JFN), pp. 140–149.
- SIGMOD-2009-ChaiVDN #feedback #information management #integration #source code
- Efficiently incorporating user feedback into information extraction and integration programs (XC, BQV, AD, JFN), pp. 87–100.
- SIGMOD-2009-ChenGDYR #evolution #optimisation #source code
- Optimizing complex extraction programs over evolving text data (FC, BJG, AD, JY, RR), pp. 321–334.
- SIGMOD-2009-ChuBCDN #ad hoc #database #keyword #query
- Combining keyword search and forms for ad hoc querying of databases (EC, AB, XC, AD, JFN), pp. 349–360.
- SIGMOD-2008-ShenDMDR #information management #towards
- Toward best-effort information extraction (WS, PD, RM, AD, RR), pp. 1031–1042.
- VLDB-2008-ChaiSDRS #integration
- Analyzing and revising data integration schemas to improve their matchability (XC, MS, AD, AR, LS), pp. 773–784.
- VLDB-2008-HuangCDN #on the #query
- On the provenance of non-answers to queries over extracted data (JH, TC, AD, JFN), pp. 736–747.
- VLDB-2007-BurdickDRV #constraints
- OLAP over Imprecise Data with Domain Constraints (DB, AD, RR, SV), pp. 39–50.
- VLDB-2007-ChuBCDN #approach #incremental #query #relational #semistructured data
- A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data (EC, AB, TC, AD, JFN), pp. 1045–1056.
- VLDB-2007-DeRoseSCDR #approach #community #composition #incremental #top-down #web
- Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach (PD, WS, FC, AD, RR), pp. 399–410.
- VLDB-2007-ShenDNR #datalog #declarative #embedded #information management #using
- Declarative Information Extraction Using Datalog with Embedded Extraction Predicates (WS, AD, JFN, RR), pp. 1033–1044.
- SIGMOD-2006-DoanRV #information management #research #state of the art
- Managing information extraction: state of the art and research directions (AD, RR, SV), pp. 799–800.
- VLDB-2005-McCannALNVD #integration #maintenance
- Mapping Maintenance for Data Integration Systems (RM, BKA, QL, HN, LV, AD), pp. 1018–1030.
- VLDB-2005-SayyadianLDR #using
- Tuning Schema Matching Software using Synthetic Scenarios (MS, YL, AD, AR), pp. 994–1005.
- SIGMOD-2004-LeeDDHD #database #named
- iMAP: Discovering Complex Mappings between Database Schemas (RD, YL, AD, AYH, PMD), pp. 383–394.
- SIGMOD-2004-WuYDM #approach #clustering #interactive #interface #query #web
- An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web (WW, CTY, AD, WM), pp. 95–106.