BibSLEIGH
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Travelled to:
1 × Australia
1 × Austria
1 × Canada
1 × France
1 × Norway
5 × USA
Collaborated with:
J.F.Naughton R.Ramakrishnan X.Chai Y.Chiang W.Shen A.Baid S.Vaithyanathan N.Rampalli M.J.Franklin M.Sayyadian A.Rosenthal T.Chen Y.Lee F.Chen E.Chu P.DeRose J.W.Shavlik A.Rajaraman S.Das R.McCann C.Sun F.Yang D.Kossmann T.Kraska F.Niu C.Ré B.Vuong J.Huang D.Burdick W.Wu C.T.Yu W.Meng D.S.Lamba S.Subramaniam V.Harinarayan S.Amer-Yahia J.M.Kleinberg N.Koudas I.Rae J.Li B.J.Gao J.Yang L.Seligman R.Dhamankar A.Y.Halevy P.M.Domingos W.Lam L.Liu S.Prasad Z.Vacheri B.K.AlShebli Q.Le H.Nguyen L.Vu C.Gokhale X.Zhu O.Deshpande M.Tourn A.Gattani N.Garera M.Tiwari P.S.G.C. K.G.K. H.Zhang S.Prasad E.Arcaute G.Krishnan R.Deep V.Raghavendra
Talks about:
data (11) extract (9) use (5) approach (4) integr (4) inform (4) entiti (4) queri (4) match (4) crowdsourc (3)

Person: AnHai Doan

DBLP DBLP: Doan:AnHai

Facilitated 1 volumes:

SIGMOD 2003Ed

Contributed to:

SIGMOD 20152015
SIGMOD 20142014
VLDB 20142014
SIGMOD 20132013
VLDB 20132013
VLDB 20122012
VLDB 20112011
SIGMOD 20102010
VLDB 20102010
SIGMOD 20092009
SIGMOD 20082008
VLDB 20082008
VLDB 20072007
SIGMOD 20062006
VLDB 20052005
SIGMOD 20042004

Wrote 27 papers:

SIGMOD-2015-CSKZYRPAKDRD #big data #industrial #what #why
Why Big Data Industrial Systems Need Rules and What We Can Do About It (PSGC, CS, KGK, HZ, FY, NR, SP, EA, GK, RD, VR, AD), pp. 265–276.
SIGMOD-2014-ChiangDN #evolution #modelling
Modeling entity evolution for temporal record matching (YHC, AD, JFN), pp. 1175–1186.
SIGMOD-2014-GokhaleDDNRSZ #crowdsourcing #named
Corleone: hands-off crowdsourcing for entity matching (CG, SD, AD, JFN, NR, JWS, XZ), pp. 601–612.
VLDB-2014-ChiangDN #algorithm #performance
Tracking Entities in the Dynamic World: A Fast Algorithm for Matching Temporal Records (YHC, AD, JFN), pp. 469–480.
VLDB-2014-SunRYD #classification #crowdsourcing #machine learning #named #scalability #using
Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing (CS, NR, FY, AD), pp. 1529–1540.
SIGMOD-2013-DeshpandeLTDSRHD #knowledge base #maintenance #using
Building, maintaining, and using knowledge bases: a report from the trenches (OD, DSL, MT, SD, SS, AR, VH, AD), pp. 1209–1220.
VLDB-2013-GattaniLGTCDSRHD #approach #classification #social #social media
Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach (AG, DSL, NG, MT, XC, SD, SS, AR, VH, AD), pp. 1126–1137.
VLDB-2012-LamLPRVD #named #performance
Muppet: MapReduce-Style Processing of Fast Data (WL, LL, SP, AR, ZV, AD), pp. 1814–1825.
VLDB-2011-DoanFKK #crowdsourcing #data transformation #perspective #platform
Crowdsourcing Applications and Platforms: A Data Management Perspective (AD, MJF, DK, TK), pp. 1508–1509.
VLDB-2011-NiuRDS #logic #markov #named #network #scalability #statistics #using
Tuffy: Scaling up Statistical Inference in Markov Logic Networks using an RDBMS (FN, CR, AD, JWS), pp. 373–384.
SIGMOD-2010-Amer-YahiaDKKF #algorithm #big data
Crowds, clouds, and algorithms: exploring the human side of “big data” applications (SAY, AD, JMK, NK, MJF), pp. 1259–1260.
VLDB-2010-BaidRLDN #keyword #relational #scalability #towards
Toward Scalable Keyword Search over Relational Data (AB, IR, JL, AD, JFN), pp. 140–149.
SIGMOD-2009-ChaiVDN #feedback #information management #integration #source code
Efficiently incorporating user feedback into information extraction and integration programs (XC, BQV, AD, JFN), pp. 87–100.
SIGMOD-2009-ChenGDYR #evolution #optimisation #source code
Optimizing complex extraction programs over evolving text data (FC, BJG, AD, JY, RR), pp. 321–334.
SIGMOD-2009-ChuBCDN #ad hoc #database #keyword #query
Combining keyword search and forms for ad hoc querying of databases (EC, AB, XC, AD, JFN), pp. 349–360.
SIGMOD-2008-ShenDMDR #information management #towards
Toward best-effort information extraction (WS, PD, RM, AD, RR), pp. 1031–1042.
VLDB-2008-ChaiSDRS #integration
Analyzing and revising data integration schemas to improve their matchability (XC, MS, AD, AR, LS), pp. 773–784.
VLDB-2008-HuangCDN #on the #query
On the provenance of non-answers to queries over extracted data (JH, TC, AD, JFN), pp. 736–747.
VLDB-2007-BurdickDRV #constraints
OLAP over Imprecise Data with Domain Constraints (DB, AD, RR, SV), pp. 39–50.
VLDB-2007-ChuBCDN #approach #incremental #query #relational #semistructured data
A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data (EC, AB, TC, AD, JFN), pp. 1045–1056.
VLDB-2007-DeRoseSCDR #approach #community #composition #incremental #top-down #web
Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach (PD, WS, FC, AD, RR), pp. 399–410.
VLDB-2007-ShenDNR #datalog #declarative #embedded #information management #using
Declarative Information Extraction Using Datalog with Embedded Extraction Predicates (WS, AD, JFN, RR), pp. 1033–1044.
SIGMOD-2006-DoanRV #information management #research #state of the art
Managing information extraction: state of the art and research directions (AD, RR, SV), pp. 799–800.
VLDB-2005-McCannALNVD #integration #maintenance
Mapping Maintenance for Data Integration Systems (RM, BKA, QL, HN, LV, AD), pp. 1018–1030.
VLDB-2005-SayyadianLDR #using
Tuning Schema Matching Software using Synthetic Scenarios (MS, YL, AD, AR), pp. 994–1005.
SIGMOD-2004-LeeDDHD #database #named
iMAP: Discovering Complex Mappings between Database Schemas (RD, YL, AD, AYH, PMD), pp. 383–394.
SIGMOD-2004-WuYDM #approach #clustering #interactive #interface #query #web
An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web (WW, CTY, AD, WM), pp. 95–106.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.