Web2Text: Deep Structured Boilerplate Removal
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter

Thijs Vogels, Octavian-Eugen Ganea, Carsten Eickhoff
Web2Text: Deep Structured Boilerplate Removal
ECIR, 2018.

ECIR 2018
DBLP
Scholar
DOI
Full names Links ISxN
@inproceedings{ECIR-2018-VogelsGE,
	author        = "Thijs Vogels and Octavian-Eugen Ganea and Carsten Eickhoff",
	booktitle     = "{Proceedings of the 40th European Conference on Information Retrieval Research: Advances in Information Retrieval}",
	doi           = "10.1007/978-3-319-76941-7_13",
	isbn          = "['978-3-319-76940-0', '978-3-319-76941-7']",
	pages         = "167--179",
	publisher     = "{Springer}",
	title         = "{Web2Text: Deep Structured Boilerplate Removal}",
	year          = 2018,
}

Tags:



Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.