Amit Agarwal, Hema Swetha Koppula, Krishna P. Leela, Krishna Prasad Chitrapura, Sachin Garg, Pavan Kumar GM, Chittaranjan Haty, Anirban Roy, Amit Sasturkar
URL normalization for de-duplication of web pages
CIKM, 2009.
@inproceedings{CIKM-2009-AgarwalKLCGGHRS, author = "Amit Agarwal and Hema Swetha Koppula and Krishna P. Leela and Krishna Prasad Chitrapura and Sachin Garg and Pavan Kumar GM and Chittaranjan Haty and Anirban Roy and Amit Sasturkar", booktitle = "{Proceedings of the 18th ACM International Conference on Information and Knowledge Management}", doi = "10.1145/1645953.1646283", isbn = "978-1-60558-512-3", pages = "1987--1990", publisher = "{ACM}", title = "{URL normalization for de-duplication of web pages}", year = 2009, }