Assigning documents to master sites in distributed search

Publication
Oct 28, 2011
Abstract

An appealing solution to scale Web search with the growth of the Internet is the use of distributed architectures. Distributed search engines rely on multiple sites deployed in distant regions across the world, where each site is special- ized to serve queries issued by the users of its region. This paper investigates the problem of assigning each document to a master site. We show that by leveraging similarities between a document and the activity of the users, we can accurately detect which site is the most relevant to place a document. We conduct various experiments using two docu- ment assignment approaches, showing performance improvements of up to 20.8% over a baseline technique which assigns the documents to search sites based on their language.

  • Proceedings of the 20th ACM Conference on Information and Knowledge Management, CIKM 2011
  • Conference/Workshop Paper

BibTeX