David S. Rosenblum

ROAR: Increasing the Flexibility and Performance of Distributed Search

Costin Raiciu, Felipe Huici, Mark Handley, and David S. Rosenblum
Appears in: CCR October 2009

To search the web quickly, search engines partition the web index over many machines, and consult every partition when answering a query. To increase throughput, replicas are added for each of these machines. The key parameter of these algorithms is the trade-off between replication and partitioning: increasing the partitioning level improves query completion time since more servers handle the query, but may incur non-negligible startup costs for each subquery. Finding the right operating point and adapting to it can significantly improve performance and reduce costs.
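The trade-off described above can be illustrated with a toy cost model. This is a minimal sketch under stated assumptions, not the method from the paper: it assumes each subquery pays a fixed startup cost and scans an equal share of the index in parallel, and that a query ties up one server per partition. The function names and the parameters `total_work` and `startup` are hypothetical and chosen only for illustration.

```python
def query_delay(p, total_work=1.0, startup=0.01):
    """Assumed model: p subqueries run in parallel, each paying a fixed
    startup cost and scanning 1/p of the index."""
    return startup + total_work / p


def server_time_per_query(p, total_work=1.0, startup=0.01):
    """Total server time one query consumes across all p partitions.
    The startup cost is paid once per subquery, so it grows with p."""
    return p * startup + total_work


def max_throughput(n_servers, p, total_work=1.0, startup=0.01):
    """Queries per unit time that n_servers can sustain at partitioning level p."""
    return n_servers / server_time_per_query(p, total_work, startup)


def choose_partitioning(n_servers, delay_target, total_work=1.0, startup=0.01):
    """Return the smallest partitioning level that meets the delay target.
    In this model throughput falls as p grows, so the smallest feasible p
    is also the cheapest operating point. Returns None if no level fits."""
    for p in range(1, n_servers + 1):
        if query_delay(p, total_work, startup) <= delay_target:
            return p
    return None


if __name__ == "__main__":
    n = 120
    p = choose_partitioning(n, delay_target=0.1)
    print("partitions:", p, "replicas per partition:", n // p)
    print("delay:", query_delay(p), "throughput:", max_throughput(n, p))
```

In this model, raising the partitioning level always shortens an individual query but lowers the throughput the same set of servers can sustain, because the per-subquery startup cost is paid more times per query. The remaining servers beyond one per partition serve as replicas.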
