摘要:This study begins with a review on the research status of search engine, followed by discussion on goals of search engine and then the principle of distributed computing is explained. Consequently the MapReduce distributed computing model and the Hadoop Distributed File System (HDFS) are analyzed in detail. Finally the distributed search engine architecture is presented. On the basis of the architecture, future challenges and opportunities of the distributed search engine are highlighted.