A Technical Anatomy of Google: The Research Paper by Founders Sergey Brin and Lawrence Page
Apart from the problems of scaling traditional search techniques to data of this magnitude, there are new technical challenges involved with using the additional information present in hypertext to produce better search results. This paper addresses the question of how to build a practical large-scale system which can exploit the additional information present in hypertext. We also look at the problem of how to deal effectively with uncontrolled hypertext collections where anyone can publish anything they want.
Keywords: World Wide Web, Search Engines, Information Retrieval, PageRank, Google
1. Introduction
(Note: There are two versions of this paper: a longer full version and a shorter printed version. The full version is available on the web and the conference CD-ROM.)
The web creates new challenges for information retrieval. The amount of information on the web is growing rapidly, as is the number of new users inexperienced in the art of web research. People are likely to surf the web using its link graph, often starting with high quality human maintained indices such as Yahoo! or with search engines. Human maintained lists cover popular topics effectively but are subjective, expensive to build and maintain, slow to improve, and cannot cover all esoteric topics. Automated search engines that rely on keyword matching usually return too many low quality matches. To make matters worse, some advertisers attempt to gain people's attention by taking measures meant to mislead automated search engines. We have built a large-scale search engine which addresses many of the problems of existing systems. It makes especially heavy use of the additional structure present in hypertext to provide much higher quality search results. We chose our system name, Google, because it is a common spelling of googol, or 10^100, and fits well with our goal of building very large-scale search engines.