|
Lots of LinuxGoogle knows Linux will crunch it's data
4,000 Linux Servers Used as Backbone of Google Search Engineby Dave MurphyISSN 1535-3613
The choice to use Linux rather than Windows NT/2000 will save Google over $6 million this year in software cost alone. Overall, I estimate the savings will be more than double that, because Linux is cheaper to buy, more quickly installed, and requires less physical periodic system maintenance. "The hypertext analysis is computationally expensive," said Sergey Brin, founder and president of Google.com. "We need to have an efficient system for doing that. That's why we use a lot of cheap PCs. It's a cheaper platform. The dollar per MIPS is better for PCs." The Linux systems will be used to rank the importance of submitted webpages by counting how many referential links to that page exist and the importance of the referential pages. The system will also conduct a hypertext analysis to determine where keywords are located on submitted pages. This work is computationally intensive, with 500 million variables and 2 million terms in a search equation to index the web, performed about every month, resulting in about 1 TB of data to index 300 million webpages. One terabyte (TB) is the equivalent of 1,024 gigabytes (2^40 bytes). Google has in-house talent to maintain the Linux servers, and it values the ability to look at the source code of the operating system and applications to correct problems as they appear. Linux allows the Google staff to be less reliant on external vendors.
Call for CommentsWhat do you think? Leave your comments on the message center.
ReferencesGoogleRed Hat Message Center
Damar Group, Ltd. helps business use technology. ITINFO is again accepting sponsors. Sponsor messages are included in ITINFO's email newsletter and are permanently posted to DGL's website and online reference areas. ITINFO is an electronic publication of Damar Group, Ltd., publisher of Training Express computer learning guides. Comments and submissions to info@dgl.com. Previous issues are on our website at http://dgl.com/itinfo/.
updated May 31, 2000
Return to DGL homepage |