Monday, October 6, 2008

Which database does google use?

Someone claims that google use oracle, someone claims that google use MySQL database in order to maintain their search engine indexing. But as far as I know neither is true. Google crawler crawls millions pages over the internet and make Google Spreadsheets and thus form a Database.

So their database is just a flat file which is visible and readable by everyone.

You can think it as a normal excel file with some fields and under the fields the entry of different websites and their content are put. If crawler detects any changes in the page then the entry under the webpage is modified.

Google uses it's own API in order to find out any matches for searched words from the spreadsheets.

Thus for search engine indexing google uses spreadsheets.

But google's other application like google mail, blogspot, google doc etc uses a database. They use MySQL and or oracle for other applications. Of course they modified the version of database and use that one.

2 comments:

  1. Google uses their own Db engine they call BigTable. They have published a whitepaper at http://labs.google.com/papers/bigtable-osdi06.pdf.

    ReplyDelete
  2. of course that they use BigTable and I don't know how someone can even think of storing petabytes of data to some so called spreadsheets! use imagination first of all you'll be unable to store that amount of data and second is for indexing depending of their machines will be needed several hundreds years !!! and when billion users type some query you should lock file for each one and what we got... ole For other services they use same BIGtable.

    ReplyDelete