This post discusses the high-level architecture of Google Search.
Google Search relies on web crawlers.
Web crawling is the process of downloading web pages.
Crawling is not done by a single crawler; it is done by several distributed crawlers.
Crawling starts with the URL Server, which sends the crawlers lists of URLs to be fetched.
The fetched web pages are forwarded to the Storeserver.
The Storeserver compresses the web pages and stores them in a repository.
Every web page has a URL and is assigned a unique ID, known as its docID.
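To make the repository idea concrete, here is a minimal sketch in Python. The `Repository` class, its method names, and the example URLs are assumptions made for illustration; they are not Google's actual code. It only shows the two ideas described above: every URL gets a unique docID, and pages are kept compressed.

```python
import zlib


class Repository:
    """Toy page store: one docID per URL, pages kept compressed."""

    def __init__(self):
        self.doc_ids = {}  # URL -> docID
        self.pages = {}    # docID -> compressed page bytes

    def store(self, url, html):
        # Reuse the docID if the URL was seen before, else assign the next one.
        doc_id = self.doc_ids.setdefault(url, len(self.doc_ids))
        self.pages[doc_id] = zlib.compress(html.encode("utf-8"))
        return doc_id

    def load(self, doc_id):
        # Uncompress the stored page, as the Indexer would before parsing it.
        return zlib.decompress(self.pages[doc_id]).decode("utf-8")


repo = Repository()
first = repo.store("http://example.com/", "<html><body>Hello</body></html>")
second = repo.store("http://example.com/about", "<html><body>About</body></html>")
```

Storing the same URL again returns the same docID, so a page keeps its identity across repeated crawls.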
Indexing and sorting are performed by the Indexer and the Sorter shown in the image.
The Indexer reads the repository, uncompresses the documents, and parses them.
Each document is converted into a set of word occurrences called hits.
A hit records the word, its position in the document, an approximation of its font size, and its capitalization.
The Indexer distributes the hits into a set of barrels, creating a partially sorted forward index.
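The hit and barrel steps can be sketched roughly as follows. This is a simplified illustration, not the real Indexer: the function names, the number of barrels, and the rule for choosing a barrel (wordID modulo the barrel count) are all assumptions, and font size is left out.

```python
import re
from collections import defaultdict

NUM_BARRELS = 4  # assumed value, only for this sketch

lexicon = {}  # word -> wordID


def word_id(word):
    # Assign the next free wordID the first time a word is seen.
    return lexicon.setdefault(word, len(lexicon))


def make_hits(text):
    """Return one (wordID, position, is_capitalized) tuple per word occurrence."""
    hits = []
    for position, match in enumerate(re.finditer(r"[A-Za-z0-9]+", text)):
        token = match.group()
        hits.append((word_id(token.lower()), position, token[0].isupper()))
    return hits


def add_to_barrels(barrels, doc_id, hits):
    # Forward index: each barrel maps docID -> hits for the wordIDs it holds.
    for wid, position, capitalized in hits:
        barrels[wid % NUM_BARRELS][doc_id].append((wid, position, capitalized))


barrels = [defaultdict(list) for _ in range(NUM_BARRELS)]
hits = make_hits("Google indexes the Web")
add_to_barrels(barrels, 0, hits)
```

Each barrel is keyed by docID, which is why the result is called a forward index: it goes from a document to the words it contains.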
The Indexer also parses every link in each web page and stores the important information about it in an anchors file.
The anchors file contains enough information to determine where each link points from and to, along with the text of the link.
As the image shows, the information from the Anchors box is forwarded to the URL Resolver.
The URL Resolver reads the anchors file and converts relative URLs into absolute URLs, and these in turn into docIDs.
It puts the anchor text into the forward index, associated with the docID that the anchor points to.
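The conversion from a relative URL to an absolute URL, and then to a docID, might look like the sketch below. It uses `urljoin` from Python's standard library; the `doc_id_for` and `resolve_anchor` helpers and the example URLs are made up for illustration.

```python
from urllib.parse import urljoin

doc_ids = {}  # absolute URL -> docID


def doc_id_for(url):
    # Assign the next free docID the first time a URL is seen.
    return doc_ids.setdefault(url, len(doc_ids))


def resolve_anchor(base_url, href):
    """Turn a link found on base_url into (absolute URL, docID)."""
    absolute = urljoin(base_url, href)
    return absolute, doc_id_for(absolute)


url, doc_id = resolve_anchor("http://example.com/docs/index.html", "../about.html")
```

Two different relative links that point to the same page resolve to the same absolute URL, and therefore to the same docID.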
The URL Resolver also generates a database of links, stored as pairs of docIDs, which is used to compute the PageRank of every document.
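As a rough illustration of how a ranking can be computed from pairs of docIDs, here is a small iterative PageRank sketch. The damping factor of 0.85, the fixed number of iterations, the handling of pages without outgoing links, and the sample links are all assumptions chosen for this example; the production computation is certainly more involved.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links: list of (source docID, target docID) pairs. Returns docID -> rank."""
    nodes = sorted({doc for pair in links for doc in pair})
    out_links = {node: [] for node in nodes}
    for src, dst in links:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for src in nodes:
            # A page with no outgoing links shares its rank with every page.
            targets = out_links[src] or nodes
            share = damping * rank[src] / len(targets)
            for dst in targets:
                new_rank[dst] += share
        rank = new_rank
    return rank


links = [(0, 1), (1, 2), (2, 0), (3, 0)]
ranks = pagerank(links)
```

In this sample, document 0 receives links from two pages and ends up with the highest rank, while document 3, which nobody links to, ends up with the lowest.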
The Sorter takes the barrels, which are sorted by docID, and re-sorts them by wordID to generate the inverted index.
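The re-sorting step is easier to see with a tiny example. The sketch below assumes a barrel is a plain dictionary from docID to a list of (wordID, position) pairs; the `invert` function and the sample data are hypothetical.

```python
from collections import defaultdict


def invert(forward_barrel):
    """Turn docID -> [(wordID, position)] into wordID -> [(docID, position)]."""
    inverted = defaultdict(list)
    for doc_id, hits in forward_barrel.items():
        for wid, position in hits:
            inverted[wid].append((doc_id, position))
    # Order the words by wordID and each word's postings by docID.
    return {wid: sorted(postings) for wid, postings in sorted(inverted.items())}


forward = {0: [(7, 0), (9, 1)], 1: [(9, 0), (7, 3)]}
inverted = invert(forward)
```

After inversion, looking up a wordID gives every document that contains the word, which is what a search query needs.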
The Sorter also produces a list of wordIDs and offsets into the inverted index. A program called DumpLexicon takes this list together with the lexicon (a kind of vocabulary) produced by the Indexer and generates a new lexicon for the Searcher to use.
The Searcher is run by a web server. It uses the lexicon built by DumpLexicon, the inverted index, and the PageRanks of documents together to answer users' search queries.
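Putting the three pieces together, a very simplified Searcher could work like the sketch below. The `search` function, the sample lexicon, the index, and the rank values are all invented for illustration, and ordering results by PageRank alone is an assumption; a real search engine combines many more signals.

```python
def search(query, lexicon, inverted_index, pagerank):
    """Return docIDs containing every query word, highest PageRank first."""
    doc_sets = []
    for word in query.lower().split():
        wid = lexicon.get(word)
        if wid is None:
            return []  # a word unknown to the lexicon matches no document
        doc_sets.append({doc_id for doc_id, _pos in inverted_index.get(wid, [])})
    if not doc_sets:
        return []
    matches = set.intersection(*doc_sets)
    # Ties are broken by docID so the order is stable.
    return sorted(matches, key=lambda doc_id: (-pagerank.get(doc_id, 0.0), doc_id))


lexicon = {"google": 0, "search": 1}
inverted_index = {0: [(0, 0), (1, 4), (2, 1)], 1: [(1, 5), (2, 2)]}
pagerank = {0: 0.5, 1: 0.2, 2: 0.3}
results = search("Google search", lexicon, inverted_index, pagerank)
```

Here documents 1 and 2 contain both words, and document 2 is listed first because its rank is higher.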
This is how the architecture of Google Search works.
So far we have learned the basics of Digital Marketing technology. If you have landed directly on this lesson, I suggest you first look at our previous lessons on the Basics of Digital Marketing.
If you are following the tutorial, or want to read about the different types of websites and CMS tools, continue reading.
As a Digital Marketing expert, you must be aware of the structure of websites and the technology used to create them. The base of every website is HTML (HyperText Markup Language). Web browsers such as Chrome, Mozilla Firefox, and Opera Mini render HTML to display a page. If you look at the page source of any web page (press Ctrl + U on your keyboard while viewing it), you will find the basic HTML code.
You should be familiar with the basic structure of HTML. If you are not, don't worry; we will deal with that in the later chapters of this tutorial.
Different Types of Websites
Based on its basic functionality, a website can be categorized as one of the following:
This type of website is also called a Content Management System (CMS). There are many CMS tools available on the internet. WordPress, Joomla, and Drupal are well-known CMS tools that can be used to change the content of a website and to add pages, posts, media, and so on. The above-mentioned coding languages and frameworks can also be used to create such dynamic websites. These websites have a database. MySQL, Microsoft SQL Server, MongoDB, and SQLite are some commonly used databases in dynamic applications. A database is needed to store the content and to retrieve the data displayed in the pages.
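To see why a dynamic website needs a database, consider this minimal sketch using SQLite through Python's built-in `sqlite3` module. The `posts` table, its columns, and the `render_post` function are assumptions made for the example; real CMS tools such as WordPress use their own, much larger schemas.

```python
import sqlite3
from html import escape

# An in-memory database stands in for the website's real database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE posts (id INTEGER PRIMARY KEY, title TEXT, body TEXT)")
conn.execute("INSERT INTO posts (title, body) VALUES (?, ?)", ("Hello", "First post"))
conn.commit()


def render_post(conn, post_id):
    """Fetch one post from the database and build the HTML shown to the visitor."""
    row = conn.execute(
        "SELECT title, body FROM posts WHERE id = ?", (post_id,)
    ).fetchone()
    if row is None:
        return "<h1>Not found</h1>"
    title, body = row
    return f"<h1>{escape(title)}</h1><p>{escape(body)}</p>"


page = render_post(conn, 1)
```

Changing the row in the database changes the page that visitors see, without anyone editing an HTML file. That is the core idea behind a CMS.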
3. Social Networking and Community Websites
Social networking and community websites are among the most commonly used websites nowadays. A famous example is Facebook, which is used by billions of people. Other well-known names are Twitter and LinkedIn. These websites provide a platform where people who share common interests can connect with each other. A college website can also work as a social networking website where new and old students interact with each other. Users can create groups, pages, forums, and so on, and people interested in the content of a group can join it and participate in its posts. Matrimonial websites are also social networking sites, where one can look for a suitable match. You can use various filters to find people, and send messages and friend requests to connect with them. These websites are among the biggest channels for Digital Marketing.
4. E-Commerce Websites
E-Commerce stands for Electronic Commerce, also called Online Commerce. It refers to commercial activities that are performed online. Many startups are built around selling products and services online. People book rail tickets, flight tickets, and hotels online. Shopping online for groceries, electronics, apparel, and almost everything else is very common nowadays. A Digital Marketing expert must be aware of the front-end (client / user) and back-end (administrator) operations of these websites. You must be familiar with the basic operations of the travel industry, trading, stocks, inventory, categories of goods, and so on.
5. Portfolio Websites
Portfolio websites are created by or for artists, music bands, authors, photographers, or any professional service provider. A portfolio website contains introductory information about the person or enterprise, their professional experience, and, above all, details of their work, with images, videos, trials, and so on. Before planning a Digital Marketing strategy for a portfolio website, in-depth keyword research, competitor analysis, and an understanding of the basics of the services offered are very important.
6. Classifieds Websites
Classifieds websites are websites that allow users to post classified advertisements. "Classified" here means categorized: these websites have various categories and subcategories. Users post Buy, Sell, Rent, or Requirement advertisements on online classifieds websites. These ads can be searched using specific keywords and category filters. Information such as the business or product name, email, phone number, and business address is usually collected when an ad is posted. An example of a classifieds website developed by us is www.Trademyad.com.
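The keyword and category search described above can be pictured with a short sketch. The sample ads, their field names, and the `search_ads` function are invented for illustration and do not describe how any particular classifieds website is built.

```python
ads = [
    {"title": "Used bicycle for sale", "category": "Sell"},
    {"title": "Looking for a bicycle", "category": "Buy"},
    {"title": "Two-room flat near station", "category": "Rent"},
]


def search_ads(ads, keyword, category=None):
    """Return ads whose title contains the keyword, optionally within one category."""
    keyword = keyword.lower()
    return [
        ad
        for ad in ads
        if keyword in ad["title"].lower()
        and (category is None or ad["category"] == category)
    ]


all_bicycles = search_ads(ads, "bicycle")
bicycles_for_sale = search_ads(ads, "Bicycle", category="Sell")
```

A keyword alone returns both bicycle ads, while adding the Sell category narrows the result to a single ad.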