Contacts

Functions Search Engines. What is a search engine? The concept and function of the search engine

From five separate software components consist of search engines, namely:

  • Spider (spider): His task is to download Web Pages; A program that is similar to the Web browser.
  • Crawler.: Spider, which is called "traveling"; It automatically goes to all links that were found on the page.
  • Indexer (indexer): A program called "blind"; Its task to analyze Web pages that were downloaded by spiders.
  • Database (database): It is a repository of pages that were first downloaded, and then treated.
  • System of issuing results (sEARCH ENGINE RESULTS ENGINE): this system Helps retrieve search results from the database.

Read more about each of the search engine component

Spider: Spider - His task is simple - download Web pages. The principle of his work is not different from your browser, if you simply connect with the site and start uploading a page. The visualization of the spider is absent. A similar situation (downloading) can be seen when you start viewing some page and choose in your Web browser "View HTML code".

Crawler.: Like a spider, he also downloads pages, also in its functions there are "stripping" of pages and finding all links. This is his task - to determine where the spider must move on, it is based only on links or using a predetermined address list.

Indexer: Indexer helps to disassemble the page to different parts of it and analyze them. The headers, elements of any page titles, text, links, elements of Bold, Italic, structural elements, and other style pieces of the page are selected and analyzed.

Database: Database is a repository of any data that the search engine is going to download and analyze. In most cases, this requires huge resources.

System of issuing results: SEARCH ENGINE RESULTS Engine is the heart of the search engine. It is this system that will decide which pages will satisfy each request for a regular user. With this part of the search engine and the search is carried out.

If the user has entered the keyword and started the search, the search engine begins to select results, based on constantly changing criteria. The method according to which the search engine takes any solutions is called an algorithm. "Algos" - this term sometimes use professional - this is what we are talking about.

Search criteria in the formation of issuing search engines

Even due to the fact that the search engines have changed very much, most of them are in our time selects the search results, based on these criteria:

  • Title (Title): Is there a keyword in the title?
  • Domain / address (Domain / URL): Is there a keyword in the domain address or in the domain name?
  • Style (Style): Head headers, Rounding (I or EM), fat (B or Strong): Is there a place on the page where the keyword is used in the mean, oily, or HX (H1, H2, ...) text headers?
  • Density (DENSITY): How often is the keyword is used on the page? The keyword density is the number of keywords regarding the page text.
  • Meta Data (MetainFormation): Although many are denied, but some search engines today are still read by META description and meta description keywords (Meta Keywords).
  • Links outside (Outbound Links): Where are the links on the page, and is there a keyword link in the text?
  • External links (InBound Links): Who else on the Internet there is a link to this site? What is the link text? The author of the page is not in each case can control this criterion, therefore it is called "non-disabilities".
  • Links inside page (Insite Links): Does the page contain a link to some other pages of this site?

As a result, we see that the search engine must be able to make many clarifying requests using the entire page of the entire page.

This article is only a reduced description of the functioning. search engines.

The most popular web service of modernity is the search engine. Everything is explained here, because those times when representatives of the first Internet users could observe new items in the network long ago left.

The information appears and accumulates so much that the person has become very difficult to find exactly the one that he would be needed. Imagine, as if a search on the Internet, if an ordinary user would have to look for information not to understand where. It is not that we do not understand where, because you can't find a lot of information for a manual search.

Search engine, what is it?

Well, if the user is already known to know sites on which it is possible to have the necessary information, but what to do otherwise? In order to facilitate the life of a person in the search necessary information In the Internet and search engines or just search engines were invented. The search engine performs one very important function, without which the Internet would not like it as we used to see - this is a search for information on the network.

Search system - This is a special web node or a different site that provides users with a hyperlink to pages, sites that meet the specified search query.

To be slightly more accurate, then search for information on the Internet, which is carried out by software and hardware functional setting and web interface for interacting with users.

To interact a person with a search engine and a web interface was created, that is, the visible and understandable shell. This approach of development developers facilitates the search for many people. As a rule, it is on the Internet that a search is carried out using search engines, but also there are search systems for FTP servers, individual types of goods in the World Wide Web, or news information or other search directions.

The search can be carried out not only by text filling sites, but also by other types of information that a person can search: images, video, sound files etc.

How is the search for the search engine?

The search itself is on the Internet, exactly the same as viewing web sites is possible with the Internet browser Internet browser. Only after the user asked his query in the search bar, the search itself is directly.

Any search engine contains a software part on which the entire search engine is based, it is called the search engine - this is a software package and providing the ability to search for information. After turning to the search engine, the formation of a person's search query and enter it into the search string, the search engine generates a page with a list of search results, the most relevant, according to the search engine here are located above.

Search relevance - search for the most responding service to the user's materials and the location of the hyperlink on them on the issuance page with more accurate results above the others. The distribution itself is called the ranking of sites.

So how does the search engine prepare for issuing your materials and how does the search engine name be found? The collection of information in the network contributes unique for each search system a robot or a different bot, which also has a number of other synonyms as a crawler or spider, and the search system itself can be divided into three stages:

To the first step of the search engine work, you can attribute site scanning in global Network And collecting on your own servers copies of web pages. It forms great amount Not yet processed and not suitable information for search results.

The second stage of the search engine is reduced to bringing into order of the previously obtained, at the first stage of information from sites. This sorting is produced, which for the smallest time will favorably favors the highest quality search, which users are actually waiting for the search engine. The stage is called indexing, it means that pages are already prepared for extradition, and the current base will be considered an index.

Just the third stage and causes search results after receiving a request from its client, based on key or about keywords specified in the request. This contributes to the selection of the most relevant request for information and subsequent issuance. Since information, very, very many, the search engine performs ranking in line with its algorithms.
The best search engine is the one that can provide the most correctly responding material to the user's request. But here they can meet the results that were influenced by people interested in promoting their site, such sites are not always, but often appear in the search results, but not for a long time.

Although world leaders in many regions are defined, search engines continue to develop their high-quality, search. The better the search they will be able to provide, the more people will use it.

How to use the search engine?

What is a search engine and how it works already understandable, but how to use it right? Most sites are always present a search string, and next to it is the Find button or search. A request is entered into the search string, after which you need to press the search button or how it happens more often, press the Enter key on the keyboard and in a matter of seconds you get the result of the query as a list.

But to get the right answer to the search request, it is not always possible to get the first time. In order for the search for the desired did not become painful, it is necessary to properly compose search query and follow the recommendations below.

Make a search query correctly

Next will indicate tips on how to use the search engine. Following some tricks and rules when searching for information in the search engine will give the opportunity to get the desired result much faster. Follow these recommendations:

  1. Competent writing words provides maximum amount Coincidences with the desired information facility (at least modern search engines have already learned to correct spelling errors, but it is not worth neglected by this advice).
  2. Through the use of synonyms in the query, you can reach a wider search range.
  3. Sometimes changing the word in the query text can bring a greater result. Request a request.
  4. Promote species to the request, use the exact entry of phrases to determine the main essence of the search.
  5. Experiment with keywords. The use of keywords and phrases can help identify the main essence, and the search engine will give a more relevant result.

So such a search engine is nothing but the opportunity to find the information of interest and is usually completely free to use it, to learn something, to understand something or make the right conclusion for yourself. Many no longer represent their lives without voice searchWith which the text does not have to gain, you only need to give your request, and the microphone input device is here. All this indicates a constant development of search technologies on the Internet and the need for them.

The search for the search pointer occurs in three stages, of which the two are preparatory and invisible for the user. First, the search pointer collects information from the World Wide Web. For this use special programssimilar browsers. They are able to copy the specified Web page to the search pointer server, watch it, find all the hyperlinks that are on it the resources found there, to again find the hyperlinks available in them. Such programs are called worms, spiders, caterpillars, cragolers, spiders and other similar names. Each search pointer operates its own unique programwhich is often developing and develops. It is conveniently, with a good input, the spider is able to play all the Web space for one dive, but it takes a lot of time, and it is still necessary to periodically return to previously visited resources to control the changes occurring there and identify " Dead "references, i.e. lost the relevance.

After copying the wrank Web resources to the search engine server, the second stage of work begins - indexation. In the course of indexing, special databases are created, with which you can install, where and when it was found on the Internet, a particular word. Consider the indexed database is a kind of dictionary. It is necessary to ensure that the search engine can very quickly respond to user requests. Modern systems Can give answers for a split second, but if you do not prepare indexes in advance, then the processing of one request will continue for hours.

At the third stage, the client's request is handling and issuing a search results in the form of a list of hyperlinks. Suppose the client wants to know where there are Web pages on the Internet, on which the famous Dutch mechanic, optics and mathematician Christians Guigens are mentioned. It enters the word Guigens in the keyword set field and clicks the "Find" button. According to its pointer bases, the search engine in the fraction of a second is looking for suitable Web resources and forms the search results page on which the recommendations are presented in the form of hyperlinks. Next, the client can use these links to transition to its resources.

All this looks simple enough, but in fact there are problems here. The main problem of the modern Internet is related to the abundance of Web pages. It is enough to enter in the search field such a simple word, as, for example, football, and the Russian search engine will give a few thousand links, grouped them by 10-20 pieces on the displayed page.

However, for an ordinary consumer, absolutely anyway, they will give him a thousand search results or a million. As a rule, customers look at no more than 50 references standing first, and what is going on, few people are bothering. However, customers are very and very worried about the quality of the very first heads. Customers do not like when there are references in the first top ten, they have lost the relevance, they are annoyed when the links are going to the neighboring files of the same server. The very bad option - when in a row are several links leading to the same resource, but located on different servers.

The client has the right to expect the most useful links to be the first to stand. Here and the problem arises. A person is easily distinguished by a useful resource from useless, but how to explain this program? Therefore, the best search engines show miracles artificial Intelligence In an attempt to sort the found links for the quality of their resources. And they should do it quickly - the client does not like to wait.

All search engines draw initial information from the same web space, so the initial databases they may be relatively similar. And only in the third stage, when issuing search results, each search engine begins to show its best (or worst) individual traits. Operation of sorting the results obtained is called ranking. Each Web page found system assigns some rating that should reflect the quality of the material. But the quality is a notion of a subjective, and the program needs objective criteria that can be expressed by numbers suitable for comparison.

High ratings are received by Web pages that have a keyword used in, query, enters the title. The rating level rises if this word is found on the Web page several times, but not too often. Posseably affect the ranking the entry of the desired word in the first 5-6 paragraphs of the text - they are considered the most important when indexing. For this reason, experienced Web Masters avoid giving the table at the beginning of their pages. For the search engine, each cell of the table looks like a paragraph, and therefore the meaningful basic text seems to be far back (although it is not noticeable on the screen) and ceases to play a decisive role for the search engine.

Very good if the keywords used in the query are included in the alternative text accompanying illustrations. For the search engine, this is a sure sign that this page Specifying the query. Another sign of the quality of the Web page is the fact that it has links to some other Web pages. What they are more, the better. It means that this Web page is popular and has a high quoting indicator. The perfect search engines are followed by the level of citation registered by them Web pages and take into account it when ranking.

information search engine

The Internet is required to many users in order to receive answers to requests (questions) that they are injected.

If there were no search engines, users would have to independently search for the necessary sites, remember them, write. In many cases, it would be very difficult to find "manually", and often it is simply impossible.

For us all this routine work on the search, storage and sorting of information on sites are made by search engines.

Let's start with the well-known search engines of the Runet.

Search engines on the Internet in Russian

1) Let's start with the domestic search engine. Yandex works not only in Russia, but also works in Belarus and Kazakhstan, in Ukraine, in Turkey. There is also Yandex in English.

2) Google's search engine came to us from America, has a Russian-speaking localization:

3) Domestic search engine Male Ru, which at the same time represents social network VKontakte, Odnoklassniki, also my world, famous answers Mail.ru and other projects.

4) Intellectual Search Engine

Nigma (Nigma) http://www.nigma.ru/

From September 19, 2017, the "intellectual" Nigma does not work. She stopped presenting financial interest to her creators, they switched to another search engine called Coccoc.

5) The well-known company Rostelecom has created a satellite exploratory system.

There is a search engine satellite, designed specifically for children, about which I wrote.

6) Rambler was one of the first domestic search engines:

There are other famous search engines in the world:

  • Bing,
  • Yahoo !,
  • Baidu,
  • Ecosia,

Let's try to figure out how the search engine works, namely, how the site indexes occurs, the analysis of the results of indexing and the formation of search results. The principles of search engines are approximately the same: search for information on the Internet, its storage and sorting for issuing in response to user requests. But algorithms for which search engines work can be very different. These algorithms are kept secret and is prohibited by its disclosure.

Entering the same query in search lines of different search engines, you can get different answers. The reason is that all search engines use their own algorithms.

Purpose of search engines

First of all, you need to know that search engines are commercial organizations. Their goal is to receive profits. Profit can be obtained from contextual advertising, other types of advertising, with the promotion of the right sites on the top lines of extradition. In general, there are many ways.

It depends on what the size of the audience is, that is, how many people use this search engine. The more the audience, the greater the number of people will be shown advertising. Accordingly, this advertising will cost. Increase the audience of the search engines can at the expense of their own advertising, as well as attracting users by improving the quality of their services, algorithm and search convenience.

The most important and complex here is the development of a full-fledged functioning search algorithm that would provide relevant results to most user requests.

Work of the search engine and the actions of webmasters

Each search engine has its own algorithm, which should take into account a huge number of different factors When analyzing information and preparation of issuing in response to a user request:

  • age of a particular site
  • site domain characteristics,
  • quality content on the site and its types
  • features of the navigation and structure of the site,
  • usability (convenience for users),
  • behavioral factors (the search engine can determine if the user found what he was looking for on the site or the user came back to the search engine and there again looking for an answer to the same request)
  • etc.

All this needs to ensure that the issuance of the user's request has been the most relevant satisfying user requests. In this case, the search engine algorithms are constantly changing, refined. As they say, there is no limit to perfection.

On the other hand, webmasters and optimizers constantly invent new ways to promote their sites that are far from always honest. The task of developers of the search engine algorithm is to make changes to it that would not allow the "bad" sites of dishonest optimizers to be provided in the top.

How does the search engine work?

Now about how the direct work of the search engine occurs. It consists of at least three stages:

  • scanning,
  • indexing,
  • ranging.

The number of sites on the Internet reaches just an astronomical value. And each site is information, information content that is created for readers (living people).

Scanning

This is a wandering of the search engine on the Internet to collect new information, to analyze links and search for new content, which can be used to issue a user in response to its requests. For scanning from search engines there are special robots, which are called search robots or spiders.

Search robots are programs that automatic mode We visit sites and collect information from them. Scanning can be primary (the robot comes to a new site for the first time). After the initial collection of information from the site and bring it to the search engine database, the robot begins with a certain regularity to enter its pages. If some changes have occurred (new content has been added, the old one has retired), then all these changes will be fixed by the search engine.

The main task of the search spider is to find new information and give it a search engine to the next processing stage, that is, on indexing.

Indexing

The search engine can search for information only among those sites that are already listed in its database (indexed by it). If scanning is the process of finding and collecting information, which is available on a particular site, the indexation is the process of entering this information to the search engine database. At this stage, the search engine automatically decides whether to make one or another information to its database and where to make it, in which database section. For example, Google index almost all the information found by his robots on the Internet, and Yandex is more picky and indexes not all.

For new sites, the index stage may be long, so visitors from search engines new sites can wait long. BUT new informationwhich appears on old, promoted sites, can be indexed almost instantly and almost immediately fall into the "index", that is, in the search database.

Ranging

Ranking is building information that was previously indexed and entered into a database of a particular search engine, in rank, that is, what information search engine will show your users first and foremost, and what information to put "rank" below. Ranking can be attributed to the service stage of their client's search engine.

On the search engine servers, the information received and the formation of issuing various requests on the huge spectrum. The search engine algorithms are already engaging here. All sites recorded in the database are classified on topics, the subjects are divided into groups of requests. For each of the query groups, a preliminary issuance may be compiled, which will later be adjusted.



Did you like the article? Share it