Internet search engines: review of popular and little-known search engines

Introduction

Few people can now imagine the Internet without search, search results, and the information retrieval systems (IRS) that organize it all. Yet not long ago, all the information on the Internet fit into a few directories whose names are still well known (DMOZ, Yahoo).

Today, the volume of information on the Internet is so large that no catalogue can hold it. To process and store this information and to organize searching, powerful software products have been created, and continue to be created, which we call search engines (SE). Each search engine has its own databases and its own algorithms for processing, searching, ranking and displaying information.

What Internet search engines are

The following academic definition can be given. A search engine is a set of programs and hardware for organizing user search on the Internet: in response to a text query, the user receives a list of relevant results (results that correspond to the query).

The results are presented as a list of links to the information sources, each with a brief description (preview) and sometimes a photo.

As a first example, recall the world search leader, Google, and the leader of Runet search, Yandex. Besides these, there are a dozen or so other working search engines, which are discussed below.

Opinion: Google, Yandex and other search engines are not generators (producers) of content but aggregators (accumulators) of content, and for the most part of other people’s content. It is worth remembering that using someone else’s content to build your own traffic and monetize it could be characterized as “piracy”, although, of course, nobody treats it that way in practice.

Rating

  • Yandex and Google share the first two places: about 49% and 45%;
  • Third place: Search Mail.ru about 3%;
  • Other search engines float below 1%.

Here are the statistics I see in Google Analytics:

  • yandex/organic 40.26%
  • google/organic 38.93%
  • mail.ru/organic 0.60%
  • rambler/organic 0.52%
  • bing/organic 0.12%

The statistics are inexorable: Yandex search is used most of all, and if 3% counts as a good result next to 45%, then Mail.ru search can be called the third most popular.

In this light, talk about the popularity of search engines other than Yandex and Google can be put down to superstition, and promoting sites specifically for those other search engines does not deserve attention.

How search engines work

The question of how search engines work is as common as the question “what color is the sky?”, and the answer is just as short: search engines collect information on the Internet, process it, rank it and return it to the user in response to a search query.

The theory of Internet search is far more extensive and cannot be covered in one article. However, the main points will be useful to us:

Internet search engines do not store documents; that is, they do not copy documents in full into their own repositories;

IRSs use the Internet as a decentralized document repository. Search engines periodically crawl the Internet, select the information they need according to their algorithms, and place part of it in their database. This leads to several problems:

  • Information retrieval systems do not use all the information on the Internet, but only part of it;
  • Internet information changes frequently. About 1.5 million pages are added per day, hence the possibility of “empty results”;
  • There is a large amount of duplicate content. Unfortunately, I don’t have exact data on duplicates, and the reported figure of 25% seems too high;
  • There is a lot of advertising, which is also bypassed by search engines;
  • “Wandering” of search robots on the network greatly increases the load on resources (does not apply to search engines);
  • Most sites are commercial (about 83%) and have little informational value.

For these and some other reasons, the vast majority of Internet information retrieval systems use a keyword search scheme rather than the classic search scheme based on classifying information.

Features of keyword search

Despite the changing algorithms of search engines, whose advertising tries to convince us that the machines are becoming smarter and more understanding, search engines are still based on keyword search.

I like this keyword search scheme.

In this scheme, the work of Internet search engines is based on discovering new documents (the Spider and Crawler robots), indexing the discovered documents (the Indexer) and executing user queries (the search results engine). The names of the robots used for each purpose are given in brackets.
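The three stages named above (crawling, indexing, answering a query) can be sketched in a few lines of Python. This is a minimal illustration, not a description of how any real search engine is implemented: the PAGES dictionary is a hypothetical stand-in for the web, and the function names crawl, build_index and search are invented for the example.

```python
import re
from collections import defaultdict

# Hypothetical stand-in for the web: URL -> (page text, outgoing links).
PAGES = {
    "a.example/": ("search engines crawl the web", ["a.example/index", "b.example/"]),
    "a.example/index": ("an indexer builds a keyword index", []),
    "b.example/": ("users send a keyword query", ["a.example/"]),
}


def crawl(start):
    """Spider/crawler: follow links from a start URL, yielding each page once."""
    seen, queue = set(), [start]
    while queue:
        url = queue.pop(0)
        if url in seen or url not in PAGES:
            continue
        seen.add(url)
        text, links = PAGES[url]
        yield url, text
        queue.extend(links)


def build_index(documents):
    """Indexer: map every keyword to the set of URLs that contain it."""
    index = defaultdict(set)
    for url, text in documents:
        for word in re.findall(r"\w+", text.lower()):
            index[word].add(url)
    return index


def search(index, query):
    """Results engine: return the URLs that contain every word of the query."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return []
    hits = set.intersection(*(index.get(word, set()) for word in words))
    return sorted(hits)


index = build_index(crawl("a.example/"))
print(search(index, "keyword index"))
```

Note that the index stores only words and URLs, not the pages themselves, which matches the point that search engines do not keep full copies of documents. Ranking is left out here; the results are simply sorted by URL.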

As I said, most search engines do not copy the full text of documents into their database. Instead, when a document is indexed, a search image of it is created. To support keyword search, the indexing robot creates the image of the document using the so-called derived method: the document image contains a title and a set of keywords.
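A search image of the kind described above can be sketched as follows. The sketch assumes that the derived method means taking keywords from the document's own text by frequency; the stop-word list, the function name search_image and the max_keywords parameter are assumptions made for illustration only.

```python
import re
from collections import Counter

# Assumed stop-word list; real systems use much larger, language-specific ones.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for", "on"}


def search_image(title, body, max_keywords=5):
    """Build a compact document image: the title plus the most frequent terms."""
    words = [w for w in re.findall(r"[a-z]+", body.lower()) if w not in STOP_WORDS]
    counts = Counter(words)
    keywords = [word for word, _ in counts.most_common(max_keywords)]
    return {"title": title, "keywords": keywords}


image = search_image(
    "How indexing works",
    "Search engines index pages. The index stores keywords, "
    "and keywords point to pages and to the index.",
    max_keywords=3,
)
print(image)
```

The image is far smaller than the document it describes, which is why an index built from such images can cover many more pages than a store of full texts.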

However, it can be stated with reasonable confidence that all IRSs pay attention to the following:

  • Presence of the keyword in the document;
  • Presence of the keyword in the URL or domain;
  • Presence of the keyword in subheadings;
  • Total number of keywords on the page (density, %);
  • Presence of keywords in the description;
  • Which external links lead to the page;
  • Which internal links the page contains.

Page ranking

To conclude the theory, page ranking is worth mentioning. Page ranking in SERPs is most often discussed in the context of relevance. That is, search engines must build search results that match the search query as closely as possible. As Yandex writes, nothing should be lost (completeness of the results, or recall) and nothing unnecessary should be found (accuracy of the results, or precision). You see how this works out in practice every day.

Conclusion

  • Internet search engines are complex software products whose operation is supported by thousands of specialists and enormous material resources.
  • Search engine algorithms are kept secret, although the general focus of each algorithm update is made public and the updates bear proper names.
  • Despite their different approaches to generating search results, all search engines rest on the same general principles of page indexing, which remain the basis of site promotion to this day.

Yandex search engine

A popular Runet search engine that often takes first place. According to statistics from 2009, Yandex constantly crawls 15 million pages of the Runet, processing 140 thousand GB of text data and 1.6 billion unique pictures out of 2.1 billion pictures in total.

The name Yandex dates from 1993; the yandex.ru search engine itself was launched in 1997. The word Yandex does not mean anything, although it is generally accepted that it is a transformation of the word “Index”, or of the phrase “yet another indexer”.

Today, Yandex.Search processes a quarter of a billion requests a day, and if it were not so intrusive, it would be my favorite search engine.

Search Yandex

https://yandex.ru/: Yandex searches the Internet taking the user’s region into account. It can also search images, videos, maps, news, blogs, products and dictionaries.

For fine-grained searches, there is a query language (https://yandex.ru/support/search/query-language/).

Google search engine

In the Google search engine, search is organized as a general search without topics (the main search) and as searches by section: pictures, news, maps, videos, shopping, books, air tickets, finance.

There are settings:

  • Safe search. Blocks inappropriate content and sexual images in Google search results. This feature does not guarantee 100% protection, but it hides most such content.
  • Setting the number of results per page (default 10).
  • Personal results. Finds links, pictures and videos on Google that your friends have shared with you on social networks.
  • Region selection. The default is the current region.
  • Languages. You can specify the search language.
  • Advanced Search. Allows you to search using advanced parameters.
  • Tools. Here you can select the search language, specify when the information appeared, and choose between exact-match results and all results.

Mail search engine

https://go.mail.ru/. Here search covers the Internet (general search), videos and pictures. There is a separate search for mobile applications.

Bing search (https://www.bing.com/?scope=web&FORM=Z9LH). General search, plus search by pictures, videos, news and maps.

Yahoo search in Russian. https://ru.search.yahoo.com/. Pure search without advertising. Web, image and news search. Results can be filtered by the time the information was added.

Other search engines

  • DuckDuckGo (https://duckduckgo.com/) Smart search.
  • Pipl (https://pipl.com/) Search for people in the USA.
  • Findsounds (http://www.findsounds.com/)