How to create robots.txt correctly

Nowadays the Internet has spread around the world. It is hard to imagine a day without Internet access, where you can read the news and find the information you need. New sites appear, and with them new protocols for performing particular operations. A webmaster should know the established conventions and be able to pick up new tools and protocols quickly.
Search engine robots request the robots.txt file first when they visit a site. This file contains the instructions that determine what the robot does next, including which files and areas are not to be indexed.

Every programmer and layout designer should be able to write this text file correctly, because mistakes in it lead to many undesirable consequences. The main purpose of robots.txt is to prohibit indexing. Note that search robots are not obliged to follow this file: it acts as a recommendation that well-behaved robots consult while crawling the site.

The file is plain text with the .txt extension. It can be created in any plain-text editor, such as the standard Notepad program, and is then placed in the root folder of the site, where robots look for indexing instructions. Note that indexing recommendations can be addressed to all search engines or only to particular robots.

The programmer should be guided by the following rules when writing such a file:

First of all, the name must stay exactly “robots.txt”; it must not be changed to, for example, “robot.txt”. If the name is different, the robot will simply ignore the instructions.

The name must be written in lowercase: “robots.txt”, not “ROBOTS.TXT”. This point is also mandatory.

The most important thing is the location of the file. It must be placed in the root folder of the site; only there will robots find it, which prevents unwanted errors and consequences.

The spelling of the file's contents must also be correct. If mistakes are made, part of the site, and in some cases all of its content, will end up being indexed.
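The naming and location rules above can be sketched in a few lines of Python. This is only an illustration: the helper name robots_txt_url and the example.com address are assumptions, not part of the original text. It shows that, for any page of a site, the file is expected at the root under the fixed lowercase name.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the site that serves page_url."""
    parts = urlsplit(page_url)
    # Keep only the scheme and host; the path is always the fixed lowercase name.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical page address, used only for illustration.
print(robots_txt_url("https://example.com/blog/post.html"))
# https://example.com/robots.txt
```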

A simple robots.txt file is made up of three lines:

User-agent: *
Disallow: /adminka/
Disallow: /image/

Let’s consider each of the components in more detail.
User-agent: *. The asterisk means that the instructions in the file apply to all robots visiting the site. If the rules are meant for one particular search robot, its name must be given instead of the asterisk.

The lines Disallow: /adminka/ and Disallow: /image/ prohibit indexing of the named parts of the site. Each area that must not be indexed is written on its own line. Combining several areas in one Disallow line is not allowed, and neither is wrapping a single directive across several lines.
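The effect of these directives can be checked with a short sketch. It uses Python's standard urllib.robotparser module; the helper is_allowed, the robot name examplebot and the tested paths are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# The three-line file discussed above, one Disallow path per line.
RULES = """\
User-agent: *
Disallow: /adminka/
Disallow: /image/
"""


def is_allowed(rules: str, user_agent: str, path: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch path."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(user_agent, path)


print(is_allowed(RULES, "examplebot", "/adminka/login.php"))  # False
print(is_allowed(RULES, "examplebot", "/index.html"))         # True
```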
The following examples show how such a file is written for common tasks:

The goal is to prohibit indexing of the entire content of an information resource by all types of search engine robots:
User-agent: *
Disallow: /

The goal is to allow all search robots to index all content of the site (an empty Disallow value blocks nothing):
User-agent: *
Disallow:

The task is to ban one particular search robot (as an example, yandexbot) from indexing the entire site:
User-agent: yandexbot
Disallow: /

The task is to allow indexing for one robot (as an example, yandexbot) and at the same time prohibit it for all other search robots:
User-agent: yandexbot
Disallow:

User-agent: *
Disallow: /
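A minimal sketch of how this kind of file is interpreted, again with Python's standard urllib.robotparser. It assumes the yandexbot group carries an explicit empty Disallow: line; the second robot name, examplebot, and the tested path are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# One group that allows yandexbot everything, one that blocks everyone else.
RULES = """\
User-agent: yandexbot
Disallow:

User-agent: *
Disallow: /
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

yandex_ok = parser.can_fetch("yandexbot", "/catalog/page.html")
other_ok = parser.can_fetch("examplebot", "/catalog/page.html")
print(yandex_ok, other_ok)  # True False
```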

The task is to prohibit indexing of several areas of the site:
User-agent: *
Disallow: /directoria-1/
Disallow: /directoria-2/
Disallow: /hidedirectoria/

The task is to prohibit all search robots from indexing several individual files:
User-agent: *
Disallow: /hide.php
Disallow: /secret.html

To summarize, follow these rules when creating this text document:

Write directive names as shown, with a capital first letter (User-agent, Disallow); paths are case-sensitive and must match the addresses on the site exactly;

Each Disallow line names only one section of the site or one file;

The User-agent line must come before the Disallow lines that belong to it; this order must not be changed.
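The rules above can be expressed as a small generator. This is a sketch under stated assumptions rather than a standard tool: the function build_robots_txt and its input format (a mapping from robot name to a list of paths) are invented for illustration.

```python
def build_robots_txt(groups: dict[str, list[str]]) -> str:
    """Build robots.txt text: User-agent first, then one path per Disallow line."""
    blocks = []
    for user_agent, paths in groups.items():
        lines = [f"User-agent: {user_agent}"]
        # An empty list means nothing is blocked: emit an empty Disallow value.
        for path in paths or [""]:
            if any(ch.isspace() for ch in path):
                raise ValueError(f"one path per Disallow line, no spaces: {path!r}")
            lines.append(f"Disallow: {path}".rstrip())
        blocks.append("\n".join(lines))
    # Groups are separated by a blank line.
    return "\n\n".join(blocks) + "\n"


print(build_robots_txt({"*": ["/directoria-1/", "/hide.php"]}))
```

Rejecting paths that contain whitespace is a design choice made here so that two areas cannot end up combined in one Disallow line.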
