
How Google assigns a rating to a web page
This document is based on a patent application filed by Google with the United States Patent Office on April 26, 2007 (1). He details how each page is assigned a score that will determine its position in the search engine results. All criteria that determine the location of the page are analyzed, and, therefore, the reasons that cause the sandbox effect are revealed.
Document date
Date is important for assigning a score. There are several ways to determine the date of a document: This can be the date of indexing or the date of posting a link on the page.
If the number of links on a page grows faster than for an older page, this will give a better score, but can also signal spamming.
If the document is newer than the average of the pages in the result, it can be given the best score to improve the position to take into account its novelty.
Change page content
The score is not the same depending on whether the content of the document often changes or not.
To identify changes, you can store the entire document, or the signature that represents the entire document, or the portion that is deemed necessary for the document.
The score may be positive or negative depending on these changes.
Analyzing Queries and Mouse Clicks on Results
You can consider how the document is selected from the query results.
If some terms are more common in user requests, the document associated with these terms (containing them or containing their backlinks) will have a better score.
If a document often responds to such requests, then this document will receive the best score.
It should be borne in mind that some requests remain valid, while the pages responding to them differ from each other (for example, in sports results). The score drops if the document no longer responds to the request.
In some areas, such as the FAQ, the novelty of the document is important and improves the score.
However, if users click on the link of an older document and ignore the latter, then this document will have a better score.
A document that appears more often in requests for a topic, but less when the field is specified, will have a lower score (for example, the topic may be a sport, and it returns to the topic of a particular sports club).
If the document appears in requests without a connection between them, this signals spam, and the score decreases.
Condition for links on page
The appearance of backlinks and their disappearance are taken into account.
If the appearance of new backlinks decreases over time, it means that the document becomes outdated, then its account will be reduced.
Conversely, if that number seeks progress, it will have a better outcome.
If the content of a document is changed but the link it contains on another page is retained, this adds value to that link and therefore increases the rating of the linked page.
Links grow in value if they are "trusted," which happens with government sites, for example.
The rate at which backlinks appear reports spam. It is assumed that pages of this type attract links to a given speed. If too many backlinks appear, this involves the exchange of links or a purchase, or free registration pages (for example, directories, N.D.T.) and this is spam.
Anchor text
Changing the anchor text indicates that the document has been updated.
If the text changes and differs from the wording of the bindings, this means that the document is reworded, but that it is no longer relevant with bindings, which is undesirable.
From now on, you can determine the date when this or that area will be replaced by the topic, and links to this date will be ignored.
If the document is experiencing minor changes, you need to keep the wording of the anchors, their experience is the key to relevance.
Traffic on page
If the traffic, that is, the number of reads per page, is significantly reduced, then the document is outdated. Time and periods are compared to estimate the decrease in traffic.
Traffic from advertising is taken into account. If you advertise on other sites with high traffic, then the page will have a better score than ads on small sites.
Visitor behavior
The number of times a page is selected in search results matters, as does the time spent accessing the page.
Depending on whether the visitor spends more or less time on the page, it will be considered relevant or outdated. If visitors spend less and less time on the page over time, it will be considered outdated.
Domain Name Information
Includes hosting, Intranet, Internet, or document database network.
The latter domains can be used by spammers and therefore be considered less legitimate.
The data of the name server, domain owner, contacts, addresses of the name server are taken into account. Frequent changes are signs of spam. IP addresses and other data used for these unstable nodes are registered in the database along with the corresponding documents.
A name server is best considered if it references different domains for different registries. It's bad if it hosts porn sites, spam sites, domains containing commercial words.
The assessment of a document depends on the domain and its location.
Previous ranks
Former ranks are taken into account. The number of items that a document earns in a given time changes its valuation. However, if the rank remains high, while positions tend to change over time on a topic, this indicates a commercial topic and a higher likelihood of spam.
If the number of selections for a page increases, or if the selection is more frequent, the page will have a better score.
The engine monitors peaks in the rank of documents, synonymous with news or spam. To change the situation, different factors are taken into account. For example, a document mentioned in the news is not spam.
On the contrary, a sudden drop in the rank of the document suggests that it is outdated.
In conclusion, changing a document's rank affects its grade and future rank.
Bookmarks
Bookmarks and other such data affect the valuation of the document. The fact of adding or removing from this type of list is taken into account. It also affects the fact that the document in the list is often accessed.
Cache, temporary directories, and cookies are taken into account. All this indicates whether the document is being viewed or not.
Unique and anchored words
The frequency of a single word or sentence in bindings is taken into account in connection with references to them.
If there are suspicious anchors, in particular, since there are many Indent inscriptions in different documents, this will affect the account of these documents and those who have a link to them.
Inappropriate references
Inbound or outbound irrelevant links are an indicator of spam and reduce page ratings.
Subject of the document
It is used to determine its score.
The subject of the document is determined by rare words, URL, summary, content, etc.
If the document set theme changes, it points to a different owner or theme and all information about the document becomes out of date. Or it means that the document is used for spam.
(1) U.S. Patent and Trademark Office source.
A FreshRank calculation patent, a note on page freshness, was also filed.