How Google Search Results Work

Political leanings don't factor into Google's search algorithm. But the authoritativeness of page links that the algorithm spits out and the perception of thousands of human raters do.

Here's a quick look at how Google returns results when you search for things
What Google's bots do
At its core, Google indexes the entire web — some hundreds of billions of pages — using programs called web crawlers. These bots collect descriptions of pages and their incoming links and save this information in Google's data centers. When you search on Google, it scans this index — which is more than 100 million gigabytes large — to quickly provide what it thinks are the most relevant results.
从本质上说,谷歌使用名为“网络爬虫”的程序对整个web——大约数千亿页——进行索引。这些机器人收集页面及其输入链接的描述,并将这些信息保存在谷歌的数据中心。当你在谷歌上搜索时,它会扫描这个超过1亿gb的索引,以快速提供它认为最相关的结果。
Google knows the most popular search terms and, if you're typing, offers to complete the words as you go.
What humans do
Search results are created by an algorithm that has been fine-tuned to incorporate the reviews of some 10,000-plus employees commonly known as search quality raters.
搜索结果是由一种算法创建的,该算法经过微调,以包含大约1万名员工的评论,这些员工通常被称为搜索质量评分者。
These individual follow a set of guidelines to judge the quality of search results, particularly when Google engineers are considering changes to the search algorithm.

Last year, Google engineers tweaked the search algorithm 2,400 times based on the results of more than 270,000 experiments, rater reviews and live user tests.
去年,谷歌的工程师根据超过27万次的实验结果、评论和现场用户测试的结果,对搜索算法进行了2400次的调整。
When it comes to judging the quality of the top news stories that Google displays, three major issues come into play, according to Google: Freshness, relevancy and authoritativeness. Google's crawlers scan pages more frequently if they change regularly.
在评判谷歌展示的顶级新闻报道的质量时,根据谷歌的说法,三个主要的问题开始发挥作用:新鲜度、相关性和权威性。如果谷歌的爬虫程序经常更换的话,它会更频繁地扫描页面。
In the case of news sites, new stories can be added to the index within seconds of publication. Fresher stories will get bumped up in search results.
Results that are more relevant to a search tend to appear higher on the results page.

What is authority
Raters measure the authoritativeness, expertise and the trustworthiness of the sources that appear in search results. Google suggests that raters consider recommendations from professional societies and experts to determine a page's authority.
评分者衡量在搜索结果中出现的来源的权威性、专业知识和可信度。谷歌建议评级机构考虑专业协会和专家的建议,以确定页面的权威性。
Examples of high-quality news sources include ones that have won Pulitzer Prizes, that clearly label advertising as such, and that garner positive reviews from users. Pages that spread hate, cause harm or misinform or deceive users are given low ratings, Google says.
高质量新闻来源的例子包括那些曾获得普利策奖的奖项,它们清楚地将广告贴上这样的标签,并从用户那里获得了积极的评价。谷歌表示,那些传播仇恨、造成伤害、误导或欺骗用户的页面被给予了较低的评级。
The guidelines tell raters to give a low ranking to pages "deliberately created to deceive users." They provide an example of a source that "looks like a news source" but "in fact has articles to manipulate users in order to benefit a person, business, government or other organization politically, monetarily, or otherwise."
该指南告诉评级机构将“故意创建欺骗用户”的页面排名较低。它们提供了一个“看起来像新闻来源”的例子,但“实际上有文章可以操纵用户,以便在政治上、金钱上或其他方面对一个人、企业、政府或其他组织带来好处。”
Results for most people look the same, but Google results are heavily impacted by location, especially if you search for a physical location like a store. Users' search history can also impact results slightly based on frequently conducted searches.
大多数人的结果看起来都一样,但谷歌搜索结果会受到地理位置的严重影响,尤其是当你搜索一个像商店一样的物理位置时。用户的搜索历史也会根据频繁的搜索结果对结果产生轻微的影响。

L128的更多相关文章

  1. WPF实现无边框窗体拖拽右下角▲ 改变窗体大小【framwork4.0】 谢谢大家关注

    效果图:(右下角拖拽改变窗体大小) 第一步:添加xaml代码: <Border Name="ResizeBottomRight" MouseMove="Resize ...

  2. MongoDB数据库用户名和密码的设置

    首先是对MongoDB用户和权限的设置,如果不设置用户的话,直接使用mongo命令就可以进入客户端shell界面进行操作了,但是如果没有设置用户的话,总感觉少了点什么,于是经过半天的查找和实践,差不多 ...

  3. WPF系列:画图

    Line 在两个坐标点之间画一条直线,通过四个属性设置它的起始和结束 <Line Stroke="Blue" StrokeThickness="3" X1 ...

  4. mongodb不同版本之间有很大的差异

    今天主要说下我为了给mongodb数据库添加authorization,大家应该知道,mongo默认是无auth运行的.这可能是方便小伙伴学习命令吧. 由于之前发布的一个项目,在亚马逊的云上,处于内部 ...

  5. dmalloc 原文 翻译整理

    http://blog.csdn.net/cardinal_508/article/details/5553387 L13 从快速入门开始(Quickstart) 这个库是一个文件中所有简化用法中最常 ...

  6. 程序编码(机器级代码+汇编代码+C代码+反汇编)

    [-1]相关声明 本文总结于csapp: 了解详情,或有兴趣,建议看原版书籍: [0]程序编码 GCC调用了一系列程序,将源代码转化成可执行代码的流程如下: (1)C预处理器扩展源代码,插入所有用#i ...

  7. mongodb副本集自动切换修复节点解决方案

    副本集部署 1.启动mongod 在每台运行mongod服务的机器上增加配置文件/etc/mongodb-rs.conf,内容为: [root@MongodbF-A etc]# vi /etc/mon ...

  8. SIFT算法

     备注:源代码还未理解,所以未附上——下周任务 一.SIFT算法 1.算法简介 尺度不变特征转换即SIFT (Scale-invariant feature transform)是一种计算机视觉的算法 ...

  9. DSO windowed optimization 代码 (2)

    3 非 Schur Complement 部分信息计算 参考<DSO windowed optimization 公式>,非Schur Complement 部分指 \(H_{XX}\) ...

随机推荐

  1. 使用Free命令查看Linux服务器内存使用状况(-/+ buffers/cache详解)

    free命令可选参数 -b,-k,-m,-g show output in bytes, KB, MB, or GB -h human readable output (automatic unit ...

  2. cas无缝单点登录(原创)

    之前一直有一个问题残绕着自己,今天,终于很粗糙的解决了这个问题. 众所周知,按照cas单点登录,默认情况下,在不登录的情况下,打开网站是必须要跳转到登录页面的.那有什么方法可以控制吗,当然有,很简单, ...

  3. Django CSRF cookie not set.错误

    post提交表单报错: Forbidden (403) CSRF verification failed. Request aborted. You are seeing this message b ...

  4. begoo——路由设置

    路由本质是URL与要为该URL调用的视图函数之间的映射表,其实就是你定义的使用那个URL调用那段代码的关系对应表. 首先看一下最简单的路由: package routers import ( &quo ...

  5. Codeforces Round #303 (Div. 2)

    A.Toy Cars 题意:给出n辆玩具车两两碰撞的结果,找出没有翻车过的玩具车. 思路:简单题.遍历即可. #include<iostream> #include<cstdio&g ...

  6. selenium打不开chrome

    selenium打不开chrome,出现下面的报错 requests.exceptions.ChunkedEncodingError: ("Connection broken: Connec ...

  7. PAT 天梯赛 L1-012. 计算指数 【水】

    题目链接 https://www.patest.cn/contests/gplt/L1-012 AC代码 #include <iostream> #include <cstdio&g ...

  8. 使用Socket&反射&Java流操作进行方法的远程调用(模拟RPC远程调用)

    写在前面 阅读本文首先得具备基本的Socket.反射.Java流操作的基本API使用知识:否则本文你可能看不懂... 服务端的端口监听 进行远程调用,那就必须得有客户端和服务端.服务端负责提供服务,客 ...

  9. bind方法代替闭包

    <a href="JavaScript:">a</a> <a href="JavaScript:">b</a> ...

  10. MySQL详解--锁,事务(转)

    锁是计算机协调多个进程或线程并发访问某一资源的机制.在数据库中,除传统的计算资源(如CPU.RAM.I/O等)的争用以外,数据也是一种供许多用户共享的资源.如何保证数据并发访问的一致性.有效性是所有数 ...