what are stop words

I. Summary

One-sentence summary: avoid stop words in your SEO keywords; otherwise search engines will simply ignore them.

Stop words: the most common words in a language.

In computing, stop words are words which are filtered out before or after processing of natural language data (text).[1] Though "stop words" usually refers to the most common words in a language, there is no single universal list of stop words used by all natural language processing tools, and indeed not all tools even use such a list. Some tools specifically avoid removing these stop words to support phrase search.
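
The filtering step described above can be sketched in a few lines of Python. This is a minimal illustration, not the behavior of any particular tool: the `STOP_WORDS` set, `tokenize`, and `remove_stop_words` are hypothetical names, and the stop list is a tiny sample chosen for the example.

```python
import re

# Illustrative stop list only; real tools ship their own (usually longer) lists.
STOP_WORDS = {"a", "an", "and", "at", "in", "is", "of", "on", "the", "to", "which"}

def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())

def remove_stop_words(text, stop_words=STOP_WORDS):
    """Return the tokens of `text` that are not in `stop_words`, in order."""
    return [tok for tok in tokenize(text) if tok not in stop_words]

print(remove_stop_words("The cat is on the mat in the kitchen"))
# ['cat', 'mat', 'kitchen']
```

Because the stop list is just a set passed as a parameter, swapping in a different list for a different purpose does not change the filtering logic.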

Any group of words can be chosen as the stop words for a given purpose. For some search engines, these are some of the most common, short function words, such as the, is, at, which, and on. In this case, stop words can cause problems when searching for phrases that include them, particularly in names such as "The Who", "The The", or "Take That". Other search engines remove some of the most common words (including lexical words, such as "want") from a query in order to improve performance.[2]
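
The phrase-search problem is easy to reproduce. The sketch below assumes a hypothetical stop list (with "that" added only so the "Take That" case shows up) and a hypothetical `keep_phrase` flag that stands in for a tool choosing not to strip stop words from phrase queries.

```python
# Hypothetical stop list; "that" is an assumption added for the "Take That" case.
STOP_WORDS = {"the", "is", "at", "which", "on", "that"}

def filter_query(query, keep_phrase=False, stop_words=STOP_WORDS):
    """Drop stop words from a query unless it is treated as an exact phrase."""
    words = query.lower().split()
    if keep_phrase:
        # Phrase search: keep every word so the name stays searchable.
        return words
    return [w for w in words if w not in stop_words]

for name in ["The Who", "The The", "Take That"]:
    print(name, "->", filter_query(name), "| as phrase:", filter_query(name, keep_phrase=True))
```

With filtering on, "The The" is reduced to an empty query and "The Who" to the single word "who", which is why such names are hard to find in engines that always strip stop words.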

Hans Peter Luhn, one of the pioneers in information retrieval, is credited with coining the phrase and using the concept.[3] The phrase "stop word", which is not in Luhn's 1959 presentation, and the associated terms "stop list" and "stoplist" appear in the literature shortly afterwards.[4]

A predecessor concept was used in creating some concordances. For example, the first Hebrew concordance, Me’ir nativ, contained a one-page list of unindexed words, with nonsubstantive prepositions and conjunctions which are similar to modern stop words.[5]

In SEO terminology, stop words are the very common words that most search engines skip during crawling or indexing, which saves processing time and space in their databases.
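
To see why skipping stop words saves space, here is a toy inverted index built with and without a stop list. It is a simplified sketch under stated assumptions: the documents, the stop list, and the helper names `build_index` and `posting_count` are invented for illustration and do not describe how any real search engine stores its index.

```python
from collections import defaultdict

# Illustrative stop list, not taken from any real search engine.
STOP_WORDS = {"a", "the", "is", "on", "in", "of", "and", "to"}

def build_index(docs, stop_words=frozenset()):
    """Map each word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for word in text.lower().split():
            if word not in stop_words:
                index[word].add(doc_id)
    return index

def posting_count(index):
    """Total number of (word, document) entries stored in the index."""
    return sum(len(doc_ids) for doc_ids in index.values())

docs = [
    "the cat is on the mat",
    "the dog is in the garden",
    "a cat and a dog",
]

full = build_index(docs)
pruned = build_index(docs, STOP_WORDS)
print("terms:", len(full), "->", len(pruned))
print("postings:", posting_count(full), "->", posting_count(pruned))
```

Even on three short sentences, most index entries belong to stop words, so dropping them shrinks the index considerably.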

1. Stop words list

What exactly are stop words? According to Wikipedia, stop words are the most common words in a language. Since there is no single universal list of stop words, Yoast created its own list.

The following list contains most, but not all, of the English stop words used by Yoast SEO and Yoast SEO Premium.

  • a
  • about
  • above
  • after
  • again
  • against
  • all
  • am
  • an
  • and
  • any
  • are
  • as
  • at
  • be
  • because
  • been
  • before
  • being
  • below
  • between
  • both
  • but
  • by
  • could
  • did
  • do
  • does
  • doing
  • down
  • during
  • each
  • few
  • for
  • from
  • further
  • had
  • has
  • have
  • having
  • he
  • he’d
  • he’ll
  • he’s
  • her
  • here
  • here’s
  • hers
  • herself
  • him
  • himself
  • his
  • how
  • how’s
  • I
  • I’d
  • I’ll
  • I’m
  • I’ve
  • if
  • in
  • into
  • is
  • it
  • it’s
  • its
  • itself
  • let’s
  • me
  • more
  • most
  • my
  • myself
  • nor
  • of
  • on
  • once
  • only
  • or
  • other
  • ought
  • our
  • ours
  • ourselves
  • out
  • over
  • own
  • same
  • she
  • she’d
  • she’ll
  • she’s
  • should
  • so
  • some
  • such
  • than
  • that
  • that’s
  • the
  • their
  • theirs
  • them
  • themselves
  • then
  • there
  • there’s
  • these
  • they
  • they’d
  • they’ll
  • they’re
  • they’ve
  • this
  • those
  • through
  • to
  • too
  • under
  • until
  • up
  • very
  • was
  • we
  • we’d
  • we’ll
  • we’re
  • we’ve
  • were
  • what
  • what’s
  • when
  • when’s
  • where
  • where’s
  • which
  • while
  • who
  • who’s
  • whom
  • why
  • why’s
  • with
  • would
  • you
  • you’d
  • you’ll
  • you’re
  • you’ve
  • your
  • yours
  • yourself
  • yourselves
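
A list like this can be used to check a keyphrase before publishing. The sketch below is an assumption-laden illustration, not Yoast's implementation: `find_stop_words` is a hypothetical helper, and `STOP_WORDS` holds only a small subset of the list above. It also normalizes the curly apostrophe (’) used in the list to a straight one so that contractions match.

```python
import re

# Small subset of the list above, for illustration only.
STOP_WORDS = {
    "a", "about", "for", "how", "in", "is", "it's",
    "of", "the", "to", "what's", "with",
}

def find_stop_words(keyphrase, stop_words=STOP_WORDS):
    """Return the stop words found in `keyphrase`, in order of appearance."""
    # Normalize curly apostrophes so "what’s" matches "what's".
    normalized = keyphrase.lower().replace("\u2019", "'")
    tokens = re.findall(r"[a-z0-9]+(?:'[a-z]+)?", normalized)
    return [tok for tok in tokens if tok in stop_words]

print(find_stop_words("How to choose the best keywords for SEO"))
# ['how', 'to', 'the', 'for']
```

An empty result means the keyphrase contains none of the listed words; a non-empty result shows which words a search engine following this list would likely ignore.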

II. List of stop words

 
