what are stop words

一、总结

一句话总结:就是在seo的关键词中不要有stop words,不然的话搜索引擎会直接忽略

stop words  most common  words language

In computingstop words are words which are filtered out before or after processing of natural language data (text).[1] Though "stop words" usually refers to the most common words in a language, there is no single universal list of stop words used by all natural language processing tools, and indeed not all tools even use such a list. Some tools specifically avoid removing these stop words to support phrase search.

Any group of words can be chosen as the stop words for a given purpose. For some search engines, these are some of the most common, short function words, such as theisatwhich, and on. In this case, stop words can cause problems when searching for phrases that include them, particularly in names such as "The Who", "The The", or "Take That". Other search engines remove some of the most common words—including lexical words, such as "want"—from a query in order to improve performance.[2]

Hans Peter Luhn, one of the pioneers in information retrieval, is credited with coining the phrase and using the concept.[3] The phrase "stop word", which is not in Luhn's 1959 presentation, and the associated terms "stop list" and "stoplist" appear in the literature shortly afterwards.[4]

A predecessor concept was used in creating some concordances. For example, the first Hebrew concordance, Me’ir nativ, contained a one-page list of unindexed words, with nonsubstantive prepositions and conjunctions which are similar to modern stop words.[5]

In SEO terminology, stop words are the most common words that most search engines avoid, saving space and time in processing large data during crawling or indexing. This helps search engines to save space in their databases.

1、Stop words list?

What exactly are stop words? According to Wikipedia, stop words are the most common words in a language. Since there is no single universal list of stop words available, we’ve created our own list. Learn more about stop words here.

The following list contains most of the stop words used by Yoast SEO and Yoast SEO Premium in English. The full list can be found here.

  • a
  • about
  • above
  • after
  • again
  • against
  • all
  • am
  • an
  • and
  • any
  • are
  • as
  • at
  • be
  • because
  • been
  • before
  • being
  • below
  • between
  • both
  • but
  • by
  • could
  • did
  • do
  • does
  • doing
  • down
  • during
  • each
  • few
  • for
  • from
  • further
  • had
  • has
  • have
  • having
  • he
  • he’d
  • he’ll
  • he’s
  • her
  • here
  • here’s
  • hers
  • herself
  • him
  • himself
  • his
  • how
  • how’s
  • I
  • I’d
  • I’ll
  • I’m
  • I’ve
  • if
  • in
  • into
  • is
  • it
  • it’s
  • its
  • itself
  • let’s
  • me
  • more
  • most
  • my
  • myself
  • nor
  • of
  • on
  • once
  • only
  • or
  • other
  • ought
  • our
  • ours
  • ourselves
  • out
  • over
  • own
  • same
  • she
  • she’d
  • she’ll
  • she’s
  • should
  • so
  • some
  • such
  • than
  • that
  • that’s
  • the
  • their
  • theirs
  • them
  • themselves
  • then
  • there
  • there’s
  • these
  • they
  • they’d
  • they’ll
  • they’re
  • they’ve
  • this
  • those
  • through
  • to
  • too
  • under
  • until
  • up
  • very
  • was
  • we
  • we’d
  • we’ll
  • we’re
  • we’ve
  • were
  • what
  • what’s
  • when
  • when’s
  • where
  • where’s
  • which
  • while
  • who
  • who’s
  • whom
  • why
  • why’s
  • with
  • would
  • you
  • you’d
  • you’ll
  • you’re
  • you’ve
  • your
  • yours
  • yourself
  • yourselves

二、List of stop words

 

随机推荐

  1. linux设置代理

    在~/.bashrc或者/etc/profile下,添加下面 http_proxy=http://192.168.105.171:80 https_proxy=$http_proxy export h ...

  2. IDEA——找不到或无法加载主类的一种暴力解决方法

    对于用maven构建的java项目,可以利用maven工具编译一下,大致上可以解决很多奇奇怪怪的问题. 具体操作如下: 首先找到项目所在的文件夹,以F:\project为例. 删除.idea文件. 在 ...

  3. Eclipse搭建maven project web war项目pom.xml报错

    在eclipse中搭建maven project时,在不使用模板的情况下,搭建的web项目会报错. 操作步骤如下: 1.勾选Create a simple project ,因为如果不勾选系统会提供模 ...

  4. WSDL(Web服务描述语言)详细解析(全文转载学习用)

    WSDL (Web Services Description Language,Web服务描述语言)是一种XML Application,他将Web服务描述定义为一组服务访问点,客户端可以通过这些服务 ...

  5. 4698: Sdoi2008 Sandy的卡片

    前言 总之这个东西说起来很麻烦就是了, 思路 差分合并+后缀数组+二分(dddl) 类似于那个bzoj1031的复制子串和那个poj1743的差分 来看个例子 3 5 1 2 3 4 5 4 1 1 ...

  6. Docker 安装Hadoop HDFS命令行操作

    网上拉取Docker模板,使用singlarities/hadoop镜像 [root@localhost /]# docker pull singularities/hadoop 查看: [root@ ...

  7. IntelliJ IDEA 中SpringBoot对Run/Debug Configurations配置 SpringBoot热部署

    运行一个SpringBoot多模块应用 使用SpringBoot配置启动: Use classpath of module选中要运行的模块 VM options:内部配置参数 -Dserver.por ...

  8. 【注册码】Matlab7.0(R14)注册码

    Matlab 7 (R14) 注册码1:14-13299-56369-16360-32789-51027-35530-39910-50517-56079-43171-43696-14148-64597 ...

  9. (转)Introductory guide to Generative Adversarial Networks (GANs) and their promise!

    Introductory guide to Generative Adversarial Networks (GANs) and their promise! Introduction Neural ...

  10. 洛谷P2362 围栏木桩----dp思路

    在翻dp水题的时候找到的有趣的题0v0 原文>>https://www.luogu.org/problem/show?pid=2362<< 题目描述 某农场有一个由按编号排列的 ...