python 正则表达式 re.search

 #coding:utf-8

 import re

 #将正则表达式编译为pattern对象

 #compile(pattern, flags=0)

 #Compile a regular expression pattern, returning a pattern object.

 pattern = re.compile(r'sub2020')

 #help(re.match) Try to apply the pattern at the start of the string

 #match 从 str 的开始（0 位置）查找匹配，如果没有，返回none

 #help(re.search) Scan through string looking for a match to the pattern

 #search 扫描整个 str

 match =re.search(pattern,'http://www.cnblogs.com/sub2020/p/7920845.html')

 if match:

     #使用match获得group信息

     print match.group()

 m=re.search(r'(\w+)(\w+)(\w+)(\d+)(\d+)(\d+)(\d+)','http://www.cnblogs.com/sub2020/p/7920845.html')

 #string: 匹配时使用的文本。

 print 'm.string :',m.string

 #re: 匹配时使用的Pattern对象

 print 'm.re :',m.re

 #pos: 文本中正则表达式开始搜索的索引。值与Pattern.match()

 # 和Pattern.seach()方法的同名参数相同。

 print 'm.pos :',m.pos

 #endpos: 文本中正则表达式结束搜索的索引。值与Pattern.match()

 # 和Pattern.seach()方法的同名参数相同。

 print 'm.endpos :',m.endpos

 #lastindex: 最后一个被捕获的分组在文本中的索引。

 # 如果没有被捕获的分组，将为None。

 print 'm.lastindex :',m.lastindex

 #lastgroup: 最后一个被捕获的分组的别名。

 # 如果这个分组没有别名或者没有被捕获的分组，将为None。

 print 'm.lastgroup :',m.lastgroup

 #.group([group1, …]):获得一个或多个分组截获的字符串；

 # 指定多个参数时将以元组形式返回。group1可以使用编号也可以使用别名；

 # 编号0代表整个匹配的子串；不填写参数时，返回group(0)；

 # 没有截获字符串的组返回None；截获了多次的组返回最后一次截获的子串。

 print 'm.group() :',m.group()

 print 'm.group(1,2)  :',m.group(1,2)

 print 'm.group(1, 2) :',m.group(1, 2)

 #groups([default]):以元组形式返回全部分组截获的字符串。

 # 相当于调用group(1,2,…last)。default表示没有截获字符串的组以这个

 # 值替代，默认为None。

 print 'm.groups() :',m.groups()

 print 'm.groups(3) :',m.groups(3)

 #groupdict([default]):返回以有别名的组的别名为键、以该组截获的子串

 # 为值的字典，没有别名的组不包含在内。default含义同上。

 print 'm.groupdict() :',m.groupdict()

 #start([group]):返回指定的组截获的子串在string中的起始索引

 #（子串第一个字符的索引）。group默认值为0。

 print 'm.start(2) :',m.start(2)

 print 'm.start(3) :',m.start(3)

 #end([group]):返回指定的组截获的子串在string中的结束索引

 #（子串最后一个字符的索引+1）。group默认值为0。

 print 'm.end(2) :',m.end(2)

 print 'm.end(3) :',m.end(3)

 #span([group]):返回(start(group), end(group))。

 print 'm.span(1) :',m.span(1)

 print 'm.span(2) :',m.span(2)

 #expand(template):将匹配到的分组代入template中然后返回。

 # template中可以使用\id或\g、\g引用分组，但不能使用编号0。

 # \id与\g是等价的；但\10将被认为是第10个分组，如果你想表达\1之后是字符’0’，

 # 只能使用\g0。

 print r"m.expand(r'\g') 1:",m.expand(r'\1')

 print r"m.expand(r'\g') 2:",m.expand(r'\2')

 print r"m.expand(r'\g') 3:",m.expand(r'\3')

 print r"m.expand(r'\g \g\g') :",m.expand(r'\2 \2\2')

 print r"m.expand(r'\g \g\g') :",m.expand(r'\2 \1\3')

Output:

 sub2020

 m.string : http://www.cnblogs.com/sub2020/p/7920845.html

 m.re : <_sre.SRE_Pattern object at 0x0000000001D08810>

 m.pos : 0

 m.endpos : 45

 m.lastindex : 7

 m.lastgroup : None

 m.group() : sub2020

 m.group(1,2)  : ('s', 'u')

 m.group(1, 2) : ('s', 'u')

 m.groups() : ('s', 'u', 'b', '', '', '', '')

 m.groups(3) : ('s', 'u', 'b', '', '', '', '')

 m.groupdict() : {}

 m.start(2) : 24

 m.start(3) : 25

 m.end(2) : 25

 m.end(3) : 26

 m.span(1) : (23, 24)

 m.span(2) : (24, 25)

 m.expand(r'\g') 1: s

 m.expand(r'\g') 2: u

 m.expand(r'\g') 3: b

 m.expand(r'\g \g\g') : u uu

 m.expand(r'\g \g\g') : u sb

 ***Repl Closed***

quote:http://cuiqingcai.com/977.html

python 正则表达式 re.search的更多相关文章

Python正则表达式re.search(r'\*{3,8}','*****')和re.search('\*{3,8}','*****')的匹配结果为什么相同？
老猿做过如下测试: >>> re.search(r'\*{3,100}','*****') <re.Match object; span=(0, 5), match='**** ...
python正则表达式--match search方法
1.re.match函数 re.match 尝试从字符串的起始位置匹配一个模式,如果不是起始位置匹配成功的话,match()就返回None. (1)函数语法: re.match(pattern, st ...
python正则表达式(4)--search方法
1.re.search函数 re.search 扫描整个字符串并返回第一个成功的匹配,如果匹配失败search()就返回None. (1)函数语法: re.search(pattern, string ...
Python正则表达式(总)
转载请标明出处: http://www.cnblogs.com/why168888/p/6445044.html 本文出自:[Edwin博客园] Python正则表达式(总) search(patte ...
Python正则表达式re模块学习遇到的问题
Python正则表达式处理的组是什么? Python正则表达式处理中的匹配对象是什么? Python匹配对象的groups.groupdict和group之间的关系 Python正则表达式re.mat ...
python正则表达式基础，以及pattern.match(),re.match(),pattern.search(),re.search()方法的使用和区别
正则表达式(regular expression)是一个特殊的字符序列,描述了一种字符串匹配的模式,可以用来检查一个字符串是否含有某种子字符串. 将匹配的子字符串替换或者从某个字符串中取出符合某个条件 ...
Learning Python 008 正则表达式-003 search()方法
Python 正则表达式 - search()方法 findall()方法在找到第一个匹配之后,还会继续找下去,findall吗,就是找到所有的匹配的意思.如果你只是想找到第一个匹配的信息后,就不在继 ...
Python 正则表达式入门（中级篇）
Python 正则表达式入门(中级篇) 初级篇链接:http://www.cnblogs.com/chuxiuhong/p/5885073.html 上一篇我们说在这一篇里,我们会介绍子表达式,向前向 ...
Python 正则表达式入门（初级篇）
Python 正则表达式入门(初级篇) 本文主要为没有使用正则表达式经验的新手入门所写. 转载请写明出处引子首先说正则表达式是什么? 正则表达式,又称正规表示式.正规表示法.正规表达式.规则表达 ...

随机推荐

MongoDB的使用学习之（四）权限设置--用户名、密码、端口==
本文参照:http://hi.baidu.com/tianhuimin/item/590d96cfd7ac1509c610b26a 本人也是按照此文章操作的,但是有些不妥,红色文字就是我实践后,需要改 ...
ElasticSearch - 解决ES的深分页问题 (游标 scroll)
https://www.jianshu.com/p/f4d322415d29 1.简介 ES为了避免深分页,不允许使用分页(from&size)查询10000条以后的数据,因此如果要查询第10 ...
CentOS7搭建FastDFS V5.11分布式文件系统及Java整合详细过程
1.1 FastDFS的应用场景 FastDFS是为互联网应用量身定做的一套分布式文件存储系统,非常适合用来存储用户图片.视频.文档等文件.对于互联网应用,和其他分布式文件系统相比,优势非常明显.其中 ...
计蒜客蓝桥模拟 B.素数个数
用 0,1,2,3⋯70,1,2,3 \cdots 70,1,2,3⋯7 这 888 个数组成的所有整数中,质数有多少个(每个数字必须用到且只能用一次). 提示:以 000 开始的数字是非法数字. 代 ...
larval 使用redis做缓存
1.存redis 使用setex命令可以同时设置数据和保存时间 $data = [ 'name'=>zhangsan, 'age' => 28, 'sex' => 1 ]; Redi ...
noip级别模板小复习
不是很noip的知识点就不写了. dij什么的太easy就不写了. 缩点注意$Tarjan$在缩边双和求强联通分量时候的区别. 一个要判断是否在栈内一个不要. 最后$topsort$来\(d ...
apache-2.4.x 编译安装方法
apache-2.4.x 编译安装方法作者:朱茂海 /分类:Apache 字号:L M S apache-.2与新出的apache-.4安装不同的地方在于,.4版的已经不自带apr库,所以在安装a ...
Java课程作业02
01. 一.设计思想: 第一种使用n!的公式直接计算,利用递归方法求n! 第二种使用递推的公式,利用递归返回求和. 二.程序流程图三.源代码 import java.util.*;import ja ...
man clock
CLOCK(3) Linux程序员手册 CLOCK(3) 名称时钟-确定处理器时间简介 #include <time.h> clock_t clock(void); ...
初识Django之前端后端与数据库的配置
目录一.静态文件配置二.关于form表单的补充三.post请求出现403问题四.request方法初识 4.1判断当前请求方式 4.2request.POST与request.GET 五.py ...

python 正则表达式 re.search

python 正则表达式 re.search的更多相关文章

随机推荐

热门专题