https://segmentfault.com/q/1010000010845573 import re #reg=r'\s+[^(href)]*=\"[^<>]+\"' reg = r'\b(?!(?:href|src))\w+=(["\']).+?\1' with open(r'input.txt','r',encoding='ISO-8859-15') as f_read: html= f_read.read() result = re.sub(reg,&
jsoup Cookbook(中文版) 入门 1. 解析和遍历一个html文档 如何解析一个HTML文档: String html = "<html><head><title>First parse</title></head>" + "<body><p>Parsed HTML into a doc.</p></body></html>&quo