使用python正则表达式替换HTML片段中的块

text = 'Text1 Text2 ' pattern = " (?!|) " matches = [ match for match in re.finditer(pattern, text) ] #matches = [ 'Text1 Text2 ' ]

1条回答

网友

1楼 · 发布于 2024-10-01 02:35:40

下面的例子是给你一个想法，你可以自己实现

from simplified_scrapy.core.regex_helper import replaceReg,regSearch
html = '''
<p>Text1</p> <br />Text2 <br /> <p> </p> <br/>
<p>Text11</p> <br />Text12 <br /> <p> </p> <br/>
'''
while True: # Use cycle to process one by one
    o = regSearch(html,"<br\s*/>[^<]*<br\s*/>") # Take out the data to be replaced
    if not o: break
    n = replaceReg(o,"<br\s*/>","<p>",1) # Replace start
    n = replaceReg(n,"<br\s*/>","</p>",1) # Replace end
    html = html.replace(o,n)
print (html)

结果:

<p>Text1</p> <p>Text2 </p> <p> </p> <br/>
<p>Text11</p> <p>Text12 </p> <p> </p> <br/>

编程相关推荐

java控制台返回扫描器捕获的第一件事，而不转移到其他代码块
java无法使用Jedis Lib本地连接到aws上的ElasticCache群集
java我正在尝试将GPS功能添加到我的安卓应用程序中，GPS坐标每次都是0.0,0.0
Java/Selenium RemoteWebDriver/Maven/JUnit在尝试调用浏览器时获取空会话id
java可自由拖动的TextView精细控件
java如何找到使用特定端口的神秘服务？
swing无法使用Ubuntu运行Java GUI程序
.net如何在java GUI应用程序中读取终端流？
使用ApacheMaven打包时出现java错误
java我需要帮助来制作文本框架

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用python正则表达式替换HTML片段中的块

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >