<p>Why not use an HTML parser to parse and modify the <code>HTML</code>?</p>
<p>For example, using <a href="http://www.crummy.com/software/BeautifulSoup/bs4/doc/" rel="nofollow noreferrer"><code>BeautifulSoup</code></a> and <a href="http://www.crummy.com/software/BeautifulSoup/bs4/doc/#replace-with" rel="nofollow noreferrer"><code>replace_with()</code></a>:</p>
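<pre><code>from bs4 import BeautifulSoup

data = """Your html here"""

# Name the parser explicitly so the result does not depend on what is installed
soup = BeautifulSoup(data, "html.parser")

# soup('a', id=True) is shorthand for soup.find_all('a', id=True)
for link in soup('a', id=True):
    link.replace_with('AAAAAA')

print(soup.prettify())
</code></pre>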
<p>This replaces every link that has an <code>id</code> attribute with the text <code>AAAAAA</code>:</p>
<pre><code><div class="thebibliography">
 <p class="bibitem">
  <span class="biblabel">
   [1]
   <span class="bibsp">
   </span>
  </span>
  AAAAAA
  <span class="cmcsc-10">
  ...
</code></pre>
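<p>To try this without the original document, here is a minimal self-contained sketch. The sample markup, the <code>replace_id_links</code> helper and its <code>placeholder</code> parameter are made up for illustration and are not taken from the question. It assumes <code>bs4</code> is installed and uses the built-in <code>html.parser</code> backend.</p>

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup (an assumption, not the asker's real document):
# one link with an id attribute and one without.
SAMPLE_HTML = (
    '<div class="thebibliography">'
    '<p class="bibitem"><span class="biblabel">[1]</span>'
    '<a id="Xref1" href="#ref1">Smith</a> and '
    '<a href="https://example.com">a link without id</a></p>'
    '</div>'
)


def replace_id_links(html, placeholder="AAAAAA"):
    """Replace every <a> tag that has an id attribute with plain text."""
    soup = BeautifulSoup(html, "html.parser")
    # id=True matches any tag that has an id attribute, whatever its value
    for link in soup.find_all("a", id=True):
        link.replace_with(placeholder)
    return str(soup)


result = replace_id_links(SAMPLE_HTML)
print(result)
```

<p>Only the link carrying an <code>id</code> is swapped for the placeholder. The second link is left alone, which is hard to guarantee with a regular expression once attribute order or quoting varies.</p>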
<p>See also:</p>
<ul>
<li><a href="https://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags">RegEx match open tags except XHTML self-contained tags</a></li>
</ul>