擅长:python、mysql、java
<p>使用:</p>
<pre><code>import re
sequence = "AGATCAGATCTTTTTTCTAATGTCTAGGATATATCAGATCAGATCAGATCAGATCAGATC"
matches = re.findall(r'(?:AGATC)+', sequence)
# To find the longest subsequence
longest = max(matches, key=len)
</code></pre>
<p>说明:</p>
<p>非捕获组<code>(?:AGATC)+</code></em></p>
<ul>
<li><code>+</code>量词-尽可能多地在一次和无限次之间匹配</李>
<li><code>AGATC</code>按字面意思匹配字符AGATC(区分大小写)</li>
</ul>
<p>结果:</p>
<pre><code># print(matches)
['AGATCAGATC', 'AGATCAGATCAGATCAGATCAGATC']
# print(longest)
'AGATCAGATCAGATCAGATCAGATC'
</code></pre>
<p>您可以测试regex<a href="https://regex101.com/r/TaBVnx/1" rel="nofollow noreferrer">^{<cd4>}</a></p>