回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>我有一个文本格式类似于此示例:</p>
<blockquote>
<p>PAUL: Lorem ipsum dolor sit amet, consectetuer adipiscing elit. Aenean commodo >ligula eget dolor. </p>
<p>LEONARD: Aenean massa. Cum sociis natoque penatibus et magnis dis parturient >montes, nascetur ridiculus mus. Donec quam felis, ultricies nec, pellentesque >eu, pretium quis, sem. Nulla consequat massa quis enim. Donec pede justo, >fringilla vel, aliquet nec, vulputate eget, arcu. </p>
<p>EVIL NINJA [on the roof]: In enim justo, rhoncus ut, imperdiet a, venenatis >vitae, justo. Nullam dictum felis eu pede mollis pretium. Integer tincidunt. >Cras dapibus. Vivamus elementum semper nisi. Aenean vulputate eleifend tellus. >Aenean leo ligula, porttitor eu, consequat vitae, eleifend ac, enim. </p>
<p>PAUL [SCREAMING]: Aliquam lorem ante, dapibus in, viverra quis, feugiat a, >tellus. </p>
</blockquote>
<p>以及一个正则表达式来将脚本解析为对话框。你知道吗</p>
<pre><code>[A-Z]+([:]|[ ]{1}[[][A-Z]*[]])
</code></pre>
<p>我试图捕获所有蝗虫,以便正则表达式匹配</p>
<pre><code>"PAUL:",
"LEONARD [some context]:"
</code></pre>
<p>正如你所看到的<a href="https://regex101.com/r/WfTuWq/3/" rel="nofollow noreferrer">here</a>我没能捕获所有的蝗虫。你知道吗</p>
<blockquote>
<p>EVIL NINJA [on the roof]:</p>
</blockquote>
<p>我怎样才能捕捉到上面的内容呢?正则表达式是正确的方法吗?你知道吗</p>
<p>编辑:所有演讲者的名字都用大写字母表示,并以冒号结尾。这就是我处理的所有笔录的格式。你知道吗</p>