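<p>I need a regular expression that matches only amounts in euros: the value must be followed by <code>€</code> or <code>euros</code>, the cents come after a <code>,</code>, and the digits may also be separated by spaces or dots.</p>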
<pre><code>7 967 59 €
- 9847, 48 euros à titre de rappel de salaire sur le bonus de l'année 2012,
- 1929, 78 euros à titre de rappel de salaire sur le bonus de l'année 2013,
- 129 689, 78 euros à titre de solde d'indemnité conventionnelle de licenciement,
- 1098 euros au titre du paiement du DIF,
é à 20 892, 05 euros, il ressort des pi
le de 27 084, 26 euros
ée à 26 395, 10 euros, hors bo
de 129 689, 78 euros,
6.000 € au titre des dommages et intérêts pour licenciement sans cause réelle et sérieuse,
1.510 € au titre de l'indemnité compensatrice de préavis,
151 € au titre des congés payés y afférents, 739 € au titre de l'indemnité de licenciement,
656,19 € au titre de l'indemnité due au titre de la non rémunération de la période de mise à pied conservatoire,
65,61 € au titre des congés payés afférents,
2.000 € au titre de 59 € au titre de <span class="highlight_underline">l'indemnité légale de licenciement</span>
2014,7 967, 59 € au titre de <span class="highlight_underline">l'indemnité légale de licenciement</span>
rappel de salaires de janvier 2007 au 7 mars 2007 3.708,34 €
SECTION B N° 419 425 426 427 428 429 430 432 433 434 436 441 442 443 444 446 467 571 572
</code></pre>
<p>I came up with this:</p>
<pre><code>(\d.+\d+)(?:\s(?:euros?|€))
</code></pre>
<p>But it is not as accurate as it should be.</p>
<p>Can anyone help me?</p>
<p>Edit:</p>
<p>@Wiktor Stribiżew gave me:</p>
<pre><code>(\d[\d.\s,]*)(?:\s(?:euro|€))
</code></pre>
<p>This is close, but with the following example:</p>
<pre><code>2014,7 967, 59 €
</code></pre>
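<p>it also captures <code>2014,</code>,</p>
<p>and with <code>49715 11000158926 101,30 €</code></p>
<p>it also captures <code>49715 11000158926</code>. Digit groups should be limited to three digits each.</p>
<p>And with <code>2007 3.708,34 €</code></p>
<p>it should not capture <code>2007</code> either.</p>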
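<p>The three problem cases above can be reproduced with a short script. This is only a sketch: <code>LOOSE</code> is the pattern quoted above, while <code>STRICT</code> is a hypothetical tighter variant (an assumption, not something confirmed in this thread) that limits thousands groups to three digits and refuses to start a match in the middle of a longer number. It uses the standard-library <code>re</code> module.</p>

```python
import re

# Pattern quoted in the question (suggested by Wiktor Stribiżew).
LOOSE = re.compile(r"(\d[\d.\s,]*)(?:\s(?:euro|€))")

# Hypothetical tighter variant (an assumption, not from the original thread):
# - (?<!\d) prevents a match from starting inside a longer run of digits
# - thousands groups are exactly three digits, separated by a space or a dot
# - an optional decimal part follows a comma, with an optional space after it
STRICT = re.compile(
    r"(?<!\d)((?:\d{1,3}(?:[ .]\d{3})+|\d+)(?:,\s?\d{1,2})?)\s(?:euros?|€)"
)

examples = [
    "2014,7 967, 59 €",
    "49715 11000158926 101,30 €",
    "2007 3.708,34 €",
]

for text in examples:
    print(repr(text))
    print("  loose :", LOOSE.findall(text))
    print("  strict:", STRICT.findall(text))
```

<p>On these three inputs the loose pattern returns the whole digit run, while the stricter sketch returns only <code>7 967, 59</code>, <code>101,30</code> and <code>3.708,34</code>. It has not been checked against every line of the sample data.</p>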
<p>Edit 2:</p>
<p>Thanks for your answers, but it does not seem to work in my Python script:</p>
<pre><code>import regex
sentences_pd = pd.read_csv('sampled_amounts.csv', names=["text"])
sentences_pd.head()
print([(regex.findall("\b((?:\d+|\d{1,3}(?:[,.\s]\d{3})*)(?:[,.\s]*\d+)?)\s(?:euros?|€)", x)) for x in sentences_pd['text']])
</code></pre>
<p>The text column looks like this:</p>
<p><a href="https://i.stack.imgur.com/nRVsy.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/nRVsy.png" alt="enter image description here"/></a></p>
<p>It gives me a list of empty lists:</p>
<pre><code>[[], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], [], 
[], [], []]
</code></pre>
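<p>One likely cause of the empty results is that the pattern is written as a normal string literal: Python turns <code>"\b"</code> into a backspace character before the regex engine ever sees it, so the pattern looks for a literal backspace instead of a word boundary. The sketch below illustrates this with the standard-library <code>re</code> module and two sample lines from the question. The assumption is that the third-party <code>regex</code> module behaves the same way here, since the escape is processed by Python itself.</p>

```python
import re

# In a normal string literal "\b" is the backspace character (0x08);
# in a raw string it stays backslash + "b", which the regex engine
# interprets as a word boundary.
assert "\b" == "\x08"
assert r"\b" == "\\b"

# The pattern from the script above, written as a raw string.
AMOUNT_RE = re.compile(
    r"\b((?:\d+|\d{1,3}(?:[,.\s]\d{3})*)(?:[,.\s]*\d+)?)\s(?:euros?|€)"
)

# What the non-raw literal effectively contains: the same pattern,
# but starting with a backspace character instead of a word boundary.
broken_pattern = AMOUNT_RE.pattern.replace(r"\b", "\b", 1)

samples = [
    "- 1098 euros au titre du paiement du DIF,",
    "65,61 € au titre des congés payés afférents,",
]

results = [AMOUNT_RE.findall(s) for s in samples]
broken_results = [re.findall(broken_pattern, s) for s in samples]

print(results)         # matches found with the raw-string pattern
print(broken_results)  # empty lists, as in the output above
```

<p>If this is the cause, prefixing the pattern in the script with <code>r</code> (that is, <code>r"\b(...)"</code>) should be enough.</p>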