回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>我试图在多个网站上获取利率。数据是非常非结构化的,但形式上足够接近。我想捕捉的是:</p>
<p><strong>x.xx%至xx.xx%</strong></p>
<p>数据外观示例:</p>
<p><em>联邦存款保险公司成员WebBank提供的所有贷款。您的实际利率取决于信用评分、贷款金额、贷款期限和信用使用和历史记录。APR的变化范围为5.98%~35.89%</strong>。例如,您可以获得6000美元的贷款,利率为7.99%,4月利率为11.51%,5.00%的贷款手续费为300美元。在本例中,您将收到5700美元,并将每月支付36美元187.99美元。应偿还总额为6767.64美元。您的APR将根据您在申请时的信用额度确定。发起费从1%到6%不等,截至2017年第一季度,平均发起费为5.49%。没有首付款,也从来没有提前还款罚款。贷款的结清取决于您是否同意www.lendingclub.com网站。通过LendingClub提供的所有贷款的最低还款期限为36个月或更长。</em></p>
<p><strong>3.09%–14.24%*</strong></p>
<p><em>固定费率:
<strong>6.99%至24.99%APR</strong>
锁定你的费率。你每月的付款永远不会改变。</em></p>
<p>我已经把我想捕捉的东西加粗了。我当前的正则表达式如下所示:</p>
<pre><code>(re.findall('(?i)(\d\.\d\d% (?:to|-) \d\d\.\d\d%)
</code></pre>
<p>实际报价如下:</p>
^{pr2}$
<p>新输出:</p>
<pre><code>['5.98% to 35.89%', '2018-06-22', 'https://www.lendingclub.com/loans/personal-loans']
['2018-06-22', 'https://www.lendingclub.com/loans/personal-loans']
['6.99% to 24.99%', '6.99% to 24.99%', '6.99% to 24.99%', '6.99% to 24.99%', '2018-06-22', 'https://www.marcus.com/us/en/personal-loans']
['2018-06-22', 'https://www.marcus.com/us/en/personal-loans']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['6.99% to 24.99%', '2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.discover.com/personal-loans/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.lightstream.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
['2018-06-22', 'https://www.prosper.com/']
</code></pre>