<p>我可以用下面的方法来解决这个问题,它与您想要的字母匹配,并且可以处理文本中的括号(不是引用):</p>
<ol>
<li><p>寻找开头<code>\(</code></p></li>
<li><p>查找<code>[^()]+(?:\([^()]+\))?</code>的重复,即一个或多个非圆括号字符,后跟一对可选的<code>( )</code>,其中一个或多个非圆括号字符。</p></li>
<li><p>寻找结尾<code>\nGoogle Scholar\)</code></p></li>
<li><p>拆分并与空格合并以删除多个空格</p></li>
</ol>
<p>代码:</p>
<pre><code>import re
text = 'Introduction The endocrine cells in the pancreatic islets of Langerhans secrete insulin and glucagon in response to glucose perturbations to maintain glucose homeostasis. The insulin-secreting beta cells exhibit morphological, functional, and molecular variations, suggesting that they may consist of sub-populations with specialized tasks and physiological responses (Gutierrez etal., 2017Gutierrez G.D. Gromada J. Sussel L. Heterogeneity of the pancreatic beta cell.Front. Genet. 2017; 8: 22Crossref\nPubMed\nScopus (11)\nGoogle Scholar, Roscioni etal., 2016Roscioni S.S. Migliorini A. Gegg M. Lickert H. Impact of islet architecture on -cell heterogeneity, plasticity and function.Nat. Rev. Endocrinol. 2016; 12: 695-709Crossref\nPubMed\nScopus (36)\nGoogle Scholar). Features of beta cell heterogeneity include glucose responsiveness and secretory activity ..... Visualizing transcripts in the pancreas, however, has been infeasible without the use of specialized techniques such as photoswitchable dyes (Cui etal., 2018Cui Y. Hu D. Markillie L.M. Chrisler W.B. Gaffrey M.J. Ansong C. Sussel L. Orr G. Fluctuation localization imaging-based fluorescence insitu hybridization (fliFISH) for accurate detection and counting of RNA copies in single cells.Nucleic Acids Res. 2018; 46: e7Crossref\nPubMed\nScopus (2)\nGoogle Scholar). We have optimized the standard tissue smFISH protocol (Lyubimova etal., 2013Lyubimova A. Itzkovitz S. Junker J.P. Fan Z.P. Wu X. van Oudenaarden A. Single-molecule mRNA detection and counting in mammalian tissue.Nat. Protoc. 2013; 8: 1743-1758Crossref\nPubMed\nScopus (62)\nGoogle Scholar) by substantially increasing the period of mRNA denaturation, which precedes the probe hybridization steps, from 5min to at least 3hr.'
fixed_text = ' '.join(re.sub(r'\((?:[^()]+(?:\([^()]+\))?)+\nGoogle Scholar\)', '', text).split())
print(fixed_text)
</code></pre>
<p>输出:</p>
<blockquote>
<p>Introduction The endocrine cells in the pancreatic islets of
Langerhans secrete insulin and glucagon in response to glucose
perturbations to maintain glucose homeostasis. The insulin-secreting
beta cells exhibit morphological, functional, and molecular
variations, suggesting that they may consist of sub-populations with
specialized tasks and physiological responses . Features of beta cell
heterogeneity include glucose responsiveness and secretory activity
..... Visualizing transcripts in the pancreas, however, has been
infeasible without the use of specialized techniques such as
photoswitchable dyes . We have optimized the standard tissue smFISH
protocol by substantially increasing the period of mRNA denaturation,
which precedes the probe hybridization steps, from 5min to at least
3hr.</p>
</blockquote>
<p>可以通过更改以下代码来进行改进,该代码还删除了前导<code>\(</code>之前的空格,但它与所需的输出不匹配(该代码存在缺陷):</p>
<pre><code>fixed_text = re.sub(r' ?\((?:[^()]+(?:\([^()]+\))?)+\nGoogle Scholar\)', '', string)
</code></pre>
<blockquote>
<p>Introduction The endocrine cells in the pancreatic islets of
Langerhans secrete insulin and glucagon in response to glucose
perturbations to maintain glucose homeostasis. The insulin-secreting
beta cells exhibit morphological, functional, and molecular
variations, suggesting that they may consist of sub-populations with
specialized tasks and physiological responses. Features of beta cell
heterogeneity include glucose responsiveness and secretory activity
..... Visualizing transcripts in the pancreas, however, has been
infeasible without the use of specialized techniques such as
photoswitchable dyes. We have optimized the standard tissue smFISH
protocol by substantially increasing the period of mRNA denaturation,
which precedes the probe hybridization steps, from 5min to at least
3hr.</p>
</blockquote>