擅长:python、mysql、java
<pre><code>import re
with open('/Users/Dev/Sometext.txt','r') as f:
tokens = re.findall(r'[a-z][A-Z]{3}([a-z])[A-Z]{3}[a-z]', f.read())
for token ins tokens:
print token
</code></pre>
<p><code>findall</code>做什么:</p>
<blockquote>
<p>Return all non-overlapping matches of
pattern in string, as a list of
strings. The string is scanned
left-to-right, and matches are
returned in the order found. If one or
more groups are present in the
pattern, return a list of groups; this
will be a list of tuples if the
pattern has more than one group. Empty
matches are included in the result
unless they touch the beginning of
another match.</p>
</blockquote>
<p>可能是<code>re</code>模块中最有用的函数。在</p>
<p>函数的作用是:将整个文件读入大字符串。如果需要将正则表达式与整个文件相匹配,这一点尤其有用。在</p>
<p><strong>警告</strong>:根据文件的大小,您可能喜欢像第一种方法那样逐行迭代文件。在</p>