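<p>Assuming the other course text blocks share the same structure as the one shown, you may want to use a <strong>regular expression</strong> to extract the course name and course number:</p>
<p>I further assume that every text block contains the phrase "XX hours lecture" in the same position, where "XX" stands for any kind of number (the duration). In the variable <code>match_re</code> I define a regular expression that matches only up to that "XX" point. By using <code>match.group(2)</code> I restrict the result to the innermost parentheses.</p>
<p>The expression below may still be incomplete, since I do not know your whole text file.</p>
<p>Below I extract the string: <em>WELD 190 Manufacturing I Introduction to MasterCAM (3)</em></p>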
<pre><code>import re

string = "WELD 190 Manufacturing I Introduction to MasterCAM (3) 1½ hours lecture - 4½ hours laboratory Note: Cross listed as DT 190/ENGR 190/IT 190 This course will introduce the students to MasterCAM and 2D and basic 3D modeling. Students will receive instructions and drawings of parts requiring 2- or 3-axis machining. Students will design, model, program, set-up and run their parts on various machines, including plasma cutters, water jet cutters and milling machines. WELD 197 Welding Technology Topics (.5 - 3)"

# Group 2 (the inner parentheses) captures everything before the digit
# that precedes " hours lecture"; a raw string keeps \d from being
# treated as a string escape.
match_re = r"(^(.*)\d.* hours lecture)"
match = re.search(match_re, string)
if match:
    # The captured text ends with a space, so strip it before printing.
    print(match.group(2).strip())
else:
    print("No match")
</code></pre>
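<p>If you need the course code, number, title and units as separate fields, named groups may be more convenient. The following is only a minimal sketch: the pattern <code>COURSE_RE</code> and the helper <code>parse_course_header</code> are hypothetical names of mine, and the sketch assumes every block starts with an upper-case department code, a course number, a title, and the units in parentheses, as in the sample above.</p>

```python
import re

# Hypothetical pattern (an assumption, not taken from your file):
# department code, course number, title, then the units in parentheses.
COURSE_RE = re.compile(
    r"(?P<dept>[A-Z]{2,})\s+"      # e.g. WELD
    r"(?P<number>\d+[A-Z]?)\s+"    # e.g. 190
    r"(?P<title>.+?)\s*"           # shortest title up to the first "("
    r"\((?P<units>[^)]*)\)"        # e.g. (3) or (.5 - 3)
)


def parse_course_header(block):
    """Return the header fields of one course block as a dict, or None."""
    # match() anchors at the start of the block, so a cross-listing such as
    # "IT 190" later in the description is not mistaken for a header.
    m = COURSE_RE.match(block)
    return m.groupdict() if m else None


sample = ("WELD 190 Manufacturing I Introduction to MasterCAM (3) "
          "1½ hours lecture - 4½ hours laboratory")
header = parse_course_header(sample)
print(header)
```

<p>This only parses a block that begins with its own header, so you would still have to split the file into blocks first. Running the pattern over the whole text with <code>finditer</code> could pick up cross-listed codes inside a description.</p>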