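<p>I'm not much of a programmer. For a class assignment, I have to extract the words from a text document, count them, and sort them. I'm having trouble removing punctuation and converting uppercase letters to lowercase. Any guidance would be greatly appreciated.</p>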
<pre><code>docwords={}
doc=raw_input("Please enter the name of a text file: ")
docread=open(doc, 'r')
doclist=[]
def main():
    for x in docread:
        words = x.split()
        for word in words:
            doclist.append(word)
def wordcount():
    main()
    for counter in doclist:
        docwords[counter] = docwords.get(counter,0) + 1
wordcount()
docread.close()
for p in sorted(docwords):
    print p, "-->", docwords[p]
</code></pre>
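<p><strong>Removing punctuation</strong></p>
<p>One option is the <code>re.sub</code> function from the regular expression module. In this example, I remove every character that is not a word character (a letter, a digit, or an underscore) or a space.</p>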
<pre><code>import re
s = "It's ok"
# Drop every character that is not a word character or a space
print(re.sub(r'[^\w ]', '', s))
# Output: Its ok
</code></pre>
<p><strong>Converting to lowercase</strong></p>
<p>String objects have a straightforward method for this: <code>lower()</code>.</p>
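<p>As a rough sketch, the two steps could be combined with the counting loop from the question along the following lines. The names <code>count_words</code> and <code>sample</code> are made up for illustration and do not come from the original post. The pattern here is <code>[^\w\s]</code> rather than <code>[^\w ]</code>, on the assumption that tabs and newlines should still separate words. The code is written so that it should run on both Python 2 and Python 3.</p>
<pre><code>import re

def count_words(lines):
    """Count words case-insensitively, ignoring punctuation (hypothetical helper)."""
    counts = {}
    for line in lines:
        # Lowercase the line, then drop everything except word characters and whitespace
        cleaned = re.sub(r'[^\w\s]', '', line.lower())
        for word in cleaned.split():
            counts[word] = counts.get(word, 0) + 1
    return counts

# Stand-in for the lines of the text file
sample = ["It's OK.", "Its ok, it's fine!"]
counts = count_words(sample)
for word in sorted(counts):
    print("%s: %d" % (word, counts[word]))
# Output:
# fine: 1
# its: 3
# ok: 2
</code></pre>
<p>An open file object can be passed in place of <code>sample</code>, since iterating over a file yields its lines one at a time.</p>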