在Python中从外部文本文件读取多行

os.chdir(r'M:\Project\Count') def main(): infile = open("module3.txt","r") uppercasecount = 0 lowercasecount = 0 digitcount = 0 spacecount = 0 for character in infile.readlines(): if character.isupper() == True: uppercasecount += 1 if character.islower() == True: lowercasecount += 1 if character.isdigit() == True: digitcount += 1 if character.isspace() == True: spacecount += 1 print ("Total count is %d Upper case, %d Lower case, %d Digit(s) and %d spaces." %(uppercasecount, lowercasecount, digitcount, spacecount)) main()

2条回答

网友

1楼 · 编辑于 2024-09-27 09:30:05

如果您只想检查文件中包含的字符类型，我不会使用readlines，而是使用常规的read。在

STEP_BYTES = 1024

def main():
    infile = open("module3.txt","r")
    uppercasecount = 0
    lowercasecount = 0
    digitcount = 0
    spacecount = 0
    data = infile.read(STEP_BYTES)
    while data:
        for character in data:
            if character.isupper() == True:
                uppercasecount += 1
            if character.islower() == True:
                lowercasecount += 1
            if character.isdigit() == True:
                digitcount += 1
            if character.isspace() == True:
                spacecount += 1
        data = infile.read(STEP_BYTES)

    print ("Total count is %d Upper case, %d Lower case, %d Digit(s) and %d spaces." %(uppercasecount, lowercasecount, digitcount, spacecount))

main()

如果您真的需要使用readlines，请记住，该方法将读取文件的所有行，并将它们放入内存中（对于非常大的文件，则不太适合）。在

例如，假设您的module3.txt文件包含：

^{pr2}$

使用readlines()将返回：

['this Is a TEST\n', 'and this is another line']

记住这一点，您可以使用双for循环遍历文件内容：

def main():
    infile = open("module3.txt","r")
    uppercasecount = 0
    lowercasecount = 0
    digitcount = 0
    spacecount = 0
    lines = infile.readlines()
    for line in lines:
        for character in line:
            if character.isupper() == True:
                uppercasecount += 1
            if character.islower() == True:
                lowercasecount += 1
            if character.isdigit() == True:
                digitcount += 1
            if character.isspace() == True:
                spacecount += 1
    print ("Total count is %d Upper case, %d Lower case, %d Digit(s) and %d spaces." %(uppercasecount, lowercasecount, digitcount, spacecount))

main()

至于目录问题，如果代码和文本文件（module3.txt）将在同一目录中提供，则不需要执行chdir。默认情况下，脚本的工作目录是脚本所在的目录。在

假设您将其发送到一个目录中，例如：

  |-> Count
     |-> script.py
     |-> module3.txt

您只需使用相对路径从script.py中打开module3.txt：行open("module3.txt", "r")将使用脚本运行的目录查找名为module3.txt的文件（即Count\）。您不需要调用os.chdir。如果您还想确定，可以chdir到脚本所在的目录（查看this）：

知道了这一点，请将您的硬编码chdir行（文件顶部的os.chdir(r'M:\Project\Count')）更改为：

print "Changing current directory to %s" % os.path.dirname(os.path.realpath(__file__))
os.chdir(os.path.dirname(os.path.realpath(__file__)))

网友

2楼 · 编辑于 2024-09-27 09:30:05

您可以使用两种形式iter一次读取任意数量的字节，使用itertools.chain将它们视为一个长输入。您可以使用str方法作为collections.Counter的键，而不是跟踪几个变量，例如：

from collections import Counter
from itertools import chain

counts = Counter()
with open('yourfile') as fin:
    chars = chain.from_iterable(iter(lambda: fin.read(4096), ''))
    for ch in chars:
        for fn in (str.isupper, str.islower, str.isdigit, str.isspace):
            counts[fn] += fn(ch)

#Counter({<method 'islower' of 'str' objects>: 39, <method 'isspace' of 'str' objects>: 10, <method 'isdigit' of 'str' objects>: 0, <method 'isupper' of 'str' objects>: 0})

那么counts[str.lower]会给你39例如。。。在

相关问题更多 >

编程相关推荐

热门问题

热门文章