Calculating the frequency of letters in a sentence

import string

def getfreq(lines):
    """Calculate a list with letter frequencies.

    lines - list of lines (character strings)

    Both lower and upper case characters are counted.
    """
    totals = 26 * [0]
    for line in lines:
        for ch in line:
            if ch in string.ascii_letters:
                # map 'a'/'A' .. 'z'/'Z' onto indexes 0 .. 25
                totals[ord(ch.lower()) - ord('a')] += 1

    # convert totals to frequency
    freqlst = []
    grandtotal = sum(totals)
    for total in totals:
        # guard against division by zero when there are no letters
        freq = total / grandtotal if grandtotal else 0.0
        freqlst.append(freq)
    return freqlst

3 Answers

User

#1 · Edited 2024-09-29 23:23:50

Without collections.Counter:

import collections

sentence = "A long sentence may contain repeated letters"

count = collections.defaultdict(int)  # save some time with a dictionary factory
for letter in sentence:  # iterate over each character in the sentence
    count[letter] += 1  # increase the count for this character

Or, if you really want to do it completely by hand:

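count = {}
for letter in sentence:
    if letter not in count:  # first time this character is seen
        count[letter] = 0
    count[letter] += 1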

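In both cases, the count dictionary will have each distinct character in the sentence (spaces included) as a key, and its value will be the number of times that character was encountered, for example: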

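{'A': 1, ' ': 6, 'l': 2, 'o': 2, 'n': 5, 'g': 1, 's': 2, 'e': 8, 't': 5, 'c': 2, 'm': 1, 'a': 3, 'y': 1, 'i': 1, 'r': 2, 'p': 1, 'd': 1}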

If you want it to be case-insensitive, make sure to call letter.lower() when adding the letter to the count.
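As a minimal sketch of the case-insensitive variant, the loop below lowercases each character before counting it. The isalpha() filter, which skips spaces and punctuation, is an added assumption and not part of the answer itself.

```python
import collections

sentence = "A long sentence may contain repeated letters"

count = collections.defaultdict(int)
for letter in sentence:
    if letter.isalpha():            # assumption: count letters only
        count[letter.lower()] += 1  # fold 'A' and 'a' into one key

print(dict(count))
```

With this change 'A' and 'a' share the key 'a', so the dictionary has no separate upper-case entries.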

User

#2 · Edited 2024-09-29 23:23:50

The collections module has a very handy class, Counter, which counts the frequency of objects in a sequence:

import collections
collections.Counter('A long sentence may contain repeated letters')

This produces:

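Counter({'e': 8, ' ': 6, 'n': 5, 't': 5, 'a': 3, 'l': 2, 'o': 2, 's': 2, 'c': 2, 'r': 2, 'A': 1, 'g': 1, 'm': 1, 'y': 1, 'i': 1, 'p': 1, 'd': 1})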

In your case you will probably want to join your lines first, for example with ''.join(lines), before passing them to Counter.
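A rough sketch of that joining step, assuming a small hypothetical list of lines in place of the question's input:

```python
import collections

# Hypothetical sample input standing in for the question's `lines`
lines = ["A long sentence", "may contain", "repeated letters"]

# Join without a separator so no extra characters are counted
counts = collections.Counter("".join(lines))

# most_common(n) returns the n highest counts as (item, count) pairs
print(counts.most_common(1))
print(counts["e"], counts[" "])
```

Joining with an empty string means only the spaces already inside each line are counted; joining with ' ' would add one extra space per line boundary.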

If you want a similar result using a plain dictionary, you could do something like this:

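text = 'A long sentence may contain repeated letters'
counts = {}
for char in text:
    counts[char] = counts.get(char, 0) + 1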

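Depending on your Python version this may be slower, but it uses the dict's .get() method to return either the existing count or a default value before incrementing the count for each character in the string.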

User

#3 · Edited 2024-09-29 23:23:50

You can use a set to reduce the text to its unique characters and then simply count each one:

text = ' '.join(lines)  # Create one long string
# Then create a set of all unique characters in the text
characters = {char for char in text if char.isalpha()}
statistics = {}         # Create a dictionary to hold the results
for char in characters: # Loop through unique characters
    statistics[char] = text.count(char) # and count them
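To tie this back to the question's goal of a list of letter frequencies, here is a rough sketch that turns such per-character counts into 26 relative frequencies for a to z. The helper name to_frequencies and the sample lines are hypothetical, and folding case with lower() is an assumption.

```python
import string

def to_frequencies(statistics):
    """Turn a {char: count} dict into 26 relative frequencies for a..z."""
    totals = [0] * 26
    for char, n in statistics.items():
        lowered = char.lower()
        if lowered in string.ascii_lowercase:  # ignores non-ASCII letters
            totals[ord(lowered) - ord("a")] += n
    grandtotal = sum(totals)
    if grandtotal == 0:
        return [0.0] * 26  # no letters at all: avoid dividing by zero
    return [total / grandtotal for total in totals]

# Hypothetical sample input standing in for the question's `lines`
lines = ["A long sentence", "may contain repeated letters"]
text = " ".join(lines)
statistics = {char: text.count(char) for char in set(text) if char.isalpha()}

freqs = to_frequencies(statistics)
print(round(freqs[4], 3))  # share of the letter 'e'
```

Each call to text.count() scans the whole text once, so this approach does one pass per distinct character rather than a single pass overall.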
