如何使用csv模块统计发生率和计算评分？

import csv def average_rating(csvfile, ID): with open(csvfile) as f: file = csv.reader(f) total = 0 total1 = 0 total2 = 0 for rows in file: for items in ID: if rows[0] == items[0]: total = total + int(rows[3]) for ratings in total: total1 = total1 + int(ratings) total2 = total2 + 1 return total1 / total2

2条回答

网友

1楼 · 编辑于 2024-10-17 06:18:37

您可以使用pandas DataFrame来实现这一点。在

import pandas as pd
df = pd.read_csv('filename.csv')
total_sum = df[df['YouTubeID'] == 'RH5Ta6iHhCQ'].rating.sum()
n_rating = len(df[df['YouTubeID'] == 'RH5Ta6iHhCQ'].rating)
average = total_sum/n_rating

网友

2楼 · 编辑于 2024-10-17 06:18:37

有一些令人困惑的事情，我认为重命名变量和重构将是一个明智的决定。如果一个函数负责获取某个特定youtube id的所有行，而另一个函数则用于计算平均值，这甚至会使事情变得更加明显。在

def average_rating(csvfile, id):
    '''
    Calculate the average rating of a youtube video

    params: - csvfile: the location of the source rating file
            - id: the id of the video we want the average rating of
    '''
    total_ratings = 0
    count = 0
    with open(csvfile) as f:
        file = csv.reader(f)
        for rating in file:
            if rating[0] == id:
                count += 1
                total_ratings += rating[3]
    if count == 0:
        return 0
    return total_ratings / count

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何使用csv模块统计发生率和计算评分？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >