A function to reduce redundancy when reading from an sqlite3 database

import sqlite3

def read_MaltaData():
    conn = sqlite3.connect('FinalProjectDatabase.sqlite3')
    Malta = conn.cursor()
    Malta.execute("SELECT * FROM MaltaData WHERE AirPollutant = 'PM10'")
    result = Malta.fetchall()
    print(result)

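3 Answers

User

Answer #1 · edited 2024-09-27 21:23:56

If you pass the country name as a parameter to the data retrieval function, you can generate the table name dynamically (note the f-string arguments in execute and print):

First draft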

import sqlite3

def print_CountryData(country):
    conn = sqlite3.connect('FinalProjectDatabase.sqlite3')
    cur = conn.cursor()
    cur.execute(f"SELECT SUM(AirPollutionLevel) FROM {country}Data WHERE AirPollutant = 'PM10'")
    sumVal = cur.fetchone()[0]
    print(f"{country} {sumVal}")

# example call:
for country in ('France', 'Germany', 'Italy', 'Malta', 'Poland'):
    print_CountryData(country)

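The sqlite3 documentation discourages building query strings yourself with simple string functions, for security reasons. In this case, where you have full control over the actual arguments, I consider it safe.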
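If the country name could ever come from outside the program, one way to keep the dynamic table name safe is to check it against a fixed list before interpolating, and to bind the pollutant as a real query parameter. The following is only a sketch: the function name sum_pollution, the ALLOWED_COUNTRIES tuple, and the demo values are assumptions made for illustration, not part of the original answer.

```python
import sqlite3

# Assumption: the valid countries are known up front (list taken from the answer above).
ALLOWED_COUNTRIES = ('France', 'Germany', 'Italy', 'Malta', 'Poland')

def sum_pollution(conn, country, pollutant='PM10'):
    """Return the summed AirPollutionLevel for one country and pollutant."""
    # Table names cannot be bound as parameters, so validate before interpolating.
    if country not in ALLOWED_COUNTRIES:
        raise ValueError(f"unknown country: {country!r}")
    cur = conn.execute(
        f"SELECT SUM(AirPollutionLevel) FROM {country}Data WHERE AirPollutant = ?",
        (pollutant,),  # the value is bound by sqlite3, not pasted into the string
    )
    return cur.fetchone()[0]

# Tiny in-memory demo with made-up values
conn = sqlite3.connect(':memory:')
conn.execute("CREATE TABLE MaltaData (AirPollutant text, AirPollutionLevel real)")
conn.executemany("INSERT INTO MaltaData VALUES (?, ?)",
                 [('PM10', 1.5), ('PM10', 2.5), ('O3', 10.0)])
malta_sum = sum_pollution(conn, 'Malta')
print(malta_sum)
```

Only the table name is interpolated here, and only after it has passed the check; everything else goes through the ? placeholder.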

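This answer adapts the summation from the great answer given by forpas, but chooses not to move the repetition into SQL. It also shows the integration with Python and the output formatting.

MRE-style version

This is an improved version of my first answer, converted into a Minimal, Reproducible Example and combined with its output. It also includes some performance improvements, such as opening the database only once.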

import sqlite3
import random # to simulate actual pollution values

# Countries we have data for
countries = ('France', 'Germany', 'Italy', 'Malta', 'Poland')

# There is one table for each country
def tableName(country):
    return country+'Data'

# Generate minimal version of tables filled with random data
def setup_CountryData(cur):
    for country in countries:
        cur.execute(f'''CREATE TABLE {tableName(country)}
                (AirPollutant text, AirPollutionLevel real)''')
        for i in range(5):
            cur.execute(f"""INSERT INTO {tableName(country)} VALUES 
                    ('PM10', {100*random.random()})""")
                    
# Sum up pollution data for each country
def print_CountryData(cur):
    for country in countries:
        cur.execute(f"""SELECT SUM(AirPollutionLevel) FROM 
                {tableName(country)} WHERE AirPollutant = 'PM10'""")
        sumVal = cur.fetchone()[0]
        print(f"{country:10} {sumVal:9.5f}")

# For testing, we use an in-memory database
conn = sqlite3.connect(':memory:')
cur = conn.cursor()
setup_CountryData(cur)

# The functionality actually required
print_CountryData(cur)

Sample output:

France     263.79430
Germany    245.20942
Italy      225.72068
Malta      167.72690
Poland     290.64190
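Because the tables are filled with random.random(), the numbers above will differ on every run. If repeatable output matters for the example, one option is to seed a local random generator. This is a minimal sketch under that assumption; build_demo_db, pm10_sum, and the seed value 42 are hypothetical names and values chosen for illustration, and only a single table is created to keep it short.

```python
import random
import sqlite3

def build_demo_db(seed=42):
    """Create an in-memory table filled with seeded pseudo-random values."""
    rng = random.Random(seed)  # local generator; the seed value is arbitrary
    conn = sqlite3.connect(':memory:')
    conn.execute("CREATE TABLE MaltaData (AirPollutant text, AirPollutionLevel real)")
    conn.executemany("INSERT INTO MaltaData VALUES (?, ?)",
                     [('PM10', 100 * rng.random()) for _ in range(5)])
    return conn

def pm10_sum(conn):
    """Return the summed PM10 level from the demo table."""
    return conn.execute(
        "SELECT SUM(AirPollutionLevel) FROM MaltaData WHERE AirPollutant = 'PM10'"
    ).fetchone()[0]

# Two databases built from the same seed hold the same data
first = pm10_sum(build_demo_db())
second = pm10_sum(build_demo_db())
print(first == second)
```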

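Without actually trying it out, it is often hard to evaluate a solution. That is why askers on StackOverflow are constantly encouraged to post their questions this way: it makes it more likely that someone will understand the problem and solve it quickly.

User

Answer #2 · edited 2024-09-27 21:23:56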

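If the database is not too large, you can use pandas.

This approach is less efficient than using SQL queries directly, but it is a reasonable choice if you want to explore the data interactively in a notebook.

You can use pandas.read_sql_query to create a DataFrame from the SQLite database.

Then perform the computation with the pandas.DataFrame methods designed for this kind of task.

For your specific case: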

import sqlite3
import pandas as pd

db_file = 'FinalProjectDatabase.sqlite3'  # database file name from the question
conn = sqlite3.connect(db_file)

query = "SELECT * FROM MaltaData WHERE AirPollutant = 'PM10'"
df = pd.read_sql_query(query, conn)

# check dataframe content
print(df.head())

If I understand correctly, you then want to compute the sum of the values in a given column:

s = df['AirPollutionLevel'].sum()

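If there are missing values, you may want to fill them with 0 before summing (sum() already skips NaN by default, so this mainly makes the handling explicit):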

s = df['AirPollutionLevel'].fillna(0).sum()

User

Answer #3 · edited 2024-09-27 21:23:56

You can use UNION ALL to get one row per country:

SELECT 'France' country, SUM(AirPollutionLevel) [summation value] FROM FranceData WHERE AirPollutant = 'PM10'
UNION ALL
SELECT 'Germany' country, SUM(AirPollutionLevel) [summation value] FROM GermanyData WHERE AirPollutant = 'PM10'
UNION ALL
SELECT 'Italy' country, SUM(AirPollutionLevel) [summation value] FROM ItalyData WHERE AirPollutant = 'PM10'
UNION ALL
SELECT 'Malta' country, SUM(AirPollutionLevel) [summation value] FROM MaltaData WHERE AirPollutant = 'PM10'
UNION ALL
SELECT 'Poland' country, SUM(AirPollutionLevel) [summation value] FROM PolandData WHERE AirPollutant = 'PM10'
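The five SELECT statements differ only in the country name, so the query text itself can be generated from Python instead of being written out by hand. The sketch below assumes the column is named AirPollutionLevel as in the other answers, uses a plain alias summed_level in place of [summation value], and fills in-memory tables with made-up numbers; build_union_query is a hypothetical helper, not something from the original answer.

```python
import sqlite3

countries = ('France', 'Germany', 'Italy', 'Malta', 'Poland')

def build_union_query(countries):
    """Build one UNION ALL query with a SELECT per country table."""
    # Names come from a fixed tuple, not from user input
    parts = [
        f"SELECT '{c}' AS country, SUM(AirPollutionLevel) AS summed_level "
        f"FROM {c}Data WHERE AirPollutant = ?"
        for c in countries
    ]
    return "\nUNION ALL\n".join(parts)

# In-memory demo data (made-up values): country number i gets two PM10 rows of value i
conn = sqlite3.connect(':memory:')
for i, c in enumerate(countries, start=1):
    conn.execute(f"CREATE TABLE {c}Data (AirPollutant text, AirPollutionLevel real)")
    conn.executemany(f"INSERT INTO {c}Data VALUES (?, ?)",
                     [('PM10', float(i)), ('PM10', float(i)), ('O3', 99.0)])

query = build_union_query(countries)
# One bound value per '?' placeholder, i.e. one per country
rows = conn.execute(query, ('PM10',) * len(countries)).fetchall()
for country, total in rows:
    print(f"{country:10} {total:9.5f}")
```

This keeps the single round trip to the database that UNION ALL provides while removing the copy-and-paste from the SQL text.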
