如果Dash应用程序被导入的大量数据减慢了运行速度，如何让它运行得更快

import dash import dash_core_components as dcc import dash_html_components as html import pandas as pd from redcap import Project import pandas as pd #redcap api and key api_url = "enter link" api_key = "enter key" project = Project(api_url, api_key) #call data from redcap def data(): df = project.export_records(format="df", df_kwargs={"index_col": project.field_names[1]}) return df df = data() #generate table def generate_table(dataframe, max_rows=10): return html.Table( # Header [html.Tr([html.Th(col) for col in dataframe.columns])] + # Body [html.Tr([ html.Td(dataframe.iloc[i][col]) for col in dataframe.columns ]) for i in range(min(len(dataframe), max_rows))] ) external_stylesheets = ['https://codepen.io/chriddyp/pen/bWLwgP.css'] app = dash.Dash(__name__, external_stylesheets=external_stylesheets) app.layout = html.Div(children=[ html.H4(children='US Agriculture Exports (2011)'), generate_table(df) ]) if __name__ == '__main__': app.run_server(debug=True)

1条回答

网友

1楼 · 发布于 2024-09-27 22:21:46

这里有几件事：

1）用project.export_records从redcap导出数据可能是不必要的步骤。我不能百分之百确定您正在使用的数据结构，但我建议将对象转换为pandas数据帧–pandas处理结构化数据的速度非常快。你知道吗

2）假设您不打算显示所有数据，我建议将数据帧的大小限制为所需的最小大小。你知道吗

3）为数据帧生成html的计算量很大，而且有点循环，依赖于索引。我将对那里的代码做以下更改：

# Generating the Body (Slightly more readable and a lot less loopy & indexy)
html_all_rows = []
for idx, row in dataframe[:max_rows].iterrows():
   html_row = html.Tr([html.Td(v) for v in row])
   html_all_rows.append(html_row)

4）或者，我建议使用Plotly的内置datatable。它是一个比典型的表更具交互性的对象+它允许非常整洁的排序和查询。datatable的数据输入是一个类似json的字典，因此一旦数据被访问，速度就会提高。你知道吗

5）同样，我建议只向应用程序加载所需的数据。我无法想象350个字段对任何人都有用——同样，250000行。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章