需要解决从网站下载xlsx的问题

# packages import pandas as pd url = 'https://www.ssga.com/us/en/institutional/etfs/library-content/products/fund-data/etfs/us/holdings-daily-us-en-spy.xlsx' # Load the first sheet of the Excel file into a data frame df = pd.read_excel(url, sheet_name=0, header=1) # View the first ten rows df.head(10) #is it worth it to download file to a repisotory, convert to xls, then read in?

1条回答

网友

1楼 · 发布于 2024-09-28 22:01:02

您始终可以通过请求发出请求，然后将xlsx读入数据帧，如下所示：

import pandas as pd
import requests

from io import BytesIO

url = ("https://www.ssga.com/us/en/institutional/etfs/library-content/"
       "products/fund-data/etfs/us/holdings-daily-us-en-spy.xlsx")

r = requests.get(url)
bts = BytesIO(r.content)
df = pd.read_excel(bts)

我不确定是否存在安全问题，但这相当于在浏览器中发出相同的请求。至于动态url，如果您能够确定url的哪些部分正在更改，您可以按如下方式对其进行修改

stock = 'spy'
url = ("https://www.ssga.com/us/en/institutional/etfs/library-content/"
       f"products/fund-data/etfs/us/holdings-daily-us-en-{stock}.xlsx")

相关问题更多 >

编程相关推荐

热门问题

热门文章