Import tables from multiple URLs to create a single DataFrame and CSV file

producturls = ['https://www.interactivebrokers.com/en/index.php?f=2222&exch=ecbot&showcategories=FUTGRP',
               'https://www.interactivebrokers.com/en/index.php?f=2222&exch=cfe&showcategories=FUTGRP',
               'https://www.interactivebrokers.com/en/index.php?f=2222&exch=dtb&showcategories=FUTGRP&p=&cc=&limit=100&page=2'
               ]

dfmaster = []

for url in producturls:
    table = pd.read_html(url, index_col=None, header=None,)
    df = table[2]
    for item in df:
        if item not in dfmaster:
            dfmaster.append(item)

print(dfmaster)
dfmaster.to_csv('IB_tickers.csv')

1 answer

User

#1 · Posted on 2024-09-24 06:21:10

This should work for you:

import pandas as pd
from tabulate import tabulate

producturls = ['https://www.interactivebrokers.com/en/index.php?f=2222&exch=ecbot&showcategories=FUTGRP',
               'https://www.interactivebrokers.com/en/index.php?f=2222&exch=cfe&showcategories=FUTGRP',
               'https://www.interactivebrokers.com/en/index.php?f=2222&exch=dtb&showcategories=FUTGRP&p=&cc=&limit=100&page=2'
               ]

# Collect one DataFrame per URL instead of appending items to a plain list
df_list = []

for url in producturls:
    # read_html returns a list of every table it can parse from the page
    table = pd.read_html(url, index_col=None, header=None)
    # Keep the third table from each page, as in the question
    df = table[2]
    df_list.append(df)

# Stack the per-page tables into a single DataFrame
dfmaster = pd.concat(df_list, sort=False)
# Drop rows that appear more than once, then renumber the index from 0
dfmaster = dfmaster.drop_duplicates().reset_index(drop=True)
print(tabulate(dfmaster.head(), headers='keys'))
dfmaster.to_csv('IB_tickers.csv')

Result:

    IB Symbol    Product Description                                      Symbol    Currency
                                         (click link for more details)
        -                             -             
 0  AC           Ethanol -CME                                             EH        USD
 1  AIGCI        Bloomberg Commodity Index                                AW        USD
 2  B1U          30-Year Deliverable Interest Rate Swap Futures           B1U       USD
 3  DJUSRE       Dow Jones US Real Estate Index                           RX        USD
 4  F1U          5-Year Deliverable Interest Rate Swap Futures            F1U       USD
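
For context on why the loop in the question did not work: iterating over a DataFrame yields its column labels rather than its rows, and a plain Python list has no to_csv method. The sketch below tries to show both the pitfall and the concat / drop_duplicates step without any network access. The two small frames (page_one, page_two) are hypothetical stand-ins for what pd.read_html would return, and their rows are illustrative only, not scraped data.

```python
import pandas as pd

# Hypothetical stand-ins for the tables pd.read_html would return;
# the rows are illustrative, not real scraped data.
page_one = pd.DataFrame({"IB Symbol": ["AC", "AIGCI"], "Currency": ["USD", "USD"]})
page_two = pd.DataFrame({"IB Symbol": ["AIGCI", "B1U"], "Currency": ["USD", "USD"]})

# Iterating over a DataFrame yields column labels, not rows, so the
# loop in the question only ever collected header names.
column_labels = [item for item in page_one]

# Stack the tables, drop exact duplicate rows, and renumber the index.
combined = pd.concat([page_one, page_two], sort=False)
combined = combined.drop_duplicates().reset_index(drop=True)

print(column_labels)
print(combined)
```

If the row index is not wanted as the first column of the CSV, to_csv accepts index=False; the answer above keeps the default, which writes it.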
