将空字符串附加到forloop列表中的最后一个值

test = ['https://www.latlong.net/location/10-things-i-hate-about-you-locations-250', 'https://www.latlong.net/location/12-angry-men-locations-818', 'https://www.latlong.net/location/12-monkeys-locations-501'] for i in range(0, len(test), 1): r = requests.get(test[i]) testone = {'location name':[],'film':[]} soup = BeautifulSoup(r.content, 'lxml') for th in soup.select("td"): testone['location name'].append(th.text.strip()) testone['location name'].append('') for h in soup.select_one("h3"): testone['film'].append(h)

'location name': ["1117 Broadway (Gil's Music Shop)", '', '47.252495', '', '-122.439644', '', "2715 North Junett St (Kat and Bianca's House)", '', '47.272591', '', '-122.474480', ....

'location name': ["1117 Broadway (Gil's Music Shop)", '47.252495', '-122.439644', "2715 North Junett St (Kat and Bianca's House)", '47.272591', '-122.474480', 'Aurora Bridge', '47.646713', '-122.347435', 'Buckaroo Tavern (closed)', '47.657841', '-122.350327', 'Century Ballroom', '47.615028', '-122.319855', 'Fremont Place Books (closed)', '47.650452', '-122.350510', 'Fremont Troll', '47.651093', '-122.347435', 'Gas Works Park', '47.645561', '-122.334496', 'Kerry Park', '47.629402', '-122.360008', 'Kingdome', '47.595993', '-122.333649', 'Paramount Theatre', '47.613235', '-122.331451', 'Seattle', '47.601871', '-122.341248', 'Stadium High School', '47.265991', '-122.448570', 'Tacoma', '47.250828', '-122.449135', '', 'New York City', '40.742298', '-73.982559', 'New York County Courthouse', '40.714310', '-74.001930', '', ................], 'film': ['10 Things I Hate About You Locations Map','12 Angry Men Locations Map'...]}

2条回答

网友

1楼 · 编辑于 2024-10-04 09:24:08

问题是您在每个表后面追加了一个空字符串'' 你读的是手机。这样，由于位置名称、经度和纬度有3个单独的单元格，所以在每个单元格之间插入一个空字符串

最佳解决方案可能是添加一个计数器并将所有内容存储在地图中，而不是两个列表：

test = ['https://www.latlong.net/location/10-things-i-hate-about-you-locations-250',
'https://www.latlong.net/location/12-angry-men-locations-818',
'https://www.latlong.net/location/12-monkeys-locations-501']

for i in range(0, len(test), 1):
   r = requests.get(test[i])
   testone = {}
   cells = soup.select("td")
   soup = BeautifulSoup(r.content, 'lxml')
   for h in soup.select_one("h3"):
       testone[h] = list()
       for j in range(3):
           testone[h].append(cells.pop(0))

通过这种方式，您可以使用testone[<filmname>]获得有关胶片的所有信息

网友

2楼 · 编辑于 2024-10-04 09:24:08

用extned()代替append()；由于strip()函数返回一个list，并且您希望将列表的所有项附加到testone['location name']
试试这个：

for i in range(0, len(test), 1):
    r = requests.get(test[i])
    testone = {'location name':[],'film':[]}
    soup = BeautifulSoup(r.content, 'lxml')
    for th in soup.select("td"):
        testone['location name'].extend(th.text.strip())
        # Do nothing
    for h in soup.select_one("h3"):
        testone['film'].append(h)

相关问题更多 >

编程相关推荐

热门问题

热门文章