How to display Japanese characters in a CSV file written by a Python web scraper

Posted 2024-04-27 23:50:11

The CSV file my script creates does not display Japanese characters correctly. How can I fix this? I am new to Python.

import requests
from bs4 import BeautifulSoup
import csv

csv_file = open('details.csv', 'w', encoding='utf-8')
csv_writer = csv.writer(csv_file)
csv_writer.writerow(['company Name', 'link', 'established', 'members', 'address'])

for i in range(1):
    res = requests.get('https://abcdefg.com/projects?type=mixed&page='+str(i))
    soup = BeautifulSoup(res.text, 'lxml')
    site = soup.select('h1>a', href=True)

    for i in site:
        res = requests.get('https://abcdefg.com/projects/' + i['href'][10:16])
        soup1 = BeautifulSoup(res.text, 'lxml')

        data = soup1.find('div', class_='company')
        company_name = (data.a.text).strip()

        data = soup1.find('div', class_='company-info-list')
        company_desc = data.find_all('div', class_='company-description')
        try:
            company_link = company_desc[0].text.strip()
        except Exception as e:
            company_link = "None"

        try:
            company_established = company_desc[1].text.strip()
        except Exception as identifier:
            company_established = "None"

        try:
            company_members = company_desc[2].text.strip()
        except Exception as identifier:
            company_members = "None"

        try:
            company_address = company_desc[4].text.strip()
        except Exception as identifier:
            company_address = "None"

        csv_writer.writerow([company_name, company_link, company_established, company_members, company_address])

csv_file.close()

I searched for a solution, but this line

csv_file = open('details.csv', 'w', encoding='utf-8')

does not fix the problem.

[Image: screenshot of the resulting CSV file]

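This is what the resulting CSV file looks like. I would also like to remove the rows that contain no data. Please help. I am on Windows 10.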
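
One possible fix, assuming the file is being opened in Excel on Windows (the question does not say which program is used): Excel usually does not treat a CSV as UTF-8 unless the file starts with a byte order mark, and the 'utf-8-sig' codec writes one. Passing newline='' to open() is what the csv module documentation recommends, and without it each row on Windows ends in '\r\r\n', which can appear as an empty row after every record. The sketch below also skips rows in which every field is missing, in case that is what "rows that contain no data" means. The write_rows helper and the sample rows are hypothetical stand-ins for the scraped values.

```python
import csv

HEADER = ['company Name', 'link', 'established', 'members', 'address']


def write_rows(path, rows):
    """Write rows to a CSV file that Excel on Windows should read as UTF-8."""
    # 'utf-8-sig' prepends a byte order mark (BOM) to the file.
    # newline='' lets the csv module control line endings itself.
    with open(path, 'w', encoding='utf-8-sig', newline='') as csv_file:
        csv_writer = csv.writer(csv_file)
        csv_writer.writerow(HEADER)
        for row in rows:
            # Skip rows in which every scraped field is missing.
            if all(value in ('', 'None') for value in row):
                continue
            csv_writer.writerow(row)


# Hypothetical sample rows standing in for the scraped values.
sample_rows = [
    ['株式会社サンプル', 'https://example.com', '2015年', '30人', '東京都渋谷区'],
    ['None', 'None', 'None', 'None', 'None'],
]
write_rows('details.csv', sample_rows)
```

In the original script this would mean changing only the open() call to open('details.csv', 'w', encoding='utf-8-sig', newline='') and adding the all-missing check before csv_writer.writerow(...). If the characters are already garbled in res.text itself, the cause may instead be the encoding that requests guessed for the page, which this sketch does not address.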

