在Python中删除HTML响应中的行之间的空格

import neocities import requests import re nc = neocities.NeoCities(api_key='[no]') response = nc.info() print(response) htmlresponse = requests.get('https://thesite.com/index.html') oldBlog = open('newindex.html', 'w') oldBlog.write(str(htmlresponse.text).strip('\n').strip('\r')) oldBlog.close() with open('newindex.html', 'r') as blog: contents = blog.readlines() contents.insert(39,' <p class="header">test lol</p>\n' ' <p class="logpost">foobar</p>\n') with open('newindex.html', 'w') as blog: contents = "".join(contents) blog.write(contents)

2条回答

网友

1楼 · 编辑于 2024-10-03 06:32:10

改变

oldBlog.write(str(htmlresponse.text).strip('\n').strip('\r'))

到

oldBlog.write(str(htmlresponse.text).replace('\n', ''))

网友

2楼 · 编辑于 2024-10-03 06:32:10

假设html是python字符串（在代码中html_string是str(htmlresponse.text)）：

html_string = '''<html>

    <head>

        <!  <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >

        <title>foo</title>

        <meta name="description" content="bar" />
'''

按换行符html_string.split('\n')拆分它将输出：

['<html>',
 '',
 '    <head>',
 '',
 '        <!  <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >',
 '',
 '        <title>foo</title>',
 '',
 '        <meta name="description" content="bar" />',
 '']

这段代码将提取列表中的每个字符串，如果字符串的长度为> 0，则将其保留

list1 = [line for line in html_string.split('\n') if len(line) > 0]

或更紧凑：

list1 = [line for line in html_string.split('\n') if line]

这将给你：

['<html>',
 '    <head>',
 '        <!  <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >',
 '        <title>foo</title>',
 '        <meta name="description" content="bar" />']

但是list1是一个列表。要将其转换回字符串，您需要：

new_html_string = '\n'.join(list1)

打印new_html_string将为您提供：

<html>
    <head>
        <!  <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >
        <title>foo</title>
        <meta name="description" content="bar" />

总而言之：

html_string = '''<html>

    <head>

        <!  <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >

        <title>foo</title>

        <meta name="description" content="bar" />
'''
list1 = [line for line in html_string.split('\n') if line]
new_html_string = '\n'.join(list1)

相关问题更多 >

编程相关推荐

热门问题

热门文章