无法删除爬取文本之间的空格

html=""" <div class="postal-address"> <p>11525 23 AVE</p> <p>EDMONTON, AB , T6J 4T3 </p> <p><a rel="nofollow" href="mailto:info@something.com">info@something.com</a></p> <p><a rel="nofollow" href="http://www.something.org" target="_blank">Visit our Web Site</a></p> </div> """

3条回答

网友

1楼 · 编辑于 2024-10-01 22:41:46

将源字符串拆分为逗号。在
从结果列表中的每个字符串中去掉前导空格或尾随空格。在
使用', '作为分隔符连接字符串。在

像这样：

src = '11525 23 AVE, EDMONTON,\n        AB\n        ,\n        T6J 4T3\n'
print(', '.join([s.strip() for s in src.split(',')]))

输出

^{pr2}$

如果已经有字符串列表，则更容易：

^{3}$

网友

2楼 · 编辑于 2024-10-01 22:41:46

当你这样做的时候。replace（“\n”，“”）我想你必须避开斜杠。这有时会令人困惑，如果不尝试的话，我无法告诉你需要多少个斜杠来逃避它，但请尝试其中一个。。。。在

.replace("\\n","")
.replace("\\\n","")
.replace("\\\\n","")

使用单引号时会发生什么？在

网友

3楼 · 编辑于 2024-10-01 22:41:46

请尝试以下解决方案，如有任何问题，请通知我：

address = [" ".join(item.text.split()).replace(" ,", ",") for item in root.cssselect(".postal-address p") if item.text]

输出：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章