<p>I suggest using <a href="https://pypi.python.org/pypi/beautifulsoup4" rel="nofollow noreferrer">BeautifulSoup</a>, like so:</p>
<pre><code>import re

import requests
from bs4 import BeautifulSoup

# Fetch the page that lists the proxies
res = requests.get('https://www.proxynova.com/proxy-server-list/country-fr/')
soup = BeautifulSoup(res.content, "lxml")

# Each IP is written by an inline script that this pattern matches:
# group 1 is the padded first part, group 2 is the remainder
REGEX_JS = re.compile(r"^document\.write\('([^']+)'\.substr\(2\) \+ '([^']+)'\);$")

proxy_ip_list = []
for table in soup.find_all("table", id="tbl_proxy_list"):
    for script in table.find_all("script"):
        m = REGEX_JS.search(script.text)
        if m:
            # Mirror the JavaScript: drop the first two characters, then concatenate
            proxy_ip_list.append(m.group(1)[2:] + m.group(2))

for ip in proxy_ip_list:
    print(ip)
</code></pre>
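<p>To check the decoding step without requesting the page, the regex can be tried on a single string. This is only a minimal sketch: the helper name <code>decode_ip</code> and the sample string are made up for illustration, and the site's real markup may differ or may have changed since.</p>

```python
import re

# Same pattern as in the scraper above
REGEX_JS = re.compile(r"^document\.write\('([^']+)'\.substr\(2\) \+ '([^']+)'\);$")


def decode_ip(script_text):
    """Return the IP hidden in a document.write(...) call, or None if the text does not match."""
    m = REGEX_JS.search(script_text.strip())
    if m is None:
        return None
    # substr(2) in the JavaScript drops the two padding characters
    return m.group(1)[2:] + m.group(2)


# Hypothetical sample, not copied from the site; the address is in a documentation-only range
sample = "document.write('12203.0.1'.substr(2) + '13.7');"
print(decode_ip(sample))
```

<p>If <code>decode_ip</code> returns <code>None</code> for the scripts on the live page, the obfuscation has probably changed and the pattern needs updating.</p>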