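I am trying to fetch URLs with a library and use extruct to extract the JSON and RDFa data. Somehow there is an error in the code, and I get a SQL error.
The code is as follows: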
import pyodbc
import requests
from pprint import pprint
import extruct

# Raw string so the backslash in the server name is kept literally.
cnxn = pyodbc.connect(r'DRIVER={SQL Server};SERVER=localhost\SQLEXPRESS;DATABASE=WebCrawler;Trusted_Connection=yes')
cursor = cnxn.cursor()
cursor.execute("select Id, url from WebCrawlerEFs")
rows = cursor.fetchall()
for row in rows:
    print(row.Id, ",", row.url)
    r = requests.get(row.url)
    data = extruct.extract(r.text, r.url)
    cursor.execute("INSERT INTO RdfaEFs(rdfa) VALUES ('"data"')")
    cnxn.commit()
Try:
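The INSERT line is the likely culprit: `data` is a Python dict (extruct returns one keyed by syntax name), and it is spliced into the SQL text between quotes, so it is neither valid string concatenation nor safe against apostrophes in the scraped content. A minimal sketch of one way around this, assuming the `rdfa` column is a text type: serialize the value with `json.dumps` and pass it through a `?` placeholder. The helper name `save_rdfa` and the sample dict are made up for illustration, and an in-memory `sqlite3` database stands in for SQL Server so the snippet runs on its own; pyodbc uses the same `?` placeholder style.

```python
import json
import sqlite3


def save_rdfa(cursor, data):
    """Insert the 'rdfa' part of an extruct-style result as a JSON string."""
    # A dict cannot be pasted into SQL text, so serialize it first.
    payload = json.dumps(data.get("rdfa", []), ensure_ascii=False)
    # The "?" placeholder lets the driver handle quoting, so apostrophes
    # in the JSON cannot break the statement.
    cursor.execute("INSERT INTO RdfaEFs(rdfa) VALUES (?)", (payload,))
    return payload


# Stand-in database so the sketch is runnable anywhere (assumption: with
# pyodbc you would pass the cursor obtained from cnxn.cursor() instead).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE RdfaEFs (rdfa TEXT)")

# Hypothetical sample shaped like an extruct result, not real output.
sample = {
    "json-ld": [],
    "rdfa": [{"@id": "http://example.com/", "title": "It's a test"}],
}
saved = save_rdfa(cur, sample)
conn.commit()

stored = cur.execute("SELECT rdfa FROM RdfaEFs").fetchone()[0]
print(json.loads(stored))
```

In the original loop this would amount to replacing the INSERT line with a call such as `save_rdfa(cursor, data)`. If you want the whole extruct result rather than only the RDFa part, serialize `data` itself.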