Python stock-scraping script doesn't pull back pricing data?

import urllib
import re

symbolfile = open("symbols.txt")
symbolslist = symbolfile.read()
newsymbolslist = symbolslist.split("\n")

i = 0
while i<len(newsymbollist):
    url = "http://finance.yahoo.com/q?uhb=uh3_finance_vert_gs_ctrl1&fr=&type=2button&s=" +symbolslist[i] +""
    htmlfile = urllib.urlopen(url)
    htmltext = htmlfile.read()
    regex = '<span id="yfs_184_' +newsymbolslist[i] +'">(.+?)</span>'
    pattern = re.compile(regex)
    price = re.findall(pattern,htmltext)
    print "The price of", newsymbolslist[i] ," is ", price
    i+=1

1 Answer

Answer 1 · Posted 2024-09-28 21:57:54

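With the change that @Linus Gustav Larsson Thiel suggested in the comments, plus one more change to the regex, your code returns the correct results. Note the lower() call in the regular expression: it is needed because the page source contains the symbols in lowercase. Also note that the id prefix is yfs_l84_ (with a lowercase letter l), not yfs_184_: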

i = 0

while i < len(newsymbolslist):
    url = "http://finance.yahoo.com/q?uhb=uh3_finance_vert_gs_ctrl1&fr=&type=2button&s=" +newsymbolslist[i]
    htmlfile = urllib.urlopen(url)
    htmltext = htmlfile.read()
    regex = '<span id="yfs_l84_' +newsymbolslist[i].lower() +'">(.+?)</span>'
    pattern = re.compile(regex)
    price = pattern.findall(htmltext)
    print "The price of", newsymbolslist[i] ," is ", price
    i+=1

For the static list ['AAPL','GOOGL','MSFT'], used for testing, I get the following output:

The price of AAPL  is  ['98.53']
The price of GOOGL  is  ['733.07']
The price of MSFT  is  ['52.30']
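To see why the lower() call matters without hitting the network, here is a minimal offline sketch. The sample_html fragment and the find_prices helper are hypothetical; they only imitate the markup described above and are not taken from the real page.

```python
import re

# Hypothetical fragment imitating the markup the answer describes:
# the span id carries the ticker symbol in lowercase.
sample_html = '<span id="yfs_l84_aapl">98.53</span>'


def find_prices(html, id_suffix):
    """Return every price captured from a span whose id ends in id_suffix."""
    pattern = re.compile('<span id="yfs_l84_' + id_suffix + '">(.+?)</span>')
    return pattern.findall(html)


# The pattern is case-sensitive, so the uppercase symbol matches nothing.
upper_result = find_prices(sample_html, "AAPL")
# Lowercasing the symbol first makes the id match.
lower_result = find_prices(sample_html, "AAPL".lower())

print(upper_result)  # []
print(lower_result)  # ['98.53']
```

An empty list like the first result is what a mismatched id produces, whether the mismatch comes from the letter case or from the wrong prefix.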

If you like, you can also simplify the code:

baseurl = "http://finance.yahoo.com/q?uhb=uh3_finance_vert_gs_ctrl1&fr=&type=2button&s="

for symbol in newsymbolslist:
    url = baseurl + symbol
    source = urllib.urlopen(url).read()
    regex = re.compile('<span id="yfs_l84_' + symbol.lower() + '">(.+?)</span>')
    price = regex.findall(source)[0]
    print "The price of", symbol, "is", price

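The for ... in ... loop removes the need for a counter variable. Because findall() returns a list of matches (and you only need one), you can append [0] to get the contained string instead of a list with a single element. Be aware that [0] raises an IndexError if the pattern matches nothing.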

This returns the following:

The price of AAPL is 98.53
The price of GOOGL is 733.07
The price of MSFT is 52.30
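The snippets above are Python 2 (urllib.urlopen and the print statement). For Python 3, a rough port might look like the sketch below. It assumes the page still uses the yfs_l84_ span id shown in this answer, which may no longer be true, and that the response is UTF-8. The names BASE_URL, extract_price and print_prices are made up for illustration.

```python
import re
from urllib.request import urlopen  # Python 3 location of urlopen

# URL taken from the answer above; whether it still serves this markup
# is an assumption.
BASE_URL = "http://finance.yahoo.com/q?uhb=uh3_finance_vert_gs_ctrl1&fr=&type=2button&s="


def extract_price(html, symbol):
    """Return the price string for symbol, or None if the span is not found."""
    # re.escape guards against symbols that contain regex metacharacters
    # such as ".".
    pattern = '<span id="yfs_l84_' + re.escape(symbol.lower()) + '">(.+?)</span>'
    match = re.search(pattern, html)
    return match.group(1) if match else None


def print_prices(symbols):
    """Fetch each quote page and print its price (performs network requests)."""
    for symbol in symbols:
        # Assumption: the page is UTF-8; undecodable bytes are replaced.
        html = urlopen(BASE_URL + symbol).read().decode("utf-8", errors="replace")
        price = extract_price(html, symbol)
        print("The price of", symbol, "is", price if price is not None else "not found")
```

Calling print_prices(['AAPL', 'GOOGL', 'MSFT']) would run the loop. Using re.search and returning None avoids the IndexError that [0] raises when nothing matches, and keeping extract_price separate from the download makes it testable against a saved HTML string.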
