抓取html数据并解析成lis

import android droid = android.Android() import urllib current = 0 newlist = [] sock = urllib.urlopen("http://m.funtweets.com/random") htmlSource = sock.read() sock.close() rawhtml = [] rawhtml.append (htmlSource) while current < len(rawhtml): while current != "<div class=": if [current] == "</b></a>": newlist.append (current) current += 1 print newlist

2条回答

网友

1楼 · 编辑于 2024-09-28 21:26:27

方法如下： [代码] 进口re 导入urllib2

page = urllib2.urlopen("http://www.m.funtweets.com/random").read() 
user = re.compile(r'<span>@</span>(\w+)') 
text = re.compile(r"</b></a> (\w.*)") 
user_lst =[match.group(1) for match in re.finditer(user, page)] 
text_lst =[match.group(1) for match in re.finditer(text, page)] 
for _user, _text in zip(user_lst, text_lst):
    print '@{0}\n{1}\n'.format(_user,_text)

[/代码]

网友

2楼 · 编辑于 2024-09-28 21:26:27

在android中使用这个LIB来解析HTMLhttp://jsoup.org/它的范围和开发人员广泛接受的LIB它也可以用于python:）

相关问题更多 >

编程相关推荐

热门问题

热门文章