用靓汤Python刮问题的回答

用靓汤Python刮

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

<pre><code><div class="members_box_second"> <div class="members_box0"> 1 </div> <div class="members_box1"> Name:Mr.Jagadhesan.S Designation:Proprietor CODISSIA - Designation:(Founder President, CODISSIA) Name of the Industry:Govardhana Engineering Industries Specification:LIFE Date of Admission:19.12.1969 </div> <div class="members_box2"> Ukkadam South Phone:2320085, 2320067 Email:<a href="mailto:jagadhesan@infognana.com">jagadhesan@infognana.com</a> </div> </div> <div class="members_box"> <div class="members_box0"> 2 </div> <div class="members_box1"> Name:Mr.Somasundaram.A Designation:Proprietor Name of the Industry:Everest Engineering Works Specification:LIFE Date of Admission:19.12.1969 </div> <div class="members_box2"> Alagar Nivas, 284 NSR Road Phone:2435674 <h4>Factory Address</h4> Coimbatore - 641 027 Phone:2435674 </div> </div> </code></pre> 我有上面的结构。因此，我试图从<code>class</code>成员框1和成员框2的<code>div</code>内的文本。在 我有下面的脚本，它只从成员\u box1获取数据 ^{pr2}$ 这就是我试图从两个盒子里得到数据的方法 <pre><code>from bs4 import BeautifulSoup import urllib2 import csv import re page = urllib2.urlopen("http://www.codissia.com/member/members-directory/?mode=paging&Keyword=&Type=&pg=1") soup = BeautifulSoup(page.read()) eachbox2 = soup.findAll('div ', {'class':'members_box2'}) for eachuniversity in soup.findAll('div',{'class':'members_box1'}): data = eachbox2 + [re.sub('\s+', ' ', text).strip().encode('utf8') for text in eachuniversity.find_all(text=True) if text.strip()] print data </code></pre> 但我得到的结果和我对成员的结果是一样的 更新 我希望迭代的输出是这样的（在单行中） <pre><code>Name:,Mr.Srinivasan.N,Designation:,Proprietor,CODISSIA - Designation:,(Past President, CODISSIA),Name of the Industry:,Arian Soap Manufacturing Co,Specification:,LIFE,Date of Admission:,19.12.1969, "Parijaat" 26/1Shanker Mutt Road, Basavana Gudi,Phone:,2313861 </code></pre> 但我得到的是： <pre><code>Name:,Mr.Srinivasan.N,Designation:,Proprietor,CODISSIA - Designation:,(Past President, CODISSIA),Name of the Industry:,Arian Soap Manufacturing Co,Specification:,LIFE,Date of Admission:,19.12.1969 "Parijaat" 26/1Shanker Mutt Road, Basavana Gudi,Phone:,2313861 </code></pre>

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

用靓汤Python刮

1 个回答

相关Python问题