<pre><code>import urllib
budgeturl = "http://www.the-numbers.com/movie/budgets/all"
s = urllib.urlopen(budgeturl).read()
def find_between( s, first, last ):
try:
start = s.index( first ) + len( first )
end = s.index( last, start )
return s[start:end]
except ValueError:
return ""
s = find_between(s, '<table>', '</table>')
print s[:500]
print '.............................................................'
print s[-250:]
</code></pre>
<p><a href="https://stackoverflow.com/questions/3368969/find-string-between-two-substrings">Find string between two substrings</a></p>
<p>退货:</p>
^{pr2}$
<p><a href="https://i.stack.imgur.com/tk7vr.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/tk7vr.png" alt="enter image description here"/></a></p>
<pre><code>.........................................
</code></pre>
<p><a href="https://i.stack.imgur.com/Ct7yY.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/Ct7yY.png" alt="enter image description here"/></a></p>
<blockquote>
<p>I need the text not the link.</p>
</blockquote>
<p>通过<a href="http://www.convertcsv.com/html-table-to-csv.htm" rel="nofollow noreferrer">http://www.convertcsv.com/html-table-to-csv.htm</a></p>
<pre><code>Release Date,Movie,Production Budget,Domestic Gross,Worldwide Gross
1,12/18/2009,Avatar,"$425,000,000","$760,507,625","$2,783,918,982"
8/5/2005,My Date With Drew,"$1,100","$181,041","$181,041"
</code></pre>
<p>您可以使用beautifulsoup执行相同操作,请参见:</p>
<p><a href="https://stackoverflow.com/questions/14167352/beautifulsoup-html-csv">beautifulSoup html csv</a></p>