Trying to split a file download buffer into separate threads

Category: Python Q&A

1 answer

Anonymous, 1 day ago

<p>Here is how I got it working. If anyone has suggestions or possible improvements, they are very welcome.</p>
<pre><code>import threading
import time
import urllib.request

import requests

url = "http://www.nasa.gov/images/content/607800main_kepler1200_1600-1200.jpg"


def buildRange(value, numsplits):
    """Return numsplits inclusive 'start-end' byte ranges covering bytes 0..value-1."""
    lst = []
    for i in range(numsplits):
        # Integer arithmetic keeps the ranges contiguous, with no gaps or overlaps.
        start = i * value // numsplits
        end = (i + 1) * value // numsplits - 1
        lst.append('%s-%s' % (start, end))
    return lst


class SplitBufferThreads(threading.Thread):
    """Downloads one byte range of the URL in its own thread, so that
    several ranges can be fetched concurrently.
    """

    def __init__(self, url, byteRange):
        super(SplitBufferThreads, self).__init__()
        self.__url = url
        self.__byteRange = byteRange
        self.data = None

    def run(self):
        # The request is sent and read inside run(), i.e. inside the thread itself.
        req = urllib.request.Request(self.__url, headers={'Range': 'bytes=%s' % self.__byteRange})
        self.data = urllib.request.urlopen(req).read()

    def getFileData(self):
        return self.data


def main(url=None, splitBy=3):
    start_time = time.time()
    if not url:
        print("Please Enter some url to begin download.")
        return

    fileName = url.split('/')[-1]
    sizeInBytes = requests.head(url, headers={'Accept-Encoding': 'identity'}).headers.get('content-length', None)
    print("%s bytes to download." % sizeInBytes)
    if not sizeInBytes:
        print("Size cannot be determined.")
        return

    # Start every thread before joining any of them; joining right after
    # start() would make the downloads run one after another.
    threads = [SplitBufferThreads(url, byteRange) for byteRange in buildRange(int(sizeInBytes), splitBy)]
    for bufTh in threads:
        bufTh.start()
    for bufTh in threads:
        bufTh.join()

    content = b''.join(bufTh.getFileData() for bufTh in threads)
    print(" - %s seconds -" % str(time.time() - start_time))
    # Binary mode keeps the image bytes intact and truncates any existing file.
    with open(fileName, 'wb') as fh:
        fh.write(content)
    print("Finished Writing file %s" % fileName)


if __name__ == '__main__':
    main(url)
</code></pre>
<p>This is the first basic version I got working. I found that if I set the <code>bufTh</code> buffer thread to daemon False, the process takes more time to complete.</p>
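<p>As one possible variation on the same idea, here is a minimal sketch that uses <code>concurrent.futures.ThreadPoolExecutor</code> instead of hand-written <code>Thread</code> subclasses. The names <code>split_ranges</code>, <code>fetch_range</code> and <code>download_parallel</code> are made up for illustration and are not part of the answer above. It assumes the server honors the <code>Range</code> header and that the file size is at least the number of parts. The <code>fetch</code> parameter is only there so the splitting and reassembly can be checked offline.</p>

```python
from concurrent.futures import ThreadPoolExecutor
import urllib.request


def split_ranges(size, parts):
    """Return `parts` inclusive (start, end) byte offsets covering 0..size-1.

    Assumes size >= parts; smaller sizes would produce empty ranges.
    """
    return [(i * size // parts, (i + 1) * size // parts - 1) for i in range(parts)]


def fetch_range(url, start, end):
    """Fetch one inclusive byte range (assumes the server honors Range)."""
    req = urllib.request.Request(url, headers={"Range": "bytes=%d-%d" % (start, end)})
    with urllib.request.urlopen(req) as resp:
        return resp.read()


def download_parallel(url, size, parts=3, fetch=fetch_range):
    """Fetch all ranges concurrently and return the bytes joined in range order."""
    ranges = split_ranges(size, parts)
    with ThreadPoolExecutor(max_workers=parts) as pool:
        # map() yields results in input order, so the chunks need no re-sorting.
        chunks = pool.map(lambda r: fetch(url, r[0], r[1]), ranges)
        return b"".join(chunks)


if __name__ == "__main__":
    # Offline check: a stand-in fetch function slices an in-memory payload
    # instead of touching the network (hypothetical URL, never requested).
    payload = bytes(range(256)) * 4
    fake_fetch = lambda url, start, end: payload[start:end + 1]
    result = download_parallel("http://example.invalid/file", len(payload), 3, fetch=fake_fetch)
    print(result == payload)
```

<p>Because the executor waits for all submitted work when its <code>with</code> block exits, there is no separate start/join bookkeeping here, and the daemon flag does not come into play.</p>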
