Stripping part of a URL in Python

http://www.mega.pk/**washingmachine**-dawlance/
http://www.mega.pk/**washingmachine**-haier/
http://www.mega.pk/**airconditioners**-acson/
http://www.mega.pk/**airconditioners**-lg/
http://www.mega.pk/**airconditioners**-samsung/

3 Answers

User

Answer 1 · edited 2024-10-01 17:26:43

Use the urlparse module (urllib.parse in Python 3). It is built for exactly this purpose.

from urllib.parse import urlparse  # Python 3; on Python 2 use: from urlparse import urlparse

url = "http://www.mega.pk/washingmachine-dawlance/"

path = urlparse(url).path  # get the path from the URL ("/washingmachine-dawlance/")
path = path[:path.index("-")]  # drop everything from the first '-' onward
path = path[1:]  # drop the leading '/' (just before 'washingmachine')

The path variable now holds washingmachine.
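The same steps can be wrapped in a small helper and applied to every URL from the question. The following is a minimal sketch, assuming Python 3 (where urlparse lives in urllib.parse); the function name category_from_url is hypothetical and not part of the original answer. It uses str.partition instead of str.index so that a path without a '-' returns the whole segment rather than raising ValueError.

```python
from urllib.parse import urlparse  # Python 3 location of urlparse


def category_from_url(url):
    """Return the part of the first path segment that precedes the first '-'.

    Hypothetical helper for illustration; not from the original answer.
    """
    path = urlparse(url).path                      # e.g. "/washingmachine-dawlance/"
    first_segment = path.strip("/").split("/")[0]  # e.g. "washingmachine-dawlance"
    # partition returns the whole string as element 0 when '-' is absent
    return first_segment.partition("-")[0]


urls = [
    "http://www.mega.pk/washingmachine-dawlance/",
    "http://www.mega.pk/washingmachine-haier/",
    "http://www.mega.pk/airconditioners-acson/",
    "http://www.mega.pk/airconditioners-lg/",
    "http://www.mega.pk/airconditioners-samsung/",
]

for url in urls:
    print(category_from_url(url))
```

For the five URLs above this prints washingmachine twice and airconditioners three times.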

See the urlparse entry in Python Module of the Week for further reading.

Cheers!

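User

Answer 2 · edited 2024-10-01 17:26:43

You can get the same result without regular expressions. The solution Avinash proposed is more concise, but the approach below may be easier to understand, especially if you want to modify it at some point: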

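s = '''http://www.mega.pk/washingmachine-dawlance/
http://www.mega.pk/washingmachine-haier/'''.splitlines()
for line in s:
    # drop the site prefix and every remaining '/'
    cleanedUrl = line.replace('http://www.mega.pk/', '').replace('/', '')
    urlParameters = cleanedUrl.split('-')  # e.g. ['washingmachine', 'dawlance']
    print(urlParameters[0])  # part before the '-'; use [-1] for the part after it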

Or, if you prefer, you can use a more compact version:

[The code for the compact version is missing from this copy.]
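A compact variant could look like the following. This is only a sketch of one possibility, not the answer author's code: the variable names prefix and categories are assumptions, and it keeps the part before the first '-' (the value the first answer arrives at).

```python
urls = '''http://www.mega.pk/washingmachine-dawlance/
http://www.mega.pk/washingmachine-haier/'''.splitlines()

prefix = 'http://www.mega.pk/'  # assumed fixed site prefix

# strip the prefix and slashes, then keep the text before the first '-'
categories = [u.replace(prefix, '').replace('/', '').split('-')[0] for u in urls]
print(categories)
```

Because the prefix is hard-coded, this only suits input where every URL starts with the same host.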

User

Answer 3 · edited 2024-10-01 17:26:43

Use re.sub:

re.sub(r'^.*\/([^/]*)-.*', r'\1', line)


Example:

[The example code is missing from this copy.]
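As a sketch of how the substitution might be applied, the loop below runs the pattern over the URLs from the question. The names pattern, urls and results are assumptions for illustration; only the re.sub call itself comes from the answer.

```python
import re

# Pattern from the answer: capture the text between the last usable '/'
# and the '-' that follows it, then replace the whole line with that capture.
pattern = r'^.*\/([^/]*)-.*'

urls = [
    "http://www.mega.pk/washingmachine-dawlance/",
    "http://www.mega.pk/washingmachine-haier/",
    "http://www.mega.pk/airconditioners-acson/",
    "http://www.mega.pk/airconditioners-lg/",
    "http://www.mega.pk/airconditioners-samsung/",
]

results = [re.sub(pattern, r'\1', line) for line in urls]
for value in results:
    print(value)
```

Note that a line with no '-' after its last path separator does not match the pattern, so re.sub returns it unchanged.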
