任何解决以下问题的有效方法假设数据很大。我解决了这个问题,但我如何才能改进代码,这将使它的效率。有什么建议吗?你知道吗
数据:
movie_sub_themes = {
'Epic': ['Ben Hur', 'Gone With the Wind', 'Lawrence of Arabia'],
'Spy': ['James Bond', 'Salt', 'Mission: Impossible'],
'Superhero': ['The Dark Knight Trilogy', 'Hancock, Superman'],
'Gangster': ['Gangs of New York', 'City of God', 'Reservoir Dogs'],
'Fairy Tale': ['Maleficent', 'Into the Woods', 'Jack the Giant Killer'],
'Romantic':['Casablanca', 'The English Patient', 'A Walk to Remember'],
'Epic Fantasy': ['Lord of the Rings', 'Chronicles of Narnia', 'Beowulf']}
movie_themes = {
'Action': ['Epic', 'Spy', 'Superhero'],
'Crime' : ['Gangster'],
'Fantasy' : ['Fairy Tale', 'Epic Fantasy'],
'Romance' : ['Romantic']}
themes_keys = movie_themes.keys()
theme_movies_keys = movie_sub_themes.keys()
#Iterate in movie_themes
#Check movie_themes keys in movie_sub_keys
#if yes append the movie_sub_keys into the newdict
newdict = {}
for i in range(len(themes_keys)):
a = []
for j in range(len(movie_themes[themes_keys[i]])):
try:
if movie_themes[themes_keys[i]][j] in theme_movies_keys:
a.append(movie_sub_themes[movie_themes[themes_keys[i]][j]])
except:
pass
newdict[themes_keys[i]] = a
# newdict contains nested lists
# Program to unpack the nested list into single list
# Storing the value into theme_movies_data
theme_movies_data = {}
for k, v in newdict.iteritems():
mylist_n = [j for i in v for j in i]
theme_movies_data[k] = dict.fromkeys(mylist_n).keys()
print (theme_movies_data)
输出:
{'Action': ['Gone With the Wind', 'Ben Hur','Hancock, Superman','Mission: Impossible','James Bond','Lawrence of Arabia','Salt','The Dark Knight Trilogy'],
'Crime': ['City of God', 'Reservoir Dogs', 'Gangs of New York'],
'Fantasy': ['Jack the Giant Killer','Beowulf','Into the Woods','Maleficent','Lord of the Rings','Chronicles of Narnia'],
'Romance': ['The English Patient', 'A Walk to Remember', 'Casablanca']}
抱歉没有正确评论代码。你知道吗
我更关心跑步时间。你知道吗
谢谢你。。你知道吗
您可以使用关系数据库来存储两个表,一个是电影及其子主题,另一个是将子主题与电影主题相关联的表。然后可以使用SQL查询数据库,选择所有电影及其相关电影主题的列表。你知道吗
这种方法会更有效,因为SQL命令的编译往往是为了提高处理速度。关系数据库模型具有很强的可伸缩性,因此可以以最小的开销处理非常大的数据集。你知道吗
有关在Python中创建和使用简单数据库的示例,请参见here。如果您不熟悉SQL操作,请参阅here以获取有关有用操作的简单教程。你知道吗
下面是我的解决方案(使用defaultdict):
计时:4.84µs vs 14.6µs
相关问题 更多 >
编程相关推荐