如何将单词列表(如向量)相乘?

2024-09-22 16:29:48 发布

您现在位置:Python中文网/ 问答频道 /正文

我有这样的单词列表(这里只列出了2个):

list_1 = ['average', 'reasonable'] 
list_2 = ['fiddle', 'frolic']
list_n = ['etc', 'etc']

我想把这两张单子相乘得到这个答案:

obj[l1] * obj[l2] = ['average fiddle', 'average frolic', 'reasonable fiddle', 'reasnable frolic']
obj[l1] * obj[l2] *...* obj[n]

我写了这个代码:

import numpy as np
obj = {}
obj['l1'] = np.array(list_1)
obj['l2'] = np.array(list_2)
print(obj['l1']*obj['l2'])

但这只给了我一个错误:

TypeError: ufunc 'multiply' did not contain a loop with signature matching types dtype('<U10') dtype('<U10') dtype('<U10')

我该怎么办?你知道吗

编辑: 尝试使用以下用户建议的itertools:

word_list = ['fair play']
output = {'fair': ['average', 'reasonable'], 'play': ['fiddle', 'frolic']}
result = []
for words in word_list: 
    for word in word_tokenize(words): 
        list_1 = output_set[word]
        result = [(x, y) for x, y in product(list_1, result)]
        result = list(map(' '.join, result))
print(result)

但这只返回一个空集。有没有一种方法可以遍历“无限”列表?你知道吗


Tags: inobjl1fornpresultlistword
2条回答

使用itertools.产品我们可以将这些作为tuples获得,然后使用' '.join创建str

from itertools import product

list_1 = ['average', 'reasonable'] 
list_2 = ['fiddle', 'frolic']
list_n = ['etc', 'vash']

a = [(x, y, z) for x, y, z in product(list_1, list_2, list_n)]
a = list(map(' '.join, a))
# ['average fiddle etc', 'average fiddle vash', 'average frolic etc', 'average frolic vash', 'reasonable fiddle etc', 'reasonable fiddle vash', 'reasonable frolic etc', 'reasonable frolic vash']

如果必须使用列表:

In [86]: list_1 = ['average', 'reasonable'] 
    ...: list_2 = ['fiddle', 'frolic']
In [87]: arr1 = np.array(list_1, object)
In [88]: arr2 = np.array(list_2, object)
In [89]: np.add.outer(arr1, arr2)
Out[89]: 
array([['averagefiddle', 'averagefrolic'],
       ['reasonablefiddle', 'reasonablefrolic']], dtype=object)

通过创建对象数组,而不是stringdtype,我强制addufunc使用Python字符串的+方法。正如@Sandeep的答案所示,字符串加法是一种连接。字符串乘法是一种复制。你知道吗

第三个阵列:

In [90]: arr3 = np.array(['etc', 'etc'], object)
In [91]: np.add.outer(np.add.outer(arr1, arr2),arr3)
Out[91]: 
array([[['averagefiddleetc', 'averagefiddleetc'],
        ['averagefrolicetc', 'averagefrolicetc']],

       [['reasonablefiddleetc', 'reasonablefiddleetc'],
        ['reasonablefrolicetc', 'reasonablefrolicetc']]], dtype=object)

我在猜测你把行动链起来是什么意思。你知道吗

就我个人而言,我更喜欢@vash的itertools解决方案;numpy不会给Python的字符串处理增加太多内容。你知道吗

In [105]: [' '.join(x) for x in itertools.product(arr1,arr2,arr3)]
Out[105]: 
['average fiddle etc',
 'average fiddle etc',
 'average frolic etc',
 'average frolic etc',
 'reasonable fiddle etc',
 'reasonable fiddle etc',
 'reasonable frolic etc',
 'reasonable frolic etc']

相关问题 更多 >