创建一个2d numpy数组来保存字符问题的回答

创建一个2d numpy数组来保存字符

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我有下面的numpy数组和前面的设置，它有一个单词队列和一个临时变量'temp'来存储单词。这个单词需要一个字母一个字母地“放入”numpy 2d数组中： <pre><code>from collections import deque import numpy as np message=input("Write a message:") wordqueue=message.split() queue=deque(wordqueue) print(wordqueue) for i in range(1): temp=wordqueue.pop(0) #store the removed item in the temporary variable 'temp' print(wordqueue) print(temp) display = np.zeros((4,10)) #create a 2d array that is to store the words from the queue print(display) display[0, 0] = temp #add the word from the temp variable to fill the array (each character in each sequential position in the array) print(display) </code></pre> 不幸的是，输出如下： ^{pr2}$ 我确实尝试过定义2d数组和定义数据类型，但这也不是很明显，我不断得到各种错误。在 我需要以下帮助： 1理想情况下，我希望numpy数组设置为“*”而不是0/1（文档对这个设置没有帮助）。 2用temp变量替换数组中的*s。每个字母一个* 示例： 显示阵列：（4 x 20） <pre><code>* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * </code></pre> 输入消息：这是一个测试消息临时工：这个 更新后的显示将显示： <pre><code>t h i s * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * </code></pre> 对于后续的单词，它将填充数组（如果单词太大，则截短，必要时转到下一行） 目前为止： <a href="https://repl.it/IcJ3/7" rel="nofollow noreferrer">https://repl.it/IcJ3/7</a> 例如，我尝试了以下方法来创建char数组： <pre><code>display = np.chararray((4,10)) #create a 2d array that is to store the letters in the words from the queue display[:]="*" </code></pre> 但它产生了这个错误的“b”。不明白为什么。。。在 <pre><code>[[b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*'] [b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*'] [b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*'] [b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*' b'*']] </code></pre> 更新（正在处理）更换在这里： <a href="https://repl.it/IcJ3/8" rel="nofollow noreferrer">https://repl.it/IcJ3/8</a>

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

第一件事是第一件事，如果你想要一个“字符”数组，你必须小心你所期望的。在python3中，字符串现在是unicode代码点的序列。在Python2中，字符串是C等语言中的经典“字节序列”字符串。这意味着，从内存pov来看，unicode类型可能会占用更多内存： <pre><code>In [1]: import numpy as np In [2]: chararray = np.zeros((4,10), dtype='S1') In [3]: unicodearray = np.zeros((4,10), dtype='U1') In [4]: chararray.itemsize, unicodearray.itemsize Out[4]: (1, 4) In [5]: chararray.nbytes Out[5]: 40 In [6]: unicodearray.nbytes Out[6]: 160 </code></pre> 因此，如果您知道您只想使用ascii字符，那么可以使用<code>S1</code>数据类型将内存使用量减少到1/4。还要注意，由于Python 3中的<code>S1</code>实际上对应于<code>bytes</code>数据类型（这与Python 2<code>str</code>相等），所以<code>b'this is a bytes object'</code>前面加了一个<code>b</code>，因此<code>b'this is a bytes object'</code>： ^{pr2}$ 现在，假设您有一些负载，您想将消息分配给您的数组。如果消息包含可表示为ascii的字符，则可以快速而松散地使用数据类型： <pre><code>In [15]: message = 'This' In [16]: unicodearray.reshape(-1)[:len(message)] = list(message) In [17]: unicodearray Out[17]: array(['T', 'h', 'i', 's', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', ''], dtype='<U1') In [18]: chararray.reshape(-1)[:len(message)] = list(message) In [19]: chararray Out[19]: array([[b'T', b'h', b'i', b's', b'', b'', b'', b'', b'', b''], [b'', b'', b'', b'', b'', b'', b'', b'', b'', b''], [b'', b'', b'', b'', b'', b'', b'', b'', b'', b''], [b'', b'', b'', b'', b'', b'', b'', b'', b'', b'']], dtype='|S1') </code></pre> 然而，如果情况并非如此： <pre><code>In [22]: message = "กขฃคฅฆงจฉ" In [23]: len(message) Out[23]: 9 In [24]: unicodearray.reshape(-1)[:len(message)] = list(message) In [25]: unicodearray Out[25]: array(['ก', 'ข', 'ฃ', 'ค', 'ฅ', 'ฆ', 'ง', 'จ', 'ฉ', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', ''], dtype='<U1') In [26]: chararray.reshape(-1)[:len(message)] = list(message) - UnicodeEncodeError Traceback (most recent call last) <ipython-input-26-7d7cdb93de1f> in <module>() > 1 chararray.reshape(-1)[:len(message)] = list(message) UnicodeEncodeError: 'ascii' codec can't encode character '\u0e01' in position 0: ordinal not in range(128) In [27]: </code></pre> 注意，如果您想用一个元素初始化数组，而不是它默认使用的<code>np.zeros</code>，可以使用<code>np.full</code>： <pre><code>In [27]: chararray = np.full((4,10), '*', dtype='S1') In [28]: chararray Out[28]: array([[b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*'], [b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*'], [b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*'], [b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*', b'*']], dtype='|S1') </code></pre> 最后，要使用for循环执行此长表单： <pre><code>In [17]: temp = "a test" In [18]: display = np.full((4,10), '*', dtype='U1') In [19]: display Out[19]: array([['*', '*', '*', '*', '*', '*', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*']], dtype='<U1') In [20]: it = iter(temp) # give us a single-pass iterator ...: for i in range(display.shape[0]): ...: for j, c in zip(range(display.shape[1]), it): ...: display[i, j] = c ...: In [21]: display Out[21]: array([['a', ' ', 't', 'e', 's', 't', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*']], dtype='<U1') </code></pre> 另一个关于良好度量的测试，跨越行： <pre><code>In [36]: temp = "this is a test, a test this is" In [37]: display = np.full((4,10), '*', dtype='U1') In [38]: it = iter(temp) # give us a single-pass iterator ...: for i in range(display.shape[0]): ...: for j, c in zip(range(display.shape[1]), it): ...: display[i, j] = c ...: In [39]: display Out[39]: array([['t', 'h', 'i', 's', ' ', 'i', 's', ' ', 'a', ' '], ['t', 'e', 's', 't', ',', ' ', 'a', ' ', 't', 'e'], ['s', 't', ' ', 't', 'h', 'i', 's', ' ', 'i', 's'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*']], dtype='<U1') </code></pre> 警告传递给<code>zip</code>的参数顺序很重要，因为<code>it</code>是一个单循环迭代器： <pre><code>zip(range(display.shape[1]), it) </code></pre> 它应该是最后一个参数，否则它将跳过行之间的字符！在 最后，请注意，<code>numpy</code>提供了一个方便的函数，用于按顺序迭代数组： <pre><code>In [49]: temp = "this is yet another test" In [50]: display = np.full((4,10), '*', dtype='U1') In [51]: for c, x in zip(temp, np.nditer(display, op_flags=['readwrite'])): ...: x[...] = c ...: In [52]: display Out[52]: array([['t', 'h', 'i', 's', ' ', 'i', 's', ' ', 'y', 'e'], ['t', ' ', 'a', 'n', 'o', 't', 'h', 'e', 'r', ' '], ['t', 'e', 's', 't', '*', '*', '*', '*', '*', '*'], ['*', '*', '*', '*', '*', '*', '*', '*', '*', '*']], dtype='<U1') </code></pre> 为了确保返回的迭代器允许对底层数组进行修改，必须将<code>op_flags=['readwrite']</code>传递给函数，这有一个小的复杂性，但它极大地简化了代码，而且我们不需要使用单次迭代器。不过，我还是喜欢切片分配。在

创建一个2d numpy数组来保存字符

1 个回答

相关Python问题