擅长:python、mysql、java
<p>您可以使用正则表达式来提取前三个单词。你知道吗</p>
<pre class="lang-py prettyprint-override"><code>df.select(regexp_extract(col("product"), "([^\\s]+\\s+){0,2}[^\\s]+", 0))\
.show(truncate=False)
</code></pre>
<pre><code>+ +
|regexp_extract(product, ([^\s]+\s+){0,2}[^\s]+, 0)|
+ +
|HI Celebrate Cake |
|GO Choc Celebrat |
|BI Chocolate Buttercream |
|Graduation Cake 28 |
|Slab Image Cake |
|Slab Celebration Cake |
|Grain Bread |
+ +
</code></pre>