擅长:python、mysql、java
<p>可能有很大的优化空间(可能有依赖项/更少的循环),但这里有一个非regex解决方案,如注释中所述:</p>
<pre><code>s_list = [s1, s2, s3, s4]
def hashtag_words(string_list):
words = [s.split(" ") for s in string_list]
hashcounts = [["#" in word for word in wordlist].count(True) for wordlist in words]
normcounts = [len(wordlist) - hashcount for wordlist, hashcount in zip(words, hashcounts)]
sel_strings = [s for s, h, n in zip(string_list, hashcounts, normcounts) if h>1 if n in (1,2,3)]
return sel_strings
hashtag_words(s_list)
>['Makeupby Antonia #makeup #makeupartist #makeupdolls #abhcosmetics',
'Makeupby Antonia asia #makeup #makeupartist #makeupdolls']
</code></pre>