基于tcc+双向rnns的泰语分词
nokcut的Python项目详细描述
nokcut
基于TCC+双向RNNs的泰语分词
来自A Beginner's Guide to Deep NLP with PyTorch - Dr. Prachya Boonkwan的信用代码
COLAB笔记本:https://colab.research.google.com/drive/1WS08VsjlZGAmCGsoI7AlRm-Do3zo-b-g
由我的最佳语料库训练集训练。(90%培训,10%测试)
ep 6
loss: 0.017879242024514966
f1 : 98.47012481095481
来自最佳语料库测试集的f1
F-measure: 96.94929
Recall: 122271.00000/125850.00000 = 97.15614
Precision: 122271.00000/126387.00000 = 96.74333
Number of incorrect : 3579.00000 words
Wannaphong Phatthiyaphaibun先生 wannaphong@kkumail.com