Detects gibberish strings.
Gibberish Detector
This is based on https://github.com/rrenaud/Gibberish-Detector, adapted to work as a Python 3 module.
Example
Quick start:
$ gibberish-detector train examples/big.txt > big.model
$ gibberish-detector detect --model big.model --string "ertrjiloifdfyyoiu"
True
Training on large corpora:
[command missing in source]
Interactive detection:
$ gibberish-detector detect --model big.model --interactive
Entering interactive mode. Press ctrl+d to quit.
Input text: superman
False (2.375)
Input text: ertrjiloifdfyyoiu
True (4.154)
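The numbers in parentheses above are higher for the gibberish input than for the real word. As a rough illustration of how a score with that property can arise, here is a minimal sketch of a character-bigram model that averages the negative log-probability of each character transition. This is an assumption for illustration only: the function names (`train`, `score`), the alphabet, and the smoothing are hypothetical and are not taken from the package, whose actual scoring may differ.

```python
import math

# Hypothetical alphabet for this sketch: lowercase letters plus space.
ALPHABET = "abcdefghijklmnopqrstuvwxyz "


def normalize(text):
    """Lowercase the text and keep only characters in ALPHABET."""
    return [c for c in text.lower() if c in ALPHABET]


def train(text):
    """Build a table of negative log-probabilities for character pairs."""
    # Start every count at 1 (add-one smoothing) so unseen pairs
    # still get a small, nonzero probability.
    counts = {a: {b: 1 for b in ALPHABET} for a in ALPHABET}
    chars = normalize(text)
    for a, b in zip(chars, chars[1:]):
        counts[a][b] += 1
    model = {}
    for a, row in counts.items():
        total = sum(row.values())
        # Rare transitions get a large value, common ones a small value.
        model[a] = {b: -math.log(n / total) for b, n in row.items()}
    return model


def score(model, text):
    """Average negative log-probability per transition (higher = stranger)."""
    chars = normalize(text)
    pairs = list(zip(chars, chars[1:]))
    if not pairs:
        return 0.0
    return sum(model[a][b] for a, b in pairs) / len(pairs)


corpus = "the quick brown fox jumps over the lazy dog and then the dog sleeps in the sun " * 50
model = train(corpus)
print(score(model, "the dog"), score(model, "zxqjvk"))
```

In a sketch like this, a string would be flagged as gibberish when its score exceeds some threshold chosen from the training data; how the package picks its own cutoff is not stated here.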
Installation
pip install gibberish-detector
Usage
$ gibberish-detector -h
usage: gibberish-detector [-h] [--version] {train,detect} ...

positional arguments:
  {train,detect}
    train         Trains a model to be used for gibberish detection.
    detect        Uses a trained model to identify gibberish strings.

optional arguments:
  -h, --help      show this help message and exit
  --version       Display version information.
You can also use it as an imported module:
>>> from gibberish_detector import detector
>>> Detector = detector.create_from_model('big.model')
>>> print(Detector.is_gibberish('ertrjiloifdfyyoiu'))
True
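When using the module on many strings at once, a small wrapper around the `is_gibberish` call can be handy. The sketch below is an illustration, not part of the package: `keep_readable` and `fake_is_gibberish` are hypothetical names, and the stand-in predicate exists only so the example runs without a trained model.

```python
from typing import Callable, Iterable, List


def keep_readable(strings: Iterable[str],
                  is_gibberish: Callable[[str], bool]) -> List[str]:
    """Return only the strings the predicate does not flag as gibberish."""
    return [s for s in strings if not is_gibberish(s)]


def fake_is_gibberish(s: str) -> bool:
    """Hypothetical stand-in: flags strings that contain no vowels."""
    return not any(v in s.lower() for v in "aeiou")


print(keep_readable(["superman", "xkcdq"], fake_is_gibberish))
```

With a trained model, the same helper could presumably be called as `keep_readable(names, Detector.is_gibberish)`, assuming `is_gibberish` takes one string and returns a boolean as in the session above.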