datasette插件添加sql函数实现水母驱动的模糊文本匹配
datasette-jellyfish的Python项目详细描述
数据集水母
datasette插件,添加自定义sql函数以进行模糊字符串匹配,该插件构建在james turk和michael stephens的Jellyfishpython库之上。
交互式演示:
示例:
SELECT soundex("hello");
-- Outputs H400
SELECT metaphone("hello");
-- Outputs HL
SELECT nysiis("hello");
-- Outputs HAL
SELECT match_rating_codex("hello");
-- Outputs HLL
SELECT porter_stem("running");
-- Outputs run
SELECT levenshtein_distance("hello", "hello world");
-- Outputs 6
SELECT damerau_levenshtein_distance("hello", "hello world");
-- Outputs 6
SELECT hamming_distance("hello", "hello world");
-- Outputs 6
SELECT jaro_distance("hello", "hello world");
-- Outputs 0.8181818181818182
SELECT jaro_winkler("hello", "hello world");
-- Outputs 0.890909090909091
SELECT match_rating_comparison("hello", "helloo");
-- Outputs 1
请参见the Jellyfish documentation以了解这些函数的说明。