Sensitive-words 是一个基于Python的敏感词过滤工具,其核心算法采用了Aho-Corasick算法,可以在O(n+k)的时间复杂度内完成大量关键词的搜索工作。该工具提供了多种敏感词匹配算法,并支持自定义敏感词库和替换策略。Sensitive-words 还具有开源免费、易于使用的特点,可以快速准确地对文本中的敏感词进行过滤,保护用户的隐私...
sensitive-words 基于DFA算法实现的敏感词过滤算法 在实现文字过滤的算法中,DFA是唯一比较好的实现算法。 DFA 全称为:Deterministic Finite Automaton,即确定有穷自动机。其特征为:有一个有限状态集合和一些从一个状态通向另一个状态的边,每条边上标记有一个符号,其中一个状态是初态,某些状态是终态。但不同于不确...
pythonjavarusttext-miningtext-classificationtextpattern-matchingwordtext-analysismatchertext-processingaho-corasickstring-matchingmatching-enginecontent-moderationsensitive-word UpdatedApr 14, 2025 Rust 谛听- 轻量级、可扩展的敏感词识别与数据脱敏组件 sensitivesensitive-data-discoverysensitive-wordsensitive-words ...
忽略大小写 finalStringtext="fuCK the bad words.";Stringword=SensitiveWordBs.newInstance().findFirst(text); Assert.assertEquals("fuCK", word); 忽略半角圆角 finalStringtext="fuck the bad words.";Stringword=SensitiveWordBs.newInstance().findFirst(text); Assert.assertEquals("fuck", word); 后期road-...
final String text = "fuCK the bad words."; String word = SensitiveWordBs.newInstance().findFirst(text); Assert.assertEquals("fuCK", word); 1. 2. 3. 4. 忽略半角圆角 final String text = "fuck the bad words."; String word = SensitiveWordBs.newInstance().findFirst(text); ...
Updated Aug 22, 2024 Python cjh0613 / strict-sensitive-word Sponsor Star 128 Code Issues Pull requests 轻量超严格敏感词库 keyword sensitive sensitive-word-filter sensitive-words-filter dirtywords sensitive-word sensitive-words Updated May 20, 2023 the...
sensitive-words-filter 是一个敏感词过滤项目,它提供了以下功能: 1. TTMPS(Trigram Token Matrix):支持对文本进行分词和词性标注。 2. DFA(Dynamic Frequency Algorithm):支持动态频率算法,可以根据词频动态调整敏感词的权重。 3. DAT(Dynamic Alignment Algorithm):支持动态对齐算法,可以根据上下文信息动态调整敏感词...
Sensitive Words Example fwwdn/sensitive-stop-words observerss/textfilter 57ing/Sensitive-word qloog/sensitive_words jim19930609/Twitter-Sensitive-Word-Detection Todo List String validation Random substitute symbol Craw word list with Python Construct the Converter with fileURL Ignore special symbols ...
Dictionaries,Databases,Production,Information filters,Information management,Monitoring,PythonTaking Chongqing Three Gorges Medical College as the research object, we design and implement a website group data monitoring system based on sensitive words filtering. The monitoring target of the system is the ...