原因是因为 simhash-py 不支持 python3.9(该库已经废弃,最高支持到 3.8) ,需要使用simhash 功能的话,安装 simhash 库就好了,使用一下命令:
pip install git+https://github.com/seomoz/simhash-py.git It can also be built fromgit: git submodule update --init --recursive python setup.py install or pip install simhash-py under osx, you should exportMACOSX_DEPLOYMENT_TARGET = 10.x (10.9,10.10...) ...
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度 - guofei9987/pyLSHash
import simhash # 6 blocks, 3 bits may differ corpus = simhash.Corpus(6, 3) With a corpus, you can then insert, remove and query the data structure. You may be interested in justanynear-duplicate fingerprint in which case you can usefind_firstorfind_first_bulk. If you're interested in...