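The cause is that simhash-py does not support Python 3.9 (the library is deprecated and supports Python 3.8 at most). If you need simhash functionality, installing the simhash library is enough. Use the following command: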
pip install git+https://github.com/seomoz/simhash-py.git
It can also be built from git:
git submodule update --init --recursive
python setup.py install
or:
pip install simhash-py
Under OS X, you should export MACOSX_DEPLOYMENT_TARGET=10.x (10.9, 10.10...) ...
I see. For the time being, I'm using a pure Python implementation from https://github.com/liangsun/simhash . I hope we can switch back to this faster implementation later. Thank you.
dlecocq mentioned this issue on Jul 1, 2014: "from .table import PyCorpus as Corpus" #3 (Closed). dlecocq closed...
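For readers wondering what a pure Python simhash involves, here is a minimal sketch of the general technique. It is not the code of liangsun/simhash or simhash-py; the function names, the 3-character shingle features, and the use of SHA-256 as the per-feature hash are all assumptions made for illustration.

```python
import hashlib
import re


def tokenize(text, width=3):
    """Split text into overlapping character shingles (an assumed feature choice)."""
    text = re.sub(r"\s+", " ", text.lower()).strip()
    if len(text) <= width:
        return [text]
    return [text[i:i + width] for i in range(len(text) - width + 1)]


def simhash(text, bits=64):
    """Return a `bits`-wide simhash fingerprint of `text` as an int."""
    counts = [0] * bits
    mask = (1 << bits) - 1
    for feature in tokenize(text):
        digest = hashlib.sha256(feature.encode("utf-8")).digest()
        h = int.from_bytes(digest, "big") & mask
        # Each feature votes +1 or -1 on every bit position.
        for i in range(bits):
            counts[i] += 1 if (h >> i) & 1 else -1
    # A bit is set in the fingerprint when its votes sum to a positive value.
    fingerprint = 0
    for i, votes in enumerate(counts):
        if votes > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming_distance(a, b):
    """Count the bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    a = simhash("The quick brown fox jumps over the lazy dog")
    b = simhash("The quick brown fox jumped over the lazy dog")
    print(hamming_distance(a, b))
```

Near-duplicate texts tend to yield fingerprints with a small Hamming distance, so a threshold on that distance can serve as a duplicate check. A C or C++ backed implementation does the same work faster, which is the trade-off the comment above refers to.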
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash. Hash-based image similarity and text similarity - guofei9987/pyLSHash
Simhash and near-duplicate detection - jannson/simhash-py