RegisterModel fail when not provided with image_uri, even when HuggingFaceModel has all the given specified version needed to query for HuggingFace AWS ECR Inference Container To reproduce model = HuggingFaceModel( model_data="s3://finbert-tone/finbert.tar.gz", entry_point="./hf_scripts/run_gl...
LLM (v)RAM estimator tool and golang package for GGUF models from Ollama and Huggingface across various quantisation and context sizes. At present quantest is in the process of being used as a library in my Gollama and Ingest projects. Usage CLI / Standalone To use the package as a sta...