TIL - How to run Hugging Face models with Ollama
You can now run any GGUF model from the Hugging Face Hub with Ollama, with a single command!
You can use any GGUF quants created by the community on Hugging Face directly with Ollama, without needing to create a new Modelfile. It works like a charm with all llama.cpp-compatible models, at any size from 0.1B up to 405B parameters.
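Per Ollama's Hugging Face integration, the command follows this general pattern (the {username}, {repository}, and {quantization} placeholders here are just illustrative):

ollama run hf.co/{username}/{repository}:{quantization}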
Simply filter for GGUF models on the Hub, select the quant type that fits your needs, and you're done:
ollama run hf.co/bartowski/Qwen2.5.1-Coder-7B-Instruct-GGUF:Q5_K_L
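If you leave off the quantization tag, Ollama picks a default for you (Q4_K_M when it's present in the repo, otherwise a reasonable quant it finds there):

ollama run hf.co/bartowski/Qwen2.5.1-Coder-7B-Instruct-GGUF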