A self-hosted AI agent needs to run entirely on your own Ollama or vLLM with no OpenAI key. All four major frameworks claim that support, but only LangGraph and CrewAI wire to a local model with zero workarounds. AutoGen needs a client swap, and Flowise needs one base-URL field. The model, not the framework, is the real reliability ceiling.
Key Takeaways
- All four run on Ollama, but only LangGraph and CrewAI need zero workarounds.
- The small local model, not the framework, is what breaks tool calling.
- Flowise is the only true no-code pick; LangGraph is the most code-heavy.
- Most framework docs still assume an OpenAI key, so budget setup time.
- Use Qwen3 or larger for agents; smaller models drop tool calls under load.
Why Local-First Fitness Is the Axis That Counts
Most “best agent framework” roundups assume you have an OpenAI key and a credit card. The first code sample spins up a hosted client, and the “swap to local” path is a footnote if it shows up at all. Self-hosters ask a sharper question about whether any of these run on their own box with no cloud call.
Botmonster Tech




