Tag: litellm
8 items with this tag.
LiteLLM — Configuring Local LLM and MiniMax
How to add both a local Ollama model and the MiniMax cloud API to a single LiteLLM config.yaml.
Hermes Agent Orchestration
Building Hermes as a top-level orchestrator that delegates long-running software development tasks to Claude Code sub-agents via the Claude Agent SDK, all routed through the existing LiteLLM proxy on localhost:4000.
Claude Agent SDK — Hermes Orchestration Pattern
Use the Claude Agent SDK (Python) to let Hermes act as a top-level orchestrator that delegates long-running development tasks to Claude Code sub-agents, all routed through your existing LiteLLM proxy.
Claude Code Per-Project Settings
Configure different Claude models (or LiteLLM backends) per project on the same machine using project-local settings.json.
Minimax + LiteLLM Cost Estimation (Team)
Cost estimates for running Minimax models via a LiteLLM proxy for a development team of 20–30 people at medium daily usage.
Effort — LiteLLM Local LLM Setup
Status: 🟡 In progress (connected, but cold-start timeouts persist). Started: 2026-05-11. Goal: run Claude Code against local qwen3_8b / DeepSeek via LiteLLM proxy. Progress log, 2026-05-11: got /v1/chat/completions working ✅; got /v1/messages (Anthropic pass-through) working ✅; Clau...
Claude Code — Switching Between API Key and Login Auth
Claude Code supports two auth modes.
LiteLLM — Connecting Claude Code to Local LLMs
Use LiteLLM as a translation proxy: Claude Code speaks the Anthropic API format, while local models speak the OpenAI format.