10 docs tagged with "bifrost"

Basic Deployment

kgateway installation, HTTPRoute configuration, Bifrost Gateway Mode setup

Coding Tool Integration & Cost Analysis

Aider, Cline, and Continue.dev integration, plus a cost comparison of Bedrock, Kiro, and self-hosting

Custom Model Pipeline Guide

Building a domain-optimized model serving pipeline with LoRA Fine-tuning, Multi-LoRA Hot-swap, and SLM Cascade Routing

Inference Gateway

Routing strategies, deployment, cascade tuning, and implementation examples for kgateway and Bifrost-based 2-Tier inference gateways

Inference Gateway & LLM Gateway Routing Strategy

kgateway + Bifrost/LiteLLM 2-Tier architecture with Cascade Routing, Semantic Router, and Hybrid Routing design patterns

Inference Platform Benchmark: Bedrock AgentCore vs EKS Self-Managed

A benchmark plan that uses Bedrock AgentCore as the baseline and compares it against self-managed EKS (vLLM, llm-d, Bifrost/LiteLLM) across features, performance, and cost

OpenClaw AI Agent Gateway Deployment & Full Observability

Deploy the OpenClaw AI Agent Gateway on EKS with cost optimization, and achieve full observability using Bifrost Auto-Router, Cilium Hubble, and Langfuse

Request Cascading — Intelligent Model Routing

Complexity-based automatic model routing: a comparison of the LLM Classifier, LiteLLM, and vLLM Semantic Router approaches, with a RouteLLM research reference and cost savings

Semantic Caching Strategy

Semantic caching strategy at the LLM Gateway level, with a comparison of implementation options (GPTCache, Redis Semantic Cache, Portkey, Helicone, Bifrost+Redis)

Troubleshooting Guide

Common issues and solutions during Inference Gateway deployment and operations