9 docs tagged with "architecture"

Design & Architecture

Architecture design, technical challenges, and AWS Native and EKS-based implementation approaches for the Agentic AI Platform

EKS GPU Node Strategy

Optimal node strategies for GPU workloads across EKS Auto Mode, Karpenter, MNG, and Hybrid Nodes

EKS Hybrid Nodes Complete Guide

A complete guide for adopting Amazon EKS Hybrid Nodes: architecture, configuration, networking, DNS, GPU servers, cost analysis, and Dynamic Resource Allocation (DRA)

Inference Optimization on EKS

EKS architecture overview for maximizing LLM inference performance: a starting point for vLLM, KV Cache-Aware Routing, Disaggregated Serving, LWS multi-node, and Hybrid Node integration

MoE Model Serving Concept Guide

Architecture concepts, distributed deployment strategies, and performance optimization principles for Mixture of Experts models

Platform Selection

Strategic selection guide for building Agentic AI platforms in AWS environments