One doc tagged with "rlaif"

Self-Improving Agent Loop (Autosearch)

A 5-stage loop design, with safety mechanisms, that lets self-hosted SLMs learn and improve autonomously from production traces. Based on Karpathy's autosearch concept.