Architecture

Runlet's current architecture is a small, async, provider-neutral agent loop.

If you are new to the project, start with the getting-started guides in docs/getting-started/ first. This document is the maintainer-oriented view.

Packages

runlet.core: stable contracts such as agents, messages, run results, model request and response types, events, and error classes.
runlet.runtime: runtime orchestration concerns such as the execution loop, context preparation, policies, and state storage.
runlet.integrations: runtime extension surfaces such as tools and hooks.
runlet.testing: fake providers and deterministic test helpers.

A caller invokes the runtime with an agent and input.
The runtime creates a run context and emits run.started.
The runtime builds a model request for the next step.
Hooks can modify the model request.
The context manager enforces the model context budget and compresses or truncates as configured.
The model provider is called.
Hooks can inspect or modify the model response.
Tool calls are validated, passed through hooks, executed, and recorded.
State updates and structured events are emitted.
The loop continues until a final result, policy stop, cancellation, or error.

Runtime.stream() follows the same high-level loop as Runtime.run(), but each assistant step is consumed as a stream.

This keeps streaming tool execution in the runtime loop rather than pushing it into provider-specific code.

Providers are responsible for translating vendor-specific stream events into a Runlet internal event layer. The current event kinds include:

This boundary keeps runlet.runtime provider-neutral while still allowing each provider adapter to map its own SDK semantics into the runtime loop.

Runlet core does not provide an HTTP server, UI, worker queue, multi-tenant control plane, database migration layer, or graph workflow engine.