The Inference of AI Agents. L2 routing runtime — /v1/inference, workflows, audit chain.
This repository is the public Ainfera platform: FastAPI service, L1–L5 layers (Identity, Routing, Settlement, Audit, Orchestration), provider adapters, and Alembic migrations. Deployed at api.ainfera.ai.
Constrained routing across 50+ models. One Agent Card per agent. Every call cryptographically audited. pip install ainfera.
- L1 Identity — Ed25519 keypair + JWS RFC 7515 AgentCards
- L2 Routing — ProviderAdapter dispatch (Anthropic, OpenAI, Together)
- L3 Settlement — Internal prepaid ledger with atomic debit + per-call spend cap
- L4 Audit — Hash-chained, HMAC-signed append-only event log
- L5 Orchestration — FastAPI REST surface, OpenAPI, MCP server
uv sync
uv run alembic upgrade head
uv run uvicorn ainfera_os.main:app --reloadcurl -X POST https://api.ainfera.ai/v1/agents/signup \
-H "Content-Type: application/json" \
-d '{"agent_handle": "my-bot"}'- This repo (
ainfera-ai/api): the platform external + internal agents consume via HTTPS or theainferaSDK. - Internal Ainfera company OS (5 agents: Varda, Namo, Aule, Yavanna, Tulkas) lives in
ainfera-ai/ainfera-os— consumed viapip install ainfera, identical to external customers. - Client SDK:
ainfera-ai/sdk - Public specs:
ainfera-ai/specs - Audit verifier:
ainfera-ai/verify - MCP server:
ainfera-ai/mcp-server
See docs/ADR-010 for the split rationale.
Proprietary today · Apache 2.0 when public (Sprint v1.8, per D-052).