openlayer-ai
diff --git a/examples/tracing/claude_agent_sdk/claude_agent_sdk_tracing.ipynb b/examples/tracing/claude_agent_sdk/claude_agent_sdk_tracing.ipynb
Lines changed: 284 additions & 0 deletions
diff --git a/pyproject.toml b/pyproject.toml
Lines changed: 1 addition & 1 deletion
diff --git a/src/openlayer/lib/__init__.py b/src/openlayer/lib/__init__.py
Lines changed: 76 additions & 0 deletions
@@ -0,0 +1,284 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/openlayer-ai/openlayer-python/blob/main/examples/tracing/claude_agent_sdk/claude_agent_sdk_tracing.ipynb)\n",
+    "\n",
+    "# Tracing the Claude Agent SDK with Openlayer\n",
+    "\n",
+    "This notebook shows how to enable Openlayer tracing for applications built with Anthropic's [Claude Agent SDK](https://github.com/anthropics/claude-agent-sdk-python). After one line of setup, every `query()` becomes an Openlayer trace with nested steps for assistant turns, tool calls (built-in + MCP), subagents, session metadata, cost, and tokens.\n",
+    "\n",
+    "Three scenarios, building up in complexity:\n",
+    "\n",
+    "1. **Quickstart** — single `query()` with built-in tools (Read / Glob / Grep)\n",
+    "2. **MCP + subagent** — register an in-process MCP tool, dispatch a subagent\n",
+    "3. **Multi-stage orchestration** — wrap multiple `query()` calls inside one outer step so the whole pipeline is a single trace"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 1. Install dependencies"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install openlayer 'claude-agent-sdk>=0.1.81'"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 2. Set environment variables\n",
+    "\n",
+    "You need three environment variables (two API keys and a pipeline ID):\n",
+    "\n",
+    "- `OPENLAYER_API_KEY` — get from [openlayer.com/settings/api-keys](https://app.openlayer.com/settings/api-keys)\n",
+    "- `OPENLAYER_INFERENCE_PIPELINE_ID` — the inference pipeline you want to stream traces to\n",
+    "- `ANTHROPIC_API_KEY` — your Anthropic API key"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import os\n",
+    "\n",
+    "os.environ[\"OPENLAYER_API_KEY\"] = \"YOUR_OPENLAYER_API_KEY\"\n",
+    "os.environ[\"OPENLAYER_INFERENCE_PIPELINE_ID\"] = \"YOUR_INFERENCE_PIPELINE_ID\"\n",
+    "os.environ[\"ANTHROPIC_API_KEY\"] = \"YOUR_ANTHROPIC_API_KEY\""
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 3. Enable tracing — one line\n",
+    "\n",
+    "`trace_claude_agent_sdk()` monkey-patches `claude_agent_sdk.query` and `ClaudeSDKClient` so every subsequent call is auto-traced. It composes with any hooks you've configured yourself — your hooks are not replaced."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from openlayer.lib import trace_claude_agent_sdk\n",
+    "\n",
+    "trace_claude_agent_sdk()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 4. Scenario 1 — quickstart\n",
+    "\n",
+    "A simple `query()` with read-only built-in tools. The resulting trace contains one root `Claude Agent SDK query` AGENT step with nested `CHAT_COMPLETION` turns and `TOOL` calls."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from claude_agent_sdk import ResultMessage, ClaudeAgentOptions, query\n",
+    "\n",
+    "\n",
+    "async def scenario_1():\n",
+    "    options = ClaudeAgentOptions(\n",
+    "        model=\"claude-haiku-4-5\",\n",
+    "        allowed_tools=[\"Read\", \"Glob\", \"Grep\"],\n",
+    "    )\n",
+    "    async for message in query(\n",
+    "        prompt=\"Find any .py files in the current directory and tell me roughly what they do.\",\n",
+    "        options=options,\n",
+    "    ):\n",
+    "        if isinstance(message, ResultMessage):\n",
+    "            print(message.result)  # noqa: T201\n",
+    "\n",
+    "\n",
+    "await scenario_1()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 5. Scenario 2 — in-process MCP tool + subagent\n",
+    "\n",
+    "Register a custom MCP tool that counts files by extension, and dispatch a `code-reviewer` subagent. In the trace, the MCP call appears as a `TOOL` step with `metadata.mcp_server=\"file-stats\"` and `metadata.mcp_tool_name=\"count_files\"`. The subagent dispatch appears as a nested `AGENT` step (`Agent: code-reviewer`) containing the subagent's own assistant turns and tool calls."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from pathlib import Path\n",
+    "from collections import Counter\n",
+    "\n",
+    "from claude_agent_sdk import AgentDefinition, tool, create_sdk_mcp_server\n",
+    "\n",
+    "\n",
+    "@tool(\"count_files\", \"Count files in a directory grouped by extension\", {\"directory\": str})\n",
+    "async def count_files(args):\n",
+    "    target = Path(args[\"directory\"]).expanduser().resolve()\n",
+    "    if not target.is_dir():\n",
+    "        return {\"content\": [{\"type\": \"text\", \"text\": f\"Not a directory: {target}\"}], \"isError\": True}\n",
+    "    counts = Counter()\n",
+    "    for f in target.rglob(\"*\"):\n",
+    "        if f.is_file():\n",
+    "            counts[f.suffix or \"(no ext)\"] += 1\n",
+    "    body = \"\\n\".join(f\"{ext}: {n}\" for ext, n in counts.most_common(20))\n",
+    "    return {\"content\": [{\"type\": \"text\", \"text\": body or \"(empty)\"}]}\n",
+    "\n",
+    "\n",
+    "mcp_server = create_sdk_mcp_server(\"file-stats\", \"1.0.0\", tools=[count_files])\n",
+    "\n",
+    "code_reviewer = AgentDefinition(\n",
+    "    description=\"Briefly reviews a code file for clarity, correctness, and style.\",\n",
+    "    prompt=(\n",
+    "        \"You are a senior code reviewer. Read the file the user names, then return ONE \"\n",
+    "        \"specific observation about its quality. Two sentences max.\"\n",
+    "    ),\n",
+    "    tools=[\"Read\", \"Grep\"],\n",
+    "    model=\"claude-haiku-4-5\",\n",
+    ")\n",
+    "\n",
+    "\n",
+    "async def scenario_2():\n",
+    "    options = ClaudeAgentOptions(\n",
+    "        model=\"claude-haiku-4-5\",\n",
+    "        system_prompt=(\n",
+    "            \"You are a codebase explorer. Count files in the directory, then dispatch \"\n",
+    "            \"the code-reviewer subagent on ONE interesting file. Output a 2-line summary.\"\n",
+    "        ),\n",
+    "        # Subagent tools must also be in the session's allowed_tools.\n",
+    "        allowed_tools=[\"Glob\", \"Read\", \"Grep\", \"Agent\", \"mcp__file-stats__count_files\"],\n",
+    "        mcp_servers={\"file-stats\": mcp_server},\n",
+    "        agents={\"code-reviewer\": code_reviewer},\n",
+    "        permission_mode=\"acceptEdits\",\n",
+    "        max_turns=10,\n",
+    "    )\n",
+    "    async for message in query(\n",
+    "        prompt=f\"Analyze the directory at: {Path.cwd()}\",\n",
+    "        options=options,\n",
+    "    ):\n",
+    "        if isinstance(message, ResultMessage):\n",
+    "            print(message.result)  # noqa: T201\n",
+    "\n",
+    "\n",
+    "await scenario_2()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 6. Scenario 3 — multi-stage orchestration\n",
+    "\n",
+    "When you want multiple `query()` calls to appear as one trace, wrap them in `tracer.create_step()`. Each inner `query()` becomes a nested `AGENT` step under your outer step.\n",
+    "\n",
+    "This example splits an audit workflow into two phases: an inventory query, then a review query that dispatches a specialist subagent. Both are children of one outer `codebase-audit` AGENT step."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from openlayer.lib.tracing import tracer\n",
+    "from openlayer.lib.tracing.enums import StepType\n",
+    "\n",
+    "\n",
+    "async def phase_inventory():\n",
+    "    options = ClaudeAgentOptions(\n",
+    "        model=\"claude-haiku-4-5\",\n",
+    "        system_prompt=(\n",
+    "            \"Inventory the current working directory and pick ONE .py file. \"\n",
+    "            \"End your last message with: TARGET: <absolute path>\"\n",
+    "        ),\n",
+    "        allowed_tools=[\"Glob\", \"Read\", \"mcp__file-stats__count_files\"],\n",
+    "        mcp_servers={\"file-stats\": mcp_server},\n",
+    "        max_turns=6,\n",
+    "    )\n",
+    "    async for message in query(prompt=f\"Working directory: {Path.cwd()}\", options=options):\n",
+    "        if isinstance(message, ResultMessage):\n",
+    "            for line in reversed((message.result or \"\").splitlines()):\n",
+    "                if line.strip().startswith(\"TARGET:\"):\n",
+    "                    return line.strip()[len(\"TARGET:\"):].strip()\n",
+    "    return None\n",
+    "\n",
+    "\n",
+    "async def phase_review(target):\n",
+    "    options = ClaudeAgentOptions(\n",
+    "        model=\"claude-haiku-4-5\",\n",
+    "        system_prompt=\"Dispatch code-reviewer on the file and return its observation verbatim.\",\n",
+    "        allowed_tools=[\"Agent\", \"Read\", \"Grep\"],\n",
+    "        agents={\"code-reviewer\": code_reviewer},\n",
+    "        permission_mode=\"acceptEdits\",\n",
+    "        max_turns=6,\n",
+    "    )\n",
+    "    async for message in query(prompt=f\"Review this file: {target}\", options=options):\n",
+    "        if isinstance(message, ResultMessage):\n",
+    "            return message.result\n",
+    "    return None\n",
+    "\n",
+    "\n",
+    "with tracer.create_step(name=\"codebase-audit\", step_type=StepType.AGENT) as outer:\n",
+    "    target = await phase_inventory()\n",
+    "    review = await phase_review(target) if target else None\n",
+    "    outer.output = review or \"(no review produced)\"\n",
+    "    outer.log(metadata={\"audited_file\": target})\n",
+    "\n",
+    "print(\"audited:\", target)  # noqa: T201\n",
+    "print(\"\\nreview:\\n\", review)  # noqa: T201"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 7. What to look for in the Openlayer trace\n",
+    "\n",
+    "Open your inference pipeline and click into each trace. You should see:\n",
+    "\n",
+    "**Scenario 1** — a single root `AGENT` step (`Claude Agent SDK query`) with assistant turn(s) and tool calls as children.\n",
+    "\n",
+    "**Scenario 2** — same root, plus a `TOOL` step for the MCP call (with `metadata.mcp_server` and `metadata.mcp_tool_name`) and a nested `AGENT` step named `Agent: code-reviewer` containing the subagent's own chat completions and tool steps.\n",
+    "\n",
+    "**Scenario 3** — one outer `codebase-audit` AGENT step, with two nested `Claude Agent SDK query` AGENT steps inside it (one per phase), and the review phase contains its own `Agent: code-reviewer` nested step.\n",
+    "\n",
+    "Click any `AGENT` step to see `system_prompt`, `agent_config`, `agents_defined`, `options`, and the raw `ResultMessage`. Click any `CHAT_COMPLETION` step for per-turn model, prompt/completion tokens, thinking content, and raw assistant message. Click any `TOOL` step for input, output, latency, and the originating `tool_use_id`."
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "name": "python",
+   "version": "3.10"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
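
Note on `phase_inventory` in Scenario 3: the handoff between the two phases relies on parsing a `TARGET: <absolute path>` line out of `ResultMessage.result`. Below is a minimal standalone sketch of that parsing, lifted out of the notebook cell so it can be checked without calling the SDK. The helper name `extract_target` and the sample string are hypothetical and not part of this PR.

```python
from typing import Optional


def extract_target(result: Optional[str]) -> Optional[str]:
    """Return the path from the last 'TARGET: <path>' line, or None if absent.

    Hypothetical helper mirroring the loop inside phase_inventory().
    """
    prefix = "TARGET:"
    # Scan from the end so that the final TARGET line wins if several appear.
    for line in reversed((result or "").splitlines()):
        stripped = line.strip()
        if stripped.startswith(prefix):
            return stripped[len(prefix):].strip()
    return None


# Made-up model output, for illustration only.
sample = "Found 3 files.\nTARGET: /tmp/old.py\nOn reflection:\nTARGET: /tmp/app.py"
print(extract_target(sample))
```

A bare `TARGET:` line with no path yields an empty string, which the notebook's `if target` check treats the same as no target, so the review phase is skipped in that case.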
@@ -159,7 +159,7 @@ exclude = [
     ".git",
 ]
 
-ignore = ["src/openlayer/lib/*", "examples/*"]
+ignore = ["src/openlayer/lib/*", "examples/*", "tests/integrations/*"]
 
 reportImplicitOverride = true
 reportOverlappingOverload = false
 
@@ -20,6 +20,8 @@
     "trace_portkey",
     "trace_google_adk",
     "unpatch_google_adk",
+    "trace_claude_agent_sdk",
+    "traced_claude_agent_sdk_query",
     "trace_gemini",
     "update_current_trace",
     "update_current_step",
@@ -315,6 +317,80 @@ def unpatch_google_adk():
     return google_adk_tracer.unpatch_google_adk()
 
 
+# ------------------------------ Claude Agent SDK ---------------------------- #
+def trace_claude_agent_sdk(
+    *,
+    inference_pipeline_id=None,
+    truncate_tool_output_chars: int = 8192,
+    capture_thinking: bool = True,
+    redact_mcp_env: bool = True,
+):
+    """Enable Openlayer tracing for the Claude Agent SDK.
+
+    Monkey-patches ``claude_agent_sdk.query`` and ``ClaudeSDKClient`` so every
+    call becomes an Openlayer trace with nested steps for assistant turns,
+    tool calls (including MCP and subagent calls), session metadata, cost,
+    and tokens.
+
+    Requirements:
+        ``claude-agent-sdk>=0.1.81`` must be installed:
+        ``pip install 'claude-agent-sdk>=0.1.81'``
+
+    Args:
+        inference_pipeline_id: Optional Openlayer inference pipeline ID. Falls
+            back to the ``OPENLAYER_INFERENCE_PIPELINE_ID`` env var.
+        truncate_tool_output_chars: Maximum characters of tool output to
+            capture per TOOL step. Defaults to 8192.
+        capture_thinking: Whether to capture ``ThinkingBlock`` content into
+            chat-completion step metadata. Defaults to True.
+        redact_mcp_env: Whether to strip ``env`` and ``headers`` from MCP
+            server config dicts in trace metadata. Defaults to True.
+
+    Example:
+        >>> import os
+        >>> os.environ["OPENLAYER_API_KEY"] = "..."
+        >>> os.environ["OPENLAYER_INFERENCE_PIPELINE_ID"] = "..."
+        >>> os.environ["ANTHROPIC_API_KEY"] = "..."
+        >>> from openlayer.lib import trace_claude_agent_sdk
+        >>> trace_claude_agent_sdk()
+        >>>
+        >>> from claude_agent_sdk import query, ClaudeAgentOptions
+        >>> async for m in query(prompt="hello", options=ClaudeAgentOptions(model="claude-haiku-4-5")):
+        ...     ...
+    """
+    # pylint: disable=import-outside-toplevel
+    from .integrations import claude_agent_sdk as _integration
+
+    return _integration.trace_claude_agent_sdk(
+        inference_pipeline_id=inference_pipeline_id,
+        truncate_tool_output_chars=truncate_tool_output_chars,
+        capture_thinking=capture_thinking,
+        redact_mcp_env=redact_mcp_env,
+    )
+
+
+def traced_claude_agent_sdk_query(*, prompt, options=None, inference_pipeline_id=None, **kwargs):
+    """Per-call wrapper around ``claude_agent_sdk.query()`` (alternative to global init).
+
+    Returns an async iterator that yields the same messages as ``query()`` while
+    emitting an Openlayer trace as a side effect.
+
+    Example:
+        >>> from openlayer.lib import traced_claude_agent_sdk_query
+        >>> async for m in traced_claude_agent_sdk_query(prompt="hello"):
+        ...     ...
+    """
+    # pylint: disable=import-outside-toplevel
+    from .integrations import claude_agent_sdk as _integration
+
+    return _integration.traced_query(
+        prompt=prompt,
+        options=options,
+        inference_pipeline_id=inference_pipeline_id,
+        **kwargs,
+    )
+
+
 # -------------------------------- Google Gemini --------------------------------- #
 def trace_gemini(client):
     """Trace Google Gemini chat completions."""