AWS Bedrock

Available for: Python and TypeScript.

Wrap your BedrockRuntimeClient with wrapBedrock() (TypeScript) to trace all model invocations through the Bedrock Converse API — or patch boto3 with patch_bedrock() (Python).

Installation

npm install @zespan/sdk @aws-sdk/client-bedrock-runtime

pip install zespan boto3

Setup

import { BedrockRuntimeClient } from "@aws-sdk/client-bedrock-runtime";
import { zespan } from "@zespan/sdk";

zespan.init({ apiKey: process.env.ZESPAN_API_KEY! });

const bedrock = zespan.wrapBedrock(new BedrockRuntimeClient({ region: "us-east-1" }));

import os
import zespan

zespan.init(api_key=os.environ["ZESPAN_API_KEY"])
zespan.patch_bedrock()

import boto3  # import after patching
client = boto3.client("bedrock-runtime", region_name="us-east-1")

In TypeScript, wrapBedrock() returns a wrapped client instance. In Python, patch_bedrock() patches botocore.client.ClientCreator.create_client — every boto3.client("bedrock-runtime", ...) instance created afterward has its converse() and converse_stream() methods traced automatically, with no wrapping step.

Example

const response = await bedrock.converse({
  modelId: "anthropic.claude-sonnet-4-6-v1",
  messages: [{ role: "user", content: [{ text: "Summarize observability in two sentences." }] }],
});

response = client.converse(
    modelId="anthropic.claude-sonnet-4-6-v1",
    messages=[{"role": "user", "content": [{"text": "Summarize observability in two sentences."}]}],
)

Both wrapBedrock() and patch_bedrock() trace the Converse API (converse / converse_stream) only — the older, model-specific invoke_model / InvokeModelCommand API is not patched. Use converse() for traced calls.

converse_stream() (Python) and streaming Converse calls (TypeScript) are traced end-to-end in both languages, including time-to-first-token.

What gets captured

Field	Details
Model	Bedrock model ID (e.g. `anthropic.claude-sonnet-4-6-v1`)
Input tokens	From `usage.inputTokens` in the Converse response
Output tokens	From `usage.outputTokens` in the Converse response
Cost	Calculated from token counts and Bedrock pricing
Latency	Total invocation duration
Server latency	`metrics.latencyMs` from the Converse response, when present. Python only — the TypeScript wrapper does not currently extract this field
Finish reason	Bedrock’s `stopReason` (e.g. `end_turn`, `tool_use`, `max_tokens`)
Tool calls	Tool name and parsed input from `toolUse` content blocks, both languages

Token extraction requires the model response to include usage metadata. All Anthropic and Llama models on Bedrock return this via the Converse API. Check your model’s response schema if tokens show as zero.

Overview

TypeScript SDK

Python SDK

Advanced SDK Configuration

Integrations

LLM Providers

Agent Frameworks

RAG Frameworks

Vector Databases

Custom / Other

Guides

Installation

Setup

Example

What gets captured

​Installation

​Setup

​Example

​What gets captured

Installation

Setup

Example

What gets captured