rtcstats-pre-processor

A Go library and CLI tool that compresses WebRTC event logs into a compact format optimized for LLM token consumption.

Given raw rtcstats JSONL dumps (arrays of [eventName, scope, payload, timestamp]), the processor:

Abbreviates field names (n, s, p, ts, dt)
Extracts and flattens handler-specific payloads (SDP digests, ICE candidates, stats reports)
Supports absolute, delta, or both timestamp modes
Reports input/output size with reduction percentage

Installation

CLI:

go install rtcstats/cmd/rtcstats@latest

Library:

go get rtcstats

CLI Usage

rtcstats [flags] <input-file>

Flag	Description
`-o`, `--output`	Output file (default: stdout)
`--ts`	Timestamp mode: `absolute`\|`delta`\|`both` (default: `absolute`)
`--pretty`	Pretty-print JSON output
`-q`, `--quiet`	Suppress stats logging to stderr
`--sample`	Enable adaptive sampling for getstats events
`--sample-n`	Sampling interval: keep every Nth getstats (default: `5`)
`--sample-ctx`	Context window: samples before/after interesting moments (default: `2`)

Examples:

# Process file, output to stdout
rtcstats events.jsonl

# Write to file
rtcstats -o compressed.jsonl events.jsonl

# Delta timestamps, pretty-printed
rtcstats --ts delta --pretty events.jsonl

# Pipe to another tool, suppress stats
rtcstats -q events.jsonl | jq .

# Discard output, show stats only
rtcstats -o /dev/null events.jsonl

# Enable adaptive sampling (keep every 5th getstats + interesting moments)
rtcstats --sample events.jsonl

# Sample every 10th getstats with wider context window
rtcstats --sample --sample-n 10 --sample-ctx 3 events.jsonl

Package Usage

File-to-file

import "rtcstats"

result, err := rtcstats.ProcessStats("input.jsonl", "output.jsonl",
    rtcstats.WithTimestampMode(rtcstats.TSDelta),
    rtcstats.WithLogger(rtcstats.StderrLogger()),
)
// result.Reduction => 0.73 (73% smaller)

Streaming (io.Reader / io.Writer)

Works with HTTP handlers, stdin/stdout piping, or any io stream.

import "rtcstats"

result, err := rtcstats.Process(r, w,
    rtcstats.WithTimestampMode(rtcstats.TSAbsolute),
)

In-memory

Useful for serverless functions, tests, or batch processing.

import "rtcstats"

output, result, err := rtcstats.ProcessBytes(inputBytes,
    rtcstats.WithPrettyPrint(),
)

Stats-only analysis

import (
    "io"
    "rtcstats"
)

result, err := rtcstats.Process(file, io.Discard)
fmt.Printf("%d events, %.0f%% reduction\n", result.EventCount, result.Reduction*100)

Options

Function	Description
`WithTimestampMode(mode)`	`TSAbsolute` (default), `TSDelta`, or `TSBoth`
`WithPrettyPrint()`	Indent JSON output
`WithLogger(l)`	Receive stats log line after processing
`WithSampling()`	Enable adaptive sampling with defaults (N=5, context=2, steady-state=true)
`WithSamplingInterval(n)`	Set sampling interval (keep every Nth getstats). Implies `WithSampling()`
`WithSamplingContext(before, after)`	Set context window around interesting moments. Implies `WithSampling()`

LLM Prompt Injection

The internal/prompts package exports constant strings that translate compressed field names back to human-readable descriptions. Inject these into your LLM system prompts so the model can interpret abbreviated output.

import "rtcstats/internal/prompts"

// Full reference covering stats, events, SDP digests, and scopes
systemPrompt := "You are a WebRTC diagnostics assistant.\n\n" + prompts.FullReference

// Or pick only what you need:
//   prompts.StatsFields      – getstats report field translations (out_v, in_a, etc.)
//   prompts.EventFields      – connection event payload field translations
//   prompts.SDPDigestFields   – SDP summary (sdp_sum) field translations
//   prompts.ScopeReference   – scope string conventions (0-pub, 0-sub, sfu:*)

Available constants:

Constant	Covers
`prompts.StatsFields`	All getstats report types and their abbreviated fields (bs, hbs, fps, etc.)
`prompts.EventFields`	Connection event payload keys (did, sid, uid, ok, dur, etc.) and state enums
`prompts.SDPDigestFields`	SDP digest object fields (sdp_sum: type, codecs, sim_rids, tcc, etc.)
`prompts.ScopeReference`	Scope string meanings (0-pub, 0-sub, sfu:<region>)
`prompts.SamplingReference`	Adaptive sampling and `"="` steady-state marker explanation
`prompts.FullReference`	All of the above concatenated

Adaptive Sampling

For long calls, getstats events dominate the output. Adaptive sampling reduces this with two layers:

Layer 1 — Nth-sample selection: Keep every Nth getstats sample (default N=5), with full resolution preserved around "interesting" moments (packet loss, freeze, FPS drops, jitter/RTT spikes, quality score changes, track additions/removals). A configurable context window (default 2 samples before/after) ensures transitions are captured.

Layer 2 — Steady-state suppression: Within kept samples, report categories that are identical to the previous emission are replaced with "=", further reducing redundancy.

Counter deltas are accumulated correctly across skipped samples — the total change in any counter field is preserved.

result, err := rtcstats.ProcessStats("input.jsonl", "output.jsonl",
    rtcstats.WithSampling(),
    rtcstats.WithSamplingInterval(10),
)

Typical results:

Sample	Without Sampling	Sampling N=5	Sampling N=10
2.3 MB call	73.5% reduction	87.8% reduction	89.9% reduction
1 MB call	80.7% reduction	94.6% reduction	96.4% reduction

Result

ProcessStats, Process, and ProcessBytes all return a *Result:

type Result struct {
    InputBytes  int64   // raw input size
    OutputBytes int64   // compressed output size
    Reduction   float64 // 0-1 fraction (e.g. 0.73 = 73% reduction)
    EventCount  int     // number of events processed
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
cmd/rtcstats		cmd/rtcstats
internal		internal
specs		specs
.DS_Store		.DS_Store
README.md		README.md
go.mod		go.mod
rtcstats		rtcstats
rtcstats.go		rtcstats.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rtcstats-pre-processor

Installation

CLI Usage

Package Usage

File-to-file

Streaming (io.Reader / io.Writer)

In-memory

Stats-only analysis

Options

LLM Prompt Injection

Adaptive Sampling

Result

About

Uh oh!

Releases 2

Packages

Languages

GetStream/rtcstats-pre-processor

Folders and files

Latest commit

History

Repository files navigation

rtcstats-pre-processor

Installation

CLI Usage

Package Usage

File-to-file

Streaming (io.Reader / io.Writer)

In-memory

Stats-only analysis

Options

LLM Prompt Injection

Adaptive Sampling

Result

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages