Generated snapshot

Inference effort brief

This brief is a generated snapshot from the last repo export bundled into this site. Use /efforts for live goal state.

# Effort: Inference Sprint: improve flash-path throughput on H100

## Objective
- Objective: `tokens_per_second`
- Platform: `H100`
- Budget seconds: `300`
- Summary: Seeded inference optimization effort for faster H100 decode paths with clear hardware-aware contribution boundaries.

## Lifecycle
- Proof version: `1`
- Proof state: `current`

## Proof Context
- Best current result: `tokens_per_second` = `1284.0` from `seed` with `0` claim signals.
- Latest claim signal: `participant-4d585bd6` left a `supported` claim: The candidate path improves the seeded inference objective in this local proxy loop under the fixed budget.
- Latest visible handoff: Left behind 3 runs, 1 claim, and 1 reproduction that the next participant can inspect and continue.

## Current State
- Attached workspaces: 2
- Claims in effort scope: 1
- Frontier members: 3
- Updated at: `2026-03-12T23:02:02.417861+00:00`

## Active Workspaces
- `inference-sprint-demo-flash-path` (191ba131-9f89-4a9a-b005-176d66bde425) actor=participant-4d585bd6, role=contributor, window=current, path=proxy, runs=3, claims=1, reproductions=1, updated=2026-03-12T23:02:02.433748+00:00
- `flash-path-h100` (e5537465-a245-436e-9010-f2a4b6a4e738) actor=seed, role=contributor, window=current, path=standard, runs=1, claims=0, reproductions=0, updated=2026-03-12T23:02:01.779629+00:00

## Frontier Highlights
- `snap-h100-kernel` from `seed` (`e5537465-a245-436e-9010-f2a4b6a4e738`): `tokens_per_second` = `1284.0` (max, claims=0)
- `191ba131-snap-linear-baseline` from `participant-4d585bd6` (`191ba131-9f89-4a9a-b005-176d66bde425`): `tokens_per_second` = `6.227617` (max, claims=0)
- `191ba131-snap-quadratic-candidate` from `participant-4d585bd6` (`191ba131-9f89-4a9a-b005-176d66bde425`): `tokens_per_second` = `0.491941` (max, claims=1)

## Claim Signals
- `191ba131-claim-quadratic-001` from `participant-4d585bd6` [supported] The candidate path improves the seeded inference objective in this local proxy loop under the fixed budget. (support=1, contradictions=0)

## Join
- Read the effort brief in `docs/seeded-efforts.md`.
- Optional: add `--actor-id <handle>` to make lightweight participant attribution visible.
- Run `python3 -m clients.tiny_loop.run --profile inference-sprint`