Short-lived by default
Every capability token has a TTL of 300 seconds or less. Long-lived secrets are not the unit of access.
Capability-bound infrastructure
Lease layer for disaggregated fabrics
Firmware for fabric resources
Fail-closed infrastructure
Tenura is the firmware boundary for disaggregated GPU, memory, and storage fabrics. Tenants share metal under capability tokens — TTL-bounded, audience-scoped, fenced on teardown failure.
Every GPU, memory pool, and storage namespace becomes a time-bounded lease. One capability model, one teardown contract, one audit log — across compute, memory, fabric, and block.
PC BIOS made hardware addressable to operating systems. fabricBIOS makes disaggregated fabric resources addressable to runtimes — with leases and capability tokens built in from day one.
Hardware-enforced teardown, capability-scoped access, no leaked GPU time and no leaked memory pages. The infrastructure layer that fails closed.
What "lease-bound" actually buys you
Every capability token has a TTL of 300 seconds or less. Long-lived secrets are not the unit of access.
If teardown fails on lease expiry, the resource enters a FENCED state. No new lease lands on dirty hardware.
The control plane is QUIC with mutual TLS by default. There is no plaintext mode to opt in or out of.
How it works
A grafOS program asks for resources by shape, not by identity: a GPU, 80 GiB of memory, a queue pair, a 5-minute TTL.
fabric.alloc_gpu().lease_secs(300).acquire()? After scheduler admission, the target fabricBIOS node issues a signed, audience-bound token for exactly that lease and hardware. Replay-protected, fenced by epoch.
cap.audience = node-7 · cap.epoch = 42 · cap.ttl ≤ 300s When the lease ends or is revoked, the data plane is torn down. Failure to tear down does not retry — it fences the resource.
lease expired → teardown(ok) | teardown(fail) → FENCED Why it exists
A 4-GPU node serving eight tenants today usually pins one quarter-GPU per tenant for the day — and runs at 30–40% utilization while everyone holds capacity they aren't using. Tenura replaces that with leases minted on demand and reclaimed automatically when the workload finishes or the TTL elapses.
The split between grafOS and fabricBIOS keeps policy out of the hardware boundary. Programs say what they need; the scheduler admits the request; fabricBIOS mints and enforces the scoped capability at the resource boundary.
Slice one GPU between tenants by lease, not by VM. Idle capacity returns to the pool in seconds.
Reserve a fabric of GPUs for a job, hold it only for the run, release it on completion or failure.
Replace static "GPU per grad student" with leases that expire when a notebook idles.
Lease memory and block from anywhere on the fabric. The lease, not the wire, is what teardown walks.
Read deeper
grafOS and fabricBIOS concepts, the CLI flow, and operator notes.
ArchitectureThe programming model behind leases, capabilities, placement, recovery.
EconomicsHow finer-grained leases change utilization and capacity planning.
Use casesConcrete fits: multi-tenant inference, burst training, shared research, disaggregated memory.
Common questions
Beta access
We review applications weekly. Approved teams get a one-click signup link by email and a short onboarding call to size the workload. A workload note helps us route access to teams that can exercise the fabric meaningfully.
After approval
One line installs the CLI and joins your cell.
curl -fsSL https://get.tenura.systems/install.sh | sh macOS arm64 / Linux x86_64 / Linux arm64. The installer drops grafos on your PATH and registers your beta token.
On Windows? Same release, manual extract.
grafos-0.2.8-x86_64-pc-windows-gnu.tar.gz @ releases.tenura.systems/0.2.8/ Download the tarball, verify the minisig against the same public key as every other platform, extract grafos.exe, and drop on PATH. PowerShell installer is on the roadmap. Quickstart →