What the Agent Does
The agent runs in your Kubernetes cluster and:
Reports Capacity
Discovers GPU nodes and reports capacity to Chamber
Manages Workloads
Creates and monitors GPU jobs on your cluster
Collects Metrics
Gathers GPU utilization for dashboards
Handles Scheduling
Implements scheduling decisions from Chamber
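As an illustration of the capacity-reporting step, the sketch below totals GPUs per type from Kubernetes node objects. It is not the agent's actual implementation. It assumes NVIDIA GPUs exposed under the `nvidia.com/gpu` extended resource and typed by the `nvidia.com/gpu.product` node label; the function name `gpu_inventory` and the sample nodes are hypothetical.

```python
from collections import Counter

# Assumption: GPUs are advertised by the NVIDIA device plugin under this resource name.
GPU_RESOURCE = "nvidia.com/gpu"
# Assumption: the GPU model is read from this node label (set by GPU feature discovery).
GPU_TYPE_LABEL = "nvidia.com/gpu.product"


def gpu_inventory(nodes):
    """Return {gpu_type: total_gpu_count} for nodes that advertise GPUs."""
    totals = Counter()
    for node in nodes:
        allocatable = node.get("status", {}).get("allocatable", {})
        count = int(allocatable.get(GPU_RESOURCE, "0"))
        if count == 0:
            continue  # CPU-only node, nothing to report
        labels = node.get("metadata", {}).get("labels", {})
        totals[labels.get(GPU_TYPE_LABEL, "unknown")] += count
    return dict(totals)


# Hypothetical node objects, shaped like the JSON returned by the Kubernetes API.
sample_nodes = [
    {"metadata": {"labels": {GPU_TYPE_LABEL: "NVIDIA-A100-SXM4-80GB"}},
     "status": {"allocatable": {"cpu": "96", GPU_RESOURCE: "8"}}},
    {"metadata": {"labels": {GPU_TYPE_LABEL: "NVIDIA-A100-SXM4-80GB"}},
     "status": {"allocatable": {"cpu": "96", GPU_RESOURCE: "8"}}},
    {"metadata": {"labels": {}},
     "status": {"allocatable": {"cpu": "16"}}},
]

inventory = gpu_inventory(sample_nodes)
```

Here `inventory` holds one entry for the two A100 nodes, and the CPU-only node is skipped.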
Architecture
The agent maintains a secure connection to Chamber and executes scheduling decisions locally.
Resilience
Jobs continue running even if Chamber is temporarily unavailable. The agent operates independently once jobs are scheduled. If the connection is lost, running workloads are unaffected and the agent automatically reconnects.
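This page does not describe how the agent reconnects. One common approach is retrying with capped exponential backoff, sketched below. The function names, the default delays, and the `connect` callable are all hypothetical and chosen for illustration only.

```python
import random
import time


def backoff_delays(base=1.0, cap=60.0, jitter=0.0):
    """Yield delays that double each attempt, capped at `cap` seconds."""
    attempt = 0
    while True:
        delay = min(cap, base * 2 ** min(attempt, 30))
        # Optional jitter spreads out reconnects from many agents.
        yield delay + random.uniform(0, jitter * delay)
        attempt += 1


def reconnect(connect, max_attempts=5, sleep=time.sleep, **backoff):
    """Call `connect` until it succeeds, sleeping between failed attempts."""
    delays = backoff_delays(**backoff)
    for attempt in range(1, max_attempts + 1):
        try:
            return connect()
        except ConnectionError:
            if attempt == max_attempts:
                raise  # give up and surface the last error
            sleep(next(delays))
```

Running workloads are untouched by this loop: it only restores the control connection, which matches the behavior described above.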
Communication
Outbound Only
The agent only makes outbound connections:
- No inbound ports required
- Works behind corporate firewalls
- Encrypted with TLS
Network Requirements
| Host | Port | Purpose |
|---|---|---|
| *.usechamber.io | 443 | Chamber communication |
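To verify that a firewall permits this traffic, a check along the following lines may help. It is a minimal sketch using only the Python standard library. The allowlist mirrors the table above, while the helper names are hypothetical, and any concrete hostname you pass in (the table only gives the wildcard) is your own assumption.

```python
import socket
import ssl
from fnmatch import fnmatchcase

# Egress rule taken from the Network Requirements table.
ALLOWED = [("*.usechamber.io", 443)]


def is_allowed(host, port, rules=ALLOWED):
    """Return True if (host, port) matches one of the egress rules."""
    host = host.lower()
    return any(fnmatchcase(host, pattern) and port == allowed_port
               for pattern, allowed_port in rules)


def can_reach(host, port=443, timeout=5.0):
    """Attempt an outbound TLS handshake; return True on success."""
    context = ssl.create_default_context()  # verifies the server certificate
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host):
                return True
    except OSError:  # covers DNS failures, timeouts, refusals, and TLS errors
        return False
```

Call `can_reach("<your-chamber-host>")` from a pod or node in the cluster, substituting the hostname from your Chamber configuration. A `False` result suggests that DNS, the firewall, or a TLS-intercepting proxy is blocking the connection.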
Resource Requirements
| Resource | Requirement |
|---|---|
| CPU | 100m request, 200m limit |
| Memory | 128Mi request, 256Mi limit |
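The values above use Kubernetes quantity notation: `100m` is 100 millicores (0.1 CPU) and `128Mi` is 128 × 2^20 bytes. The sketch below converts them and checks whether a node has room for the agent's requests. It handles only the suffixes used here plus `Ki` and `Gi`, not the full Kubernetes quantity grammar, and the helper names are hypothetical.

```python
# Requests and limits from the Resource Requirements table,
# shaped like a Kubernetes container `resources` block.
AGENT_RESOURCES = {
    "requests": {"cpu": "100m", "memory": "128Mi"},
    "limits": {"cpu": "200m", "memory": "256Mi"},
}

_BINARY_SUFFIXES = {"Ki": 2**10, "Mi": 2**20, "Gi": 2**30}


def cpu_millicores(quantity):
    """Convert a CPU quantity such as '100m' or '2' to millicores."""
    if quantity.endswith("m"):
        return int(quantity[:-1])
    return int(float(quantity) * 1000)


def memory_bytes(quantity):
    """Convert a memory quantity such as '128Mi' to bytes."""
    for suffix, factor in _BINARY_SUFFIXES.items():
        if quantity.endswith(suffix):
            return int(quantity[:-len(suffix)]) * factor
    return int(quantity)  # plain byte count


def fits(free_cpu, free_memory, resources=AGENT_RESOURCES):
    """Return True if the free node capacity covers the agent's requests."""
    requests = resources["requests"]
    return (cpu_millicores(free_cpu) >= cpu_millicores(requests["cpu"])
            and memory_bytes(free_memory) >= memory_bytes(requests["memory"]))
```

For example, `fits("500m", "1Gi")` is true, while a node with only `50m` of free CPU cannot schedule the agent.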
Security
Permissions
The agent requires these Kubernetes permissions:
The agent does not require cluster-admin. It uses minimal permissions scoped to GPU workload management.
Data Sent to Chamber
| Data | Purpose |
|---|---|
| Node inventory (GPU count, type) | Capacity tracking |
| Job metadata (name, status, GPU request) | Workload management |
| GPU utilization metrics | Dashboards and reporting |

