reflex chat
reflex chat is a chat agent that runs the same reflex commands you would. Useful for non-experts, useful for one-off operations, useful for “I forgot the flag for X.”
Start a session
Section titled “Start a session”reflex chatyou › what version am I running and what hardware can I deploy to?
→ show_version({}) → reflex --version → "reflex 0.7.0" → list_targets({}) → reflex targets → [orin-nano, orin, orin-64, thor, desktop]
You're running reflex 0.7.0. Supported targets: - orin-nano — Jetson Orin Nano: 8 GB, fp16 - orin — Jetson AGX Orin 32GB: 32 GB, fp16 - orin-64 — Jetson AGX Orin 64GB: 64 GB, fp16 - thor — Jetson Thor: 128 GB, fp8 - desktop — Desktop GPU (RTX 4090 / A100 / H100): 24 GB, fp16
Want me to show which models support each target, or run reflex doctor?The chat understands the full CLI. Ask it to deploy a model, run benchmarks, validate exports, run reflex doctor, fetch model info — anything you would otherwise type.
How it works
Section titled “How it works”Powered by GPT-5 Mini through a hosted proxy at chat.fastcrest.com. The 17 tools available to it map 1:1 onto reflex CLI subcommands, and each tool runs the actual subprocess on your machine. The chat doesn’t see your data or your filesystem — it can only ask reflex questions and read what reflex prints back.
Free tier: 100 calls per day per machine. No signup, no API key.
Available tools
Section titled “Available tools”The chat agent has access to 17 tools (numbered as they appear in the tool registry):
show_version—reflex --versionlist_targets—reflex targetslist_models—reflex models listmodel_info—reflex models info <id>pull_model—reflex models pull <id>export_model—reflex export <id> --target <target>serve_model—reflex serve <export_dir>bench—reflex bench <export_dir>doctor—reflex doctorvalidate_dataset—reflex validate dataset <path>validate_export—reflex validate export <path>eval—reflex eval <export_dir>traces—reflex tracesreplay—reflex replay <trace_file>inspect_targets—reflex inspect targetsinspect_guard—reflex inspect guarddistill—reflex train distill
47/47 regression tests cover the routing layer.
Privacy
Section titled “Privacy”Conversations stream through chat.fastcrest.com, a Cloudflare Worker that does no logging beyond what’s needed for rate-limit accounting (anonymous IP hash + day bucket, rolling 24h). The Worker forwards to OpenAI; OpenAI’s data-retention policies apply to the prompts and responses.
If you don’t want your prompts to leave your machine, use the bring-your-own-key path with FASTCREST_PROXY_URL pointing at an OpenAI-compatible endpoint you control.
When NOT to use
Section titled “When NOT to use”- If you’re running CI or non-interactive scripts — call the underlying
reflexcommands directly. The chat is for ambiguous, exploratory work. - If you need deterministic exit codes — the chat returns natural-language answers, not exit codes.
- If you’re on an air-gapped robot — the chat needs network access to reach the proxy.
For everything else, reflex chat saves typing.