
reflex chat

reflex chat is a chat agent that runs the same reflex commands you would. Useful for non-experts, useful for one-off operations, useful for “I forgot the flag for X.”

reflex chat
you › what version am I running and what hardware can I deploy to?
→ show_version({}) → reflex --version → "reflex 0.7.0"
→ list_targets({}) → reflex targets → [orin-nano, orin, orin-64, thor, desktop]
You're running reflex 0.7.0. Supported targets:
- orin-nano — Jetson Orin Nano: 8 GB, fp16
- orin — Jetson AGX Orin 32GB: 32 GB, fp16
- orin-64 — Jetson AGX Orin 64GB: 64 GB, fp16
- thor — Jetson Thor: 128 GB, fp8
- desktop — Desktop GPU (RTX 4090 / A100 / H100): 24 GB, fp16
Want me to show which models support each target, or run reflex doctor?

The chat understands the full CLI. Ask it to deploy a model, run benchmarks, validate exports, run reflex doctor, fetch model info — anything you would otherwise type.

The chat is powered by GPT-5 Mini through a hosted proxy at chat.fastcrest.com. The 17 tools available to it map 1:1 onto reflex CLI subcommands, and each tool runs the actual subprocess on your machine. The chat doesn’t see your data or your filesystem: it can only ask reflex questions and read what reflex prints back.

Free tier: 100 calls per day per machine. No signup, no API key.

The chat agent has access to 17 tools (numbered as they appear in the tool registry):

  1. show_version → reflex --version
  2. list_targets → reflex targets
  3. list_models → reflex models list
  4. model_info → reflex models info <id>
  5. pull_model → reflex models pull <id>
  6. export_model → reflex export <id> --target <target>
  7. serve_model → reflex serve <export_dir>
  8. bench → reflex bench <export_dir>
  9. doctor → reflex doctor
  10. validate_dataset → reflex validate dataset <path>
  11. validate_export → reflex validate export <path>
  12. eval → reflex eval <export_dir>
  13. traces → reflex traces
  14. replay → reflex replay <trace_file>
  15. inspect_targets → reflex inspect targets
  16. inspect_guard → reflex inspect guard
  17. distill → reflex train distill
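
The 1:1 mapping above can be pictured as a small lookup table from tool name to command line. The following Python sketch is an illustration only, not reflex's actual registry: the names TOOL_COMMANDS, build_argv, and run_tool are hypothetical, the argument keys (id, target) are assumed from the placeholders in the list, and only a few of the 17 tools are shown.

```python
import subprocess

# Hypothetical registry: tool name -> argv template.
# Entries written as "{name}" are filled in from the tool-call arguments.
TOOL_COMMANDS = {
    "show_version": ["reflex", "--version"],
    "list_targets": ["reflex", "targets"],
    "model_info": ["reflex", "models", "info", "{id}"],
    "export_model": ["reflex", "export", "{id}", "--target", "{target}"],
    "doctor": ["reflex", "doctor"],
}


def build_argv(tool, args):
    """Expand a tool call into the argv list for the matching CLI command."""
    if tool not in TOOL_COMMANDS:
        raise KeyError(f"unknown tool: {tool}")
    argv = []
    for part in TOOL_COMMANDS[tool]:
        if part.startswith("{") and part.endswith("}"):
            name = part[1:-1]
            if name not in args:
                raise ValueError(f"{tool} requires argument {name!r}")
            argv.append(str(args[name]))
        else:
            argv.append(part)
    return argv


def run_tool(tool, args):
    """Run the command locally and return everything it printed."""
    result = subprocess.run(build_argv(tool, args), capture_output=True, text=True)
    return result.stdout + result.stderr
```

Passing an argv list to subprocess.run (rather than a shell string) means a model-supplied argument such as a model id is always treated as a single argument and never interpreted by a shell.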

The routing layer is covered by 47 regression tests, all passing (47/47).

Conversations stream through chat.fastcrest.com, a Cloudflare Worker that does no logging beyond what’s needed for rate-limit accounting (anonymous IP hash + day bucket, rolling 24h). The Worker forwards to OpenAI; OpenAI’s data-retention policies apply to the prompts and responses.
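
To make "anonymous IP hash + day bucket" concrete, here is a minimal Python sketch of that style of accounting. It is not the Worker's code: the salt, the truncated SHA-256 hash, the in-memory counter, and the names bucket_key and RateLimiter are all assumptions for illustration, and it uses a fixed UTC day bucket rather than modelling the rolling 24h window. The limit of 100 matches the free-tier quota stated on this page.

```python
import hashlib
from collections import Counter
from datetime import datetime, timezone

DAILY_LIMIT = 100        # free-tier quota stated on this page
SALT = b"example-salt"   # hypothetical; a real service would keep this secret


def bucket_key(ip, now):
    """Build an accounting key from a salted IP hash and the UTC day."""
    ip_hash = hashlib.sha256(SALT + ip.encode("utf-8")).hexdigest()[:16]
    day = now.astimezone(timezone.utc).strftime("%Y-%m-%d")
    return f"{ip_hash}:{day}"


class RateLimiter:
    """Count calls per (IP hash, day) bucket; the raw IP is never stored."""

    def __init__(self, limit=DAILY_LIMIT):
        self.limit = limit
        self.counts = Counter()

    def allow(self, ip, now):
        """Return True and record the call if the bucket is under its limit."""
        key = bucket_key(ip, now)
        if self.counts[key] >= self.limit:
            return False
        self.counts[key] += 1
        return True
```

Only the hashed key is kept, so the stored state records how many calls a bucket has made but not the address that made them.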

If you don’t want your prompts to leave your machine, use the bring-your-own-key path: set FASTCREST_PROXY_URL to an OpenAI-compatible endpoint you control.

reflex chat is a poor fit in a few cases:

  • If you’re running CI or non-interactive scripts: call the underlying reflex commands directly. The chat is for ambiguous, exploratory work.
  • If you need deterministic exit codes: the chat returns natural-language answers, not exit codes.
  • If you’re on an air-gapped robot: the chat needs network access to reach the proxy.

For everything else, reflex chat saves typing.
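
For the CI and exit-code cases above, a script can call the reflex commands directly and propagate their exit codes. The Python sketch below shows one way to do that. The helper names run_step and run_pipeline are hypothetical, and it assumes that reflex commands exit non-zero on failure, which this page implies but does not spell out.

```python
import subprocess
import sys


def run_step(argv):
    """Run one command, pass its output through, and return its exit code."""
    result = subprocess.run(argv, capture_output=True, text=True)
    sys.stdout.write(result.stdout)
    sys.stderr.write(result.stderr)
    return result.returncode


def run_pipeline(steps):
    """Run steps in order; stop at the first non-zero exit code and return it."""
    for argv in steps:
        code = run_step(argv)
        if code != 0:
            return code
    return 0


# Example step list; "./export" is a hypothetical export directory.
STEPS = [
    ["reflex", "doctor"],
    ["reflex", "validate", "export", "./export"],
]
```

A CI job could end with sys.exit(run_pipeline(STEPS)), so the job fails exactly when one of the underlying commands does.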