Alpha / Developer Tool / AI Infrastructure

Synrail catches false-green AI-agent work before you accept it.

AI coding agents can claim that work is done before the proof is trustworthy. Synrail adds a proof-first control layer that separates agent claims from accepted results.

The problem: coding agents can look done before they are proven done.

AI coding agents are powerful, but they often produce plausible completion narratives before the work is actually verified. They may claim tests passed, show a plausible diff, or leave handoff state unclear while the operator still lacks trustworthy proof.

Phantom Tests

Claimed tests that were not actually run or were bypassed by the agent's internal narrative.

Mismatched Proof

Proof artifacts that do not match the changed files or reflect an older state of the codebase.

Narrative Completion

Persuasive explanations of completion instead of concrete, reproducible runtime evidence.

Broken Handoffs

Failed repair cycles that leave the environment in an inconsistent state without a clear next step.

Proof-first acceptance for agentic coding work.

Synrail keeps the boundary between execution and acceptance explicit. An agent saying “done” is not enough. Synrail checks whether proof is real, matched to the task, and verified. If proof is weak, mismatched, stale, or unverified, Synrail blocks acceptance and gives one bounded repair step.

Agent Claim
Proof Artifact
Synrail Check
Accept
Block
Repair
synrail check — output

Loading proof from ./.synrail/proof.json...

Comparing diff against claimed changes...

⚠ Proof Mismatch: claimed "Fix bug in parser" but diff shows "Update README"

Status: Rejected (False-Green Detected)

Recommended repair: "Ensure the actual code changes match the reported intent before claiming success."

How Synrail works

1

Start a bounded agent task

Initialize a task scope that the agent must work within.

2

Produce explicit proof artifacts

The agent generates machine-readable proof: test results, trace logs, and verified diffs.

3

Run Synrail check as the acceptance gate

Validate the artifacts against the actual state of the workspace.

4

Repair or Accept

If non-green, repair only the named gap and check again until success.

Quick Install

git clone https://github.com/USBVadik/synrail
cd synrail
make install-dev
make demo

Standard Workflow

synrail start "Describe the bounded local change"
# agent works...
synrail check

Built for developers using AI coding agents

Tool Operators

Developers using Claude Code, Cursor, Aider, Gemini CLI, or custom agentic loops who want to automate the verification of "done" claims.

Engineering Teams

Teams running repeated small AI-assisted code changes where manual verification of every diff is becoming a bottleneck.

Quality-First Hackers

Operators who still manually verify agent work but want a tool to catch the "easy" hallucinations before they look at the code.

Second Operators

Developers inheriting a failed repair from an AI agent who need an explicit handoff state to continue safely.

What Synrail does

Current status

Synrail is currently a narrow local alpha product. It is focused on one local trusted worktree, one bounded local agent task, explicit proof artifacts, and synrail check as the acceptance gate. It is not yet a broad hosted orchestration platform, a generic CI/CD replacement, or a universal correctness system.

Why NexusshellAI is building Synrail

NexusshellAI is building developer infrastructure for safer AI-assisted software work. As coding agents become more capable, teams need better control over when generated work is allowed to be trusted. Synrail is our first product in that direction: a focused reliability layer for proof, acceptance, bounded repair, and handoff.

Designed for AI coding-agent infrastructure

Synrail is model-agnostic and can be used around different coding-agent workflows. As the product evolves, NexusshellAI plans to evaluate larger model-inference and agent-reliability workloads, including coding-model evaluation, proof-quality checks, false-green detection benchmarks, and multi-agent repair/handoff scenarios.

Try Synrail

Start with the public alpha repo and run the false-green demo.