Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

ADR 26006: Requirement for Agentic-Class Models for the Architect Phase

Title

Standardizing the use of Agentic-Class LLMs for the aidx Architect Phase to Ensure Logic Rigidity and Execution Integrity.

Status

Proposed

Date

2026-01-15

Context

As defined in ADR 26005, the aidx framework utilizes a Two-Pass Hybrid Bridge pattern to decouple architectural planning from code implementation. This pattern relies on a high-reasoning “Cloud Architect” to generate a standalone artifacts/plan.md file, which is subsequently applied by a “Local Editor”.

The current implementation faces a critical quality bottleneck in the planning stage:

General-purpose models prioritize creativity and conversational nuance, which introduces non-deterministic noise into the technical pipeline. Conversely, Agentic-class models (e.g., Gemini 3 Flash) are purpose-built for high instruction adherence and tool-use precision, making them the appropriate tool for this specific agentic purpose.

Decision

We mandate the use of Agentic-Class models (prioritizing Instruction Adherence over Abstract Synthesis) for the Architect role within the aidx framework. See “General Purpose (Abstract Synthesis) vs Agentic (Instruction Adherence) Models”

  1. Model Classification: Only models verified for “High Instruction Adherence” (e.g., Gemini 3 Flash, Claude 3.7/4.0 Sonnet) are permitted for the Architect role.

  2. Primary Choice: Gemini 3 Flash (in Thinking: High mode) is established as the default Architect model due to its high “Intelligence Density” and 2026 benchmarks for agentic rigidity.

  3. Artifact Enforcement: The Architect must strictly output the artifacts/plan.md using the templates defined in the aidx wrapper to ensure the local editor receives a high-integrity instruction set.

  4. Logic Interrogation: The Architect must be prompted to perform an audit of its own plan, identifying potential implementation bottlenecks for the local 14B editor before finalization.

Consequences

Positive

Negative

Alternatives

References

Participants

  1. Vadim Rudakov

  2. Senior AI Architect (Gemini 3 Flash)