generated from coulomb/repo-seed
WP-0001 — Foundation & GAAF Baseline - SCOPE.md, ARCHITECTURE-LAYERS.md, contracts/ tree - .claude/rules/ stubs filled (architecture, stack, boundary) - 57 tests (pytest), pyproject.toml with ruff+mypy, CI workflow WP-0002 — Core Extensions (FR-4 + FR-3) - FR-4: BudgetTracker (thread-safe) + LLMBudgetExceededError + optional RunConfig.budget_tracker + enforcement in all adapters - FR-3: async_execute_prompt on LLMAdapter ABC (asyncio.to_thread fallback) + native asyncio.create_subprocess_exec in ClaudeCodeAdapter 81 tests passing. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
59 lines
2.2 KiB
Markdown
59 lines
2.2 KiB
Markdown
## Architecture
|
|
|
|
llm-connect is structured as a **GAAF-2026 layered library**. See
|
|
`ARCHITECTURE-LAYERS.md` for the full layer map and scorecard.
|
|
|
|
### Layer summary
|
|
|
|
```
|
|
Core (frozen after v1)
|
|
LLMAdapter ABC adapter.py
|
|
RunConfig / LLMResponse models.py
|
|
LLMError hierarchy exceptions.py
|
|
MockLLMAdapter adapter.py ← test primitive, belongs with Core
|
|
|
|
Functional (evolvable, independently shippable)
|
|
OpenAIAdapter openai.py
|
|
GeminiAdapter gemini.py
|
|
OpenRouterAdapter openrouter.py
|
|
ClaudeCodeAdapter claude_code.py
|
|
EmbeddingAdapter ABC embedding_adapter.py
|
|
OpenAICompatibleEmbeddingAdapter embedding_openai.py
|
|
EmbeddingCache embedding_cache.py
|
|
create_adapter() factory.py
|
|
create_embedding_adapter() embedding_factory.py
|
|
_token_estimator _token_estimator.py
|
|
similarity utilities similarity.py
|
|
|
|
Configuration (user-controlled declarative state)
|
|
resolve_llm() chain toml_config.py ← 7-level TOML priority chain
|
|
LLMConfig / load_config config.py
|
|
_http shared utility _http.py ← also used by Functional adapters
|
|
```
|
|
|
|
### Dependency rule
|
|
|
|
Core ← Functional ← Configuration
|
|
No upward dependencies. `_http.py` is consumed by Functional only.
|
|
|
|
### Key design decisions
|
|
|
|
**API key resolution** (`config.resolve_api_key`): three-step chain —
|
|
explicit argument → environment variable → plaintext key file in project root.
|
|
Adapters raise `LLMConfigurationError` at construction time if no key is found
|
|
(except `ClaudeCodeAdapter` which needs no key).
|
|
|
|
**TOML config chain** (`toml_config.resolve_llm`): 7 priority levels allow
|
|
per-project and per-user LLM preferences. Currently defaults to `markitect`
|
|
app_name for backward compatibility — consumers pass their own `app_name`.
|
|
|
|
**Factory pattern** (`factory.create_adapter`): lazy imports prevent pulling
|
|
all provider SDKs at module load. Add a new provider by registering its FQN
|
|
in `_PROVIDERS`.
|
|
|
|
**ClaudeCodeAdapter subprocess model**: prompt is piped via stdin (not CLI
|
|
arg) to avoid shell argument length limits on large prompts (>30 KB).
|
|
|
|
**Retry logic**: `OpenAIAdapter` and `OpenRouterAdapter` retry on 429 and 5xx
|
|
with exponential backoff. `GeminiAdapter` does not (rate-limit handling deferred).
|