Summary
Establish standard input/output practices for all ASR scripts, grounded in research into the endeavor’s theory of tools (including MUD heritage), then apply those standards to improve infer-triage-frontmatter with enrichment provenance tracking, model- aware skip logic, and the infrastructure for agent skill proficiency measurement.
Motivation
Three problems converge:
-
Scripts lack standard I/O conventions. Each script (enrich- triage, infer-triage-frontmatter, mine-triage-relevance, etc.) handles output, logging, error reporting, and progress differently. There is no specification for what a script MUST produce. This makes scripts hard to compose, hard to monitor, and hard for less-capable agents to invoke reliably.
-
Enrichment provenance is lost. When infer-triage-frontmatter enriches a file, it writes
triage-status: enriched— but not which script version did it, which model was used, or when. This makes it impossible to answer: “should a better model re-enrich this?” or “what version of the tool produced this enrichment?” -
No model ranking for skill execution. The progressive automation policy (001) describes moving from inference-heavy to delegable to procedural. But there is no mechanism for encoding which models are trusted at which skill levels, or for a script to decide “a weaker model already did this, I should re-do it with a stronger one.” This is a new theory direction: agent skill proficiency.
Research phase
Before writing specifications or modifying scripts, research these:
Script I/O in the endeavor’s theory
The MUD tradition (LPMud driver/mudlib separation) provides a derivation source for how the endeavor thinks about the relationship between infrastructure (driver/engine) and application logic (mudlib/content). The ASR’s scripts are infrastructure; the skills that invoke them are application logic. The driver/mudlib pattern says: the driver provides a standard interaction surface (I/O, object lifecycle, permissions); the mudlib builds on that surface. Our scripts need the same: a standard interaction surface that skills and MCP tools can rely on.
Research questions:
- What does the MUD driver provide that our scripts currently lack? (Structured output, error envelopes, progress reporting, permission checks, lifecycle management)
- What do modern CLI conventions (12-factor app, Unix philosophy, structured logging) add?
- What does the error-envelopes theory already specify?
- What should the semiotic-script specification (candidate aspect) contain?
Enrichment provenance
Research questions:
- What metadata should an enrichment stamp contain? Proposed:
triage-enriched-by: <script>-v<version>-by-<model>,date-triage-enriched: <ISO-8601> - Should the enrichment stamp be a single field or structured?
- How does this relate to the semiotic-changelog spec’s provenance model?
Agent skill proficiency
Research questions:
- What does it mean for a model to be “better” at a skill? (accuracy, consistency, coverage, speed, cost)
- How should model rankings be encoded? (per-skill trust tiers, a proficiency matrix, a simple ordered list)
- Where does this fit in the spec family? Agent skill proficiency starts at the agential semioverse level (the concept file is at mathematics/objects/universes/agential-semioverse/concepts/), with ASR-specific encoding (trust lists in skill manifests, provenance stamps in frontmatter) at the implementation level.
- How does this relate to the progressive automation direction? (the ranking encodes which rung of the automation ladder a model occupies for a given skill)
Steps
-
Research: read MUD driver/mudlib architecture, error-envelopes theory, skill-maturity concept, existing script patterns. Write a text documenting findings and the derivation from MUD heritage.
-
Theory: refine the agent skill proficiency concept (at mathematics/objects/universes/agential-semioverse/concepts/). Define what it means for a model to be trusted at a skill at the agential semioverse level. Then derive ASR-specific encoding (trust lists in skill manifests, provenance stamps) as implementation.
-
Specification: write a semiotic-script interaction surface specification (or extend an existing spec) that defines:
- Standard output format (structured JSON envelope)
- Standard error reporting
- Standard progress/logging (stderr, structured)
- Standard exit codes
- Provenance fields (who ran, what version, what model)
-
Apply to infer-triage-frontmatter:
- Replace
triage-status: enrichedwith structured provenance:triage-enriched-by: infer-triage-frontmatter-v<X>-by-<model>date-triage-enriched: <ISO-8601> - Add model-aware skip logic: if a file was enriched by the same or a stronger model, skip it. If enriched by a weaker model, re-enrich.
- Standardize output to match the new script I/O spec.
- Replace
-
Apply to other scripts: update enrich-triage, mine-triage- relevance, index-triage, etc. to match the standard.
-
Update MCP server: ensure the script wrapper in mcp-server.py correctly surfaces the standardized output.
Done when
- A specification (or spec section) defines standard script I/O
- infer-triage-frontmatter writes structured provenance fields
- infer-triage-frontmatter checks provenance before re-enriching
- A concept or theory file defines agent skill proficiency
- At least 2 other scripts conform to the new standard
- A text documents the MUD derivation for script architecture
Dependencies
Benefits from plan 0042 (semiotic-endeavor-specification), which defines conformance requirements. Benefits from the MUD content already in the repo (technology/disciplines/computing/topics/ multi-user-dungeons/).
Log
2026-03-08 — Created from emsenn’s prompt identifying three converging needs: script I/O standardization, enrichment provenance tracking, and agent skill proficiency measurement. MUD heritage identified as derivation source for the theory of tool interaction surfaces.