Table of contents
State
Formal definition
A State is a pair :
where:
- is the state domain — the carrier from which states are drawn; the state domain’s mathematical structure depends on what the system computes: a finite set for a finite automaton, a directed-complete partial order (dcpo) for recursive computations, a metric space for hybrid systems, or any object in a category with enough structure to support a coalgebra
- is the current state — the element of the system currently occupies; carries everything about the system’s history that matters for future behavior; it is the system’s memory, condensed into a single point in
Two invariants. is a state in a system iff:
-
The Markov property (state sufficiency): future behavior given current state is independent of the trajectory by which was reached. For any behavioral predicate about future outputs, the system’s response to given state is the same regardless of which history ended in . The state screens off the past. This is what justifies treating the state as the system’s complete present description: knowing is knowing everything about the past that matters.
-
Behavioral determinacy: together with the transition structure of completely determines the system’s next state and output given any input . There is no hidden information beyond that influences the transition. This is the condition that makes states observable from the outside: an experimenter who knows and the transition structure can predict the system’s behavior without knowing its history.
Observational equivalence. Two states are observationally equivalent () iff no experiment distinguishes them — for every input sequence and every , the output sequences from and agree on the first outputs. The minimal state space of a system is . In the theory of coalgebras (Rutten 2000), iff they are bisimilar — they map to the same element of the final coalgebra.
Shannon: states as information
Claude Shannon (A Mathematical Theory of Communication, 1948) introduced states as a formal device for analyzing channels with memory. A finite-state channel has state set ; the channel transitions from to upon transmitting symbol , described by transition probabilities where is the output symbol. The state encodes the channel’s memory — the residual effect of past transmissions that influences future behavior.
Shannon’s key insight: channel capacity depends only on the stationary distribution over states, not on individual transmission histories. The state makes this separation possible. Without states, analysis would require accounting for all possible histories (an infinite object); with states, memory is finitely encoded.
Shannon entropy of a state distribution over : . A state drawn from carries at most bits of information (uniform distribution). The state space’s size is the system’s memory capacity.
Moore and Mealy: the two machine models
George H. Mealy (A Method for Synthesizing Sequential Circuits, 1955): a Mealy machine where is the state transition and is the output function — output depends on both current state and current input.
Edward F. Moore (Gedanken-Experiments on Sequential Machines, 1956): a Moore machine where — output depends on state alone.
Moore proved every Mealy machine has an equivalent Moore machine (same output sequences on same inputs), and vice versa, with state spaces differing by at most . The models are behaviorally equivalent; the difference is an implementation choice about output timing.
State minimization: the minimum-state machine is unique up to isomorphism. Its states are exactly the equivalence classes of the Myhill-Nerode relation (Myhill 1957, Nerode 1958): iff the machine starting from and from produce identical output sequences on every input sequence. Minimum-state machines are the canonical representatives of their behavioral equivalence classes.
Kripke: states as possible worlds
Saul Kripke (Semantical Considerations on Modal Logic, 1963) gave modal logic its standard semantics using Kripke frames where is a set of possible worlds and is the accessibility relation. Each world is a state — a complete specification of what is true at that point.
A Kripke model adds a valuation assigning each atomic proposition the set of worlds where it holds. Modal operators are quantifiers over accessible states:
Van Benthem’s characterization theorem (1984): two pointed Kripke models and satisfy the same modal formulas iff they are bisimilar — iff there exists a relation with such that -related worlds agree on all atomic propositions and can match each other’s transitions. Bisimulation is simultaneously the right notion of state equivalence and the bridge between modal logic and process theory.
Temporal logics (LTL, CTL) replace the single relation with timed transitions , making state the location in a temporal execution and giving the Kripke framework a dynamical interpretation.
Scott: states as domain elements
Dana Scott (Outline of a Mathematical Theory of Computation, 1970) provided the mathematical framework for states of recursive computations. A Scott domain is a directed-complete partial order (dcpo) with a least element (the “undefined” or “uninitialized” state). The partial order means “ extends the information in ” — is a more defined state.
Scott-continuous functions preserve directed joins: . The least fixed point theorem: every Scott-continuous has . This least fixed point is the state reached by iterating a recursive computation from the undefined state.
In Scott’s denotational semantics, a program denotes a Scott-continuous function on the state domain. Recursive programs are defined as least fixed points:
The domain-theoretic state space is not merely a set but carries an approximation order: partial states () are approximated by more complete states (). The tower is the sequence of computational approximations converging to the final state .
Open questions
- Whether the final coalgebra of a behavior functor — the canonical minimal state space for processes with input type and output type — has a concrete description in terms of the history presheaf; and whether states in the final coalgebra correspond to global sections of some behavioral sheaf over the history site.
- Whether Scott’s domain-theoretic partial order ( = “ is more defined than ”) corresponds to a fiber order in the relational history — whether the approximation order on states is the restriction of the sub-history order to the fiber, with corresponding to the empty history and the directed join corresponding to the history colimit.
- Whether the Myhill-Nerode characterization of minimal state spaces (states = observational equivalence classes) has a topos-theoretic interpretation — whether the Nerode equivalence is a congruence in the sheaf topos and the quotient is the appropriate subobject.
- Whether modal bisimulation (van Benthem) and coalgebraic bisimulation (Rutten) are exactly the same notion in all generality, or whether there exist behavioral equivalences detected by modal logic that differ from coalgebraic bisimulation for non-standard behavior functors.