Table of contents
Task
Formal definition
A Task is a triple :
where:
- is the goal — the desired outcome or end-state the task aims to achieve; is what makes the task purposive: the task succeeds iff is achieved; the goal is not merely a postcondition but a purposive state — a state the agent is attempting to bring about; the goal gives the task its unit-character: a task is bounded by its goal’s achievement
- is the subtask set or operation set — the constituent elements required to achieve ; at a non-atomic level, is a set of sub-tasks, each with their own structure; at the atomic level, consists of primitive operations — elementary actions that cannot be further decomposed; the hierarchy terminates when every leaf is an operation
- is the plan — the partial ordering and conditional structure specifying how elements of must be sequenced to achieve ; specifies ordering constraints, conditions under which each subtask applies, and the stopping condition (when is deemed achieved); may be underspecified — leaving open the choice between multiple valid orderings
Three invariants. is a task iff:
-
Goal-directed: is defined before and — the task’s purpose determines what counts as a subtask and what counts as a valid plan. An operation not connected to is not part of the task. This distinguishes a task from a mere sequence: a sequence has no goal; a task has a goal that defines when the sequence is complete and which operations are relevant.
-
Hierarchical decomposability: each non-atomic subtask is itself a task where achieving makes progress toward . The hierarchy terminates: there are no infinite chains of decomposition. The atomic operations at the leaves are the irreducible executions from which higher-level behavior is built.
-
Plan-goal coherence: executing the operations in in the order specified by produces a state satisfying . A task whose plan does not achieve its goal is a broken task. Coherence is not a syntactic property of ; it is the semantic requirement that ’s execution actually achieves given the task’s context.
Annett and Duncan: hierarchical task analysis
John Annett and Keith Duncan (Task Analysis and Training Design, 1967, Occupational Psychology) introduced hierarchical task analysis (HTA) as a method for analyzing task structure for training and system design:
An HTA decomposition of a task has:
- A goal for the whole task
- A plan specifying the operations and the order/conditions under which they occur
- An iterative subgoal decomposition: each non-atomic operation is re-analyzed as a sub-goal with its own operations and plan
The stopping criterion — the rule: stop decomposing when the product of (probability that inadequate performance causes failure) and (cost of inadequate performance) falls below an acceptable threshold. High operations require further analysis; low can be treated as atomic.
Plans in HTA specify conditions under which subtasks execute:
- Fixed sequence: “Do 1, then 2, then 3”
- Contingent: “If condition , do 1; else do 2”
- Concurrent: “Do 1 and 2 simultaneously, then do 3 when both complete”
- Cyclic: “Do 1 repeatedly until condition ”
HTA is foundational in human factors and cognitive systems engineering for understanding how complex skilled behaviors are organized and how they can fail.
Erol, Hendler, and Nau: hierarchical task networks
Kutluhan Erol, James Hendler, and Dana Nau (HTN Planning: Complexity and Expressivity, 1994; UMCP: A Sound and Complete Procedure for Hierarchical Task-Network Planning, 1994) formalized hierarchical task network (HTN) planning — making precise the distinction between primitive and compound tasks:
A primitive task is a task directly executable by an operator (a STRIPS-style action schema): it has no further decomposition. An operator is applicable to iff holds in the current state; applying produces effects .
A compound task is a task that must be reduced to primitive tasks via methods. A method for compound task consists of:
- : the precondition under which is applicable
- : a task network — a partially ordered set of subtasks that accomplishes
HTN planning is the process of reducing all compound tasks to primitive tasks via applicable methods:
- Select an unresolved compound task in the current task network
- Select an applicable method for (one whose precondition holds)
- Replace with the subtask network , inheriting ordering constraints
- Repeat until all tasks are primitive
The resulting totally ordered sequence of primitive tasks is the executable plan.
Expressivity: Erol, Hendler, and Nau prove that HTN planning is strictly more expressive than classical (STRIPS-style) planning: any classical planning problem can be encoded as an HTN problem, but not vice versa. HTN planning can directly represent task hierarchies, procedural knowledge, and decomposition constraints that classical planning cannot express without encoding. The complexity: HTN planning with general methods is undecidable; with ground methods (fully instantiated, no variables), it is decidable and EXPTIME-complete.
The primitive/compound distinction is the formal realization of HTA’s leaf/internal-node decomposition: primitive tasks are the leaves of the task hierarchy (corresponding to HTA’s atomic operations at the stopping criterion); compound tasks are the internal nodes (corresponding to HTA’s subgoals requiring further analysis).
Card, Moran, and Newell: GOMS
Stuart Card, Thomas Moran, and Allen Newell (The Psychology of Human-Computer Interaction, 1983) developed GOMS (Goals, Operators, Methods, Selection rules) as a computational cognitive model of routine skilled behavior:
- Goals: desired states; goals have a hierarchical structure — top-level goals decompose into sub-goals, down to method-executable atomic goals
- Operators: elementary actions, each with a fixed execution time; the Keystroke Level Model (KLM, Card et al. 1980) assigns durations: = keystroke (0.28s), = pointing (1.1s), = homing (0.4s), = mental operation (1.35s), = system response (variable)
- Methods: sequences of operators and sub-goals that accomplish a goal; each method is a procedural specification for achieving a goal
- Selection rules: when multiple methods exist for the same goal, rules specify which to use based on contextual conditions
GOMS prediction: total task time = sum of all operator durations executed by the selected methods. This gives quantitative performance predictions without experiments.
GOMS is the first formal cognitive model of task performance: it provides an algorithmic account of how a skilled user performs a task, decomposing it into a structured hierarchy of goals and operators.
Norman: the action cycle
Donald Norman (The Design of Everyday Things, 1988) introduced the action cycle as a model of how intentional action is structured:
- Goal formation: forming a desired state
- Intention to act: forming an intention toward the goal
- Action sequence specification: determining what actions to take
- Action execution: executing the actions
- Perception of system state: perceiving the resulting state
- Interpretation: interpreting the perceived state
- Evaluation: evaluating whether the goal has been achieved
Norman’s two gulfs:
- Gulf of execution: difficulty translating intentions into available actions (gap between intent and affordance)
- Gulf of evaluation: difficulty interpreting system state (gap between system display and user comprehension)
The action cycle makes explicit that a task is not just an execution sequence but a loop involving perception and evaluation — the task includes the agent’s assessment of whether the goal has been achieved.
Open questions
- Whether GOMS selection rules for a goal are formally equivalent to the guard structure of a procedure — making a GOMS model a procedure specification for cognitive task execution, with each goal-with-selection-rules being a guarded-command procedure.
- Whether there is a natural notion of task refinement — where refines iff every acceptable execution of is an acceptable execution of (achieving the more specific goal subsumes achieving the more general) — and whether this corresponds to goal subsumption in the HTA hierarchy.
- Whether the HTA hierarchy corresponds to the tower structure in the relational universe — whether each level of decomposition maps to a tower level, with atomic operations at the base and the overall goal at the top, and whether task correctness (plan-goal coherence) is a property of how these levels connect.
- Whether the stopping criterion (probability of failure times cost) has a formal representation as a decision-theoretic threshold on the fiber — whether it corresponds to a condition on the Heyting algebra value of the subtask’s completion proposition, making the threshold a formal boundedness condition on the fiber.