Multimodal composition is the practice of making meaning through multiple modes — text, image, sound, video, spatial arrangement, interactivity — rather than through alphabetic text alone.
Kathleen Blake Yancey argued in her 2004 CCCC Chair’s Address that composition must expand beyond print literacy to encompass the full range of composing practices people already use [@yancey2004]. Students compose in multiple modes daily — social media posts, video, code, presentations — but writing instruction often treats alphabetic text as the only legitimate form. Anne Wysocki and Cynthia Selfe extended this into practical pedagogy, arguing that writing instruction must account for visual, spatial, and interactive rhetorics [@wysocki2004] [@selfe2009].
Multimodal composition doesn’t replace alphabetic writing — it contextualizes it. Text remains a mode, but it’s one mode among several, and effective communication often requires combining modes. A technical document that includes diagrams, code examples, and step-by-step screenshots is multimodal. So is a research vault that combines text, cross-references, and hierarchical navigation.
Annette Vee’s work on coding literacy extends this further, arguing that computer programming is becoming a literacy — a socially organized symbolic system — parallel to alphabetic writing [@vee2017]. This frames code not as a tool writers use but as a composing practice with its own rhetorical conventions.
Stacey Pigg’s research on distributed writing shows how contemporary professional writing occurs across fragmented digital environments — email, documentation platforms, collaborative editors, social media — extending the cognitive process model to account for tool-mediated composition [@pigg2014].
For this vault, multimodal composition raises practical questions:
- Markdown as mode: the vault’s Markdown format supports text, headings, lists, tables, links, and embedded images — but not video, audio, or interactive elements. The vault’s writing conventions are optimized for this specific modal range.
- Cross-references as composition: the vault’s internal linking system is a form of spatial/navigational composition. How pages connect is as much a composing decision as what each page says.
- Code as writing: technical documentation that includes code examples is multimodal — the code and the prose serve different rhetorical functions and follow different conventions.
Related terms
- genre — multimodal composition raises questions about whether genres are defined by mode or by social action
- document design — document design is the practice of composing across visual and textual modes
- rhetoric and composition — the discipline that studies multimodal composition
- discourse community — different discourse communities privilege different modes