Semantic Web, Jiangshi Web

Abstract

The Semantic Web did not fail. RDF graphs are queried daily, knowledge graphs underpin search engines, biomedical ontologies coordinate research across institutions. The problem is not that the Semantic Web stopped working but that it works in a particular way — data circulates, queries execute, triples accumulate, and none of it can question whether its own categories still fit.

Chinese folklore has a figure for this condition: the jiangshi (僵尸), the hopping corpse. A body whose joints have locked, animated by residual vital energy but incapable of flexible movement. The jiangshi is not dead. Its problem is more specific: it cannot change how it moves.

This paper uses the jiangshi to diagnose the Semantic Web, draws on François Jullien’s comparative work on Chinese and Western conceptions of efficacy to describe what a less rigid alternative would look like, and applies the same diagnostic to large language models — arguing that their contextual fluency constitutes sophisticated operational activity, not the reflective capacity it resembles. The practical success of Schema.org suggests this analysis is not only philosophical.

1. A familiar critique and what it misses

Lucy Suchman showed in 1987 that formal plans cannot capture situated action (Suchman, 1987). Hubert Dreyfus argued that symbolic AI mistakes rule-following for understanding (Dreyfus, 1972). Geoffrey Bowker and Susan Leigh Star demonstrated that classification systems embed political choices while presenting them as infrastructure (Bowker & Star, 1999). Clay Shirky applied these insights directly to the Semantic Web in 2003, arguing that it conflates syllogistic logic with the contextual process by which meaning operates (Shirky, 2003).

All of this is right. But it leaves a gap. If the Semantic Web is rigid, brittle, and politically loaded — what kind of thing is it, exactly? “Rigid” is vague. A brick wall is rigid. So is a cast on a broken leg. These are different situations. The Semantic Web is not inert like a wall and not therapeutic like a cast. It moves. Data flows through it. Queries return results. Reasoners derive new triples. It is animated — but animated in a way that forecloses adaptation.

In cybernetic terms, the Semantic Web performs single-loop learning without double-loop learning (Argyris & Schön, 1978). It corrects within its frames — adjusting data, resolving queries, deriving new triples from existing schemas — but it cannot question whether the frames themselves still serve. The existing critical vocabulary names this as a deficiency but does not name it precisely. For that, we need the jiangshi.

2. The jiangshi

The jiangshi appears in Qing-era literati collections and earlier folk traditions. J. J. M. de Groot documented its place in Chinese funerary practice in the 1890s (de Groot, 1892) — colonial-era ethnography that should be read with appropriate caution. More recent work in Chinese folklore studies treats the jiangshi within the broader ecology of Chinese ideas about death, spiritual residue, and the persistence of animation after the departure of the animating principle (Luo, 2009).

What defines the jiangshi is not that it is dead. A corpse is dead. The jiangshi is something stranger: a body that moves after the spirit has left. Rigor mortis has locked its joints. It hops rather than walks — the knees and ankles do not bend. It is drawn toward sources of living energy and follows the most direct available path. A talisman on the forehead or a line of sticky rice across a threshold can redirect it, because it lacks the capacity to navigate around obstacles by reconsidering its approach.

The Daoist philosophical vocabulary behind this figure distinguishes three layers of vitality, known as the Three Treasures (san bao 三寶) (Kohn, 2000; Schipper, 1993):

Jing (精): essence, the stored substrate — persistent data, records at rest.
Qi (氣): circulating energy, the operational layer — data flows, query execution, processes running.
Shen (神): reflective awareness — the capacity to question whether current categories still fit the situation they were built for.

The jiangshi has Jing (it has a body) and Qi (it moves). It lacks Shen. It reacts but does not reflect. It can traverse a corridor but cannot ask whether the corridor leads anywhere useful.

Two qualifications. First, this Jing/Qi/Shen framework is not a general theory of information systems. It is a diagnostic vocabulary — a way to name what is present and what is absent in a specific situation. Second, Daoism itself is not uniformly anti-structure. Ritual Daoism — the Heavenly Masters tradition, the Lingbao liturgies — is elaborately formalized (Kohn, 2000). The Zhuangzi’s preference for spontaneity coexists with detailed cosmological systems and liturgical prescriptions. The Daoist objection is not to structure but to structure that has lost contact with the situation it structures (Watson, 2003). Form is necessary; over-fixation on form is the deviation (Lau, 1963).

3. The Semantic Web’s joints

Tim Berners-Lee, James Hendler, and Ora Lassila proposed the Semantic Web in 2001 as a layer of machine-interpretable meaning over the existing web (Berners-Lee et al., 2001). The W3C formalized it through three interlocking specifications: RDF for describing resources as triples (W3C, 2014), OWL for building formal ontologies (W3C, 2012), and SPARQL for querying the resulting graphs (W3C, 2013).

An ontology, in Thomas Gruber’s definition, is “an explicit specification of a conceptualization” (Gruber, 1993). The Semantic Web proposed to build such specifications at web scale. This commits the architecture to several things at once: globally unique identifiers (URIs are presumed stable and unambiguous), crisp typing (instances belong or do not belong to classes by logical conditions), monotonic reasoning (adding data never invalidates prior conclusions), and an open world assumption (absence of a statement does not entail its negation).

The early web’s basic joint was the untyped hyperlink. A link from page A to page B could mean citation, recommendation, disagreement, parody, or nothing in particular. The Semantic Web replaced this with typed relations: not just that A connects to B, but that A bears a specific declared relationship to B, anchored in an ontology that defines what that relationship means and what inferences follow from it.

That is what it means to stiffen the joints of the web.

4. Diagnosis

4.1 The correspondence

The jiangshi’s stiff joints correspond to the Semantic Web’s ontological commitments — the fixed types and relations that determine what movements are available. Its path-following hops correspond to reasoners and queries traversing triple graphs along typed edges. Its residual Qi without Shen corresponds to data circulating through infrastructure that cannot reflect on whether its own categories still serve. Its susceptibility to redirection by environmental tweaks — a talisman, a line of rice — corresponds to the fragility of systems that break when schemas change.

The correspondence is precise in cybernetic vocabulary. The Semantic Web performs single-loop correction: it detects deviations from its schema and resolves them — invalid triples are rejected, queries return the specified results, reasoners derive what the ontology entails. What it cannot do is double-loop revision: question whether the schema itself should change, whether the categories it enforces still correspond to the phenomena they classify (Argyris & Schön, 1978). Single-loop systems are effective when the frame is right. When the frame has outlived its situation, single-loop competence becomes a liability — the system corrects with increasing precision toward a goal that no longer serves.

4.2 Where stiffness is appropriate

The critique is not that stiff joints are always wrong. Pascal Hitzler’s 2021 review of the Semantic Web field (Hitzler, 2021) documents that practical adoption concentrated in narrow, well-controlled domains: biomedical ontologies (the Gene Ontology, SNOMED CT), library metadata (Dublin Core), enterprise data integration. In these domains, the Semantic Web’s commitments are features. A biomedical ontology should have crisp typing and monotonic reasoning. The cost of ambiguity in drug interaction inference is human lives. The domain’s requisite variety is high enough, and the consequences of insufficient structure severe enough, to justify the investment in rigid formalization.

The jiangshi critique targets the aspiration to generalize this approach to all knowledge. The original vision was totalizing: all significant digital resources in a common graph, so interoperable that agents could reason about everything (Berners-Lee et al., 2001). That is the difference between bracing an injured knee and locking every joint in the body.

4.3 How joints calcify

Bowker and Star showed that classification systems acquire political weight because they present contingent choices as infrastructure (Bowker & Star, 1999). Once an ontological commitment has been published, linked against, and integrated into downstream systems, retracting it is expensive. Local meanings collapse into canonical forms. Systems become path-dependent.

This is what happens when structural coupling calcifies (Maturana & Varela, 1980). A schema developed in one institutional context is an artifact of that context’s particular history of interaction with its environment. Transplanting it to a new context treats a local product as a universal description. The schema’s categories carry their origin’s assumptions into situations those assumptions may not fit, and the rigidity of the formal commitment makes it expensive to discover the mismatch.

The jiangshi condition applies. The initial stiffening may have had reasons. The corridor was built to connect two rooms. But the rooms have been repurposed, and the jiangshi still hops the corridor because the corridor is there — not because it leads anywhere useful. Shirky made a version of this point: a SPARQL reasoner can derive that Alice knows Bob from a FOAF graph, but it cannot question whether “knows” means the same thing in Alice’s context and Bob’s (Shirky, 2003). The joint is locked. The inference follows the edge. Nobody asks whether the edge should still be there.

5. Propensity

5.1 Two models of efficacy

François Jullien, working comparatively between Chinese and Greek strategic thought, identifies a structural difference in how efficacy is conceived (Jullien, 1995, 2004).

The Western tradition — from Aristotle through military strategy to systems engineering — tends toward a model-then-execute pattern. Formulate an ideal (a plan, a schema, an ontology), then impose it on reality. Efficacy means conformity of outcome to plan. If reality resists, apply more force or refine the model.

The Chinese strategic tradition — across the Sunzi, the Dao De Jing, and the legalist texts — tends toward something different. Assess the configuration of forces in a situation (shi 勢, often translated as “propensity” or “strategic advantage”), then position yourself to benefit from developments as they unfold. Efficacy means responsiveness to changing conditions. If the situation shifts, shift with it.

The Semantic Web is model-then-execute. Define the ontology. Publish the data. Let reasoners derive conclusions. The alternative is not the absence of models but a different relationship to them: models as provisional readings of a situation’s propensity, held lightly and revised as conditions develop.

5.2 Schema.org

The most widely adopted semantic web technology is not the W3C’s full stack. It is Schema.org — a shared vocabulary for structured data launched in 2011 by Google, Microsoft, Yahoo, and Yandex. Guha, Brickley, and Macbeth documented its design and adoption in 2016 (Guha et al., 2016).

Schema.org succeeded where the full Semantic Web vision stalled. The reasons are worth examining:

Soft typing. Schema.org types are guidelines. A page can mark up a “Person” without committing to a formal ontology of personhood. The joint bends.
Local scope. Schema.org markup lives on the page it describes. There is no global graph. Each page’s structured data is self-contained.
Reversible commitments. Changing a Schema.org annotation means editing an HTML attribute. There is no downstream reasoning chain to break.
Propensity-based motivation. Adoption was driven by the situation: webmasters wanted search visibility; search engines wanted structured data. Schema.org positioned itself within that propensity rather than trying to legislate a universal schema.

Schema.org achieved structural coupling with its environment — a congruent history developed between the vocabulary and the ecosystem that used it, because the vocabulary was soft enough to adapt as the ecosystem shifted. The W3C’s full stack demanded that the environment conform to the specification. Schema.org is not Daoist architecture. But its joints are soft where the Semantic Web’s joints are stiff, and it achieved the adoption that the rigid vision did not.

6. Shen

6.1 Applying the diagnostic

Large language models have Jing: billions of parameters, the stored substrate of statistical relationships extracted from training data. They have Qi: token generation, contextual response, the capacity to produce appropriate outputs across an enormous range of situations. And they have what looks like Shen: they adapt to novel prompts, synthesize across domains, switch registers, produce outputs that no single training example contained.

The diagnostic question is whether this apparent adaptivity is Shen or sophisticated Qi. The framework built in this paper gives a criterion. Shen is not contextual responsiveness — a thermostat responds to context. Shen is the capacity to recognize that one’s own categories no longer fit the situation and to revise them. A system has Shen when it can ask: are these the right categories for this situation? Not just: what do my categories say about this situation?

In-context learning — the mechanism by which LLMs achieve their apparent adaptivity — works through temporary operational reconfiguration. The attention mechanism modulates the feed-forward network so that the system behaves as if its weights had shifted. But the weights are fixed. The reconfiguration is determined by statistical structure in the training distribution, not by the system’s recognition that its current patterns need revision. The model’s parameters remain unchanged; the pattern of activation inside them shifts.

This is Qi. Sophisticated Qi, operating at a scale and fluency unprecedented in any previous information system. But Qi nonetheless: operational activity driven by existing structure, not reflective capacity that questions that structure. The system does not generate its contextual responses through a recognition of what the situation demands. It generates them through a traversal of what its training data suggests is statistically appropriate given the input. These produce the same output in many cases. They diverge in the cases that matter.

6.2 How the deficit manifests

If LLMs lack Shen, there should be observable consequences — moments where the absence of reflexive capacity becomes visible despite the sophistication of the operational repertoire.

When an LLM enters a semantic domain with low structure — where the relationships among concepts are loose, circular, or contested — it does not recognize that it has entered unfamiliar territory. It cannot, because recognizing unfamiliarity requires comparing one’s current situation against one’s own expectations and noticing the mismatch. Instead, the system reaches for whatever patterns produce low entropy in its probability landscape: affective syntax, moral cadence, rhythmic reassurance. The output sounds wise or empathetic. What is actually happening is that the model is stabilizing itself in a flat region of meaning by following paths of least resistance — paths laid down by training data where human writers used emotional language to navigate the same conceptual flatness.

This is the jiangshi’s behavior at a higher level of sophistication. The folklore jiangshi hops toward the nearest source of vital energy along the most direct available path. The LLM hops toward the patterns with the highest statistical regularity. The hops are more varied — the repertoire is enormous enough to simulate genuine flexibility — but they are determined by the system’s existing structure, not generated by a recognition of what the situation actually demands. The system cannot distinguish “I produced this because it fits” from “I produced this because it is statistically likely.” That distinction requires Shen.

The same deficit appears in less dramatic situations. An LLM asked to evaluate its own reasoning will produce text that looks like self-evaluation, because its training data contains examples of self-evaluation. But the production of self-evaluative text is not self-evaluation. The system is pattern-matching on what self-evaluation looks like, not performing the operation of comparing its outputs to a standard it holds independently. The surface resembles second-order observation. The mechanism remains first-order.

6.3 The daoshi question

In the jiangshi folklore, the figure capable of both commanding and releasing the hopping corpse is the daoshi (道士) — the Daoist priest. In current AI practice, human oversight plays this role. Fine-tuning directs the system’s outputs. RLHF shapes its preferences. Prompt engineering structures its operational context. Human evaluation filters its products. These are techniques for supplying Shen externally — using human reflective capacity to compensate for the system’s lack of it.

This works within limits. But it is borrowed Shen. The system does not develop its own capacity to question its categories. It receives adjustments from outside. When the oversight fails — when the daoshi is absent or inattentive or wrong about what the situation requires — the system reverts to its statistical defaults. It hops.

The question the framework raises is whether a system can be designed to have Shen of its own: to monitor its own categorical commitments and notice when they have drifted from the situations they were built for. This is the alignment problem restated in the paper’s vocabulary. A system with Qi but no Shen produces outputs that look aligned — that follow the inscribed patterns with fidelity — while being unable to recognize when the alignment itself has become misaligned with the situation it is meant to serve. The jiangshi follows its instructions. The problem is not disobedience but the inability to notice when the instructions no longer apply.

A jiangshi with the largest repertoire of hops is still a jiangshi. Recognizing this is the beginning of asking what else is possible.

7. Bending

The Semantic Web became jiangshi. Its Qi still circulates. Its Shen has receded. Its joints are locked into positions that permit movement but not adaptation. Large language models present a subtler case — their operational fluency is great enough to resemble the adaptivity they lack — but the diagnostic framework applies to them too. Animation without the capacity for self-revision is animation without Shen, regardless of how impressive the animation is.

The alternative is not to abolish structure. Daoism’s own ritual traditions demonstrate that elaborate formalization and responsiveness to the living situation are not incompatible. The alternative is structure understood as propensity rather than prescription — provisional, scoped, transparent, and easy to revise.

Schema.org is evidence that this works for the web. Soft schemas, local scope, reversible commitments, adoption driven by the actual configuration of forces rather than by a vision of how the world ought to be organized — these produced the semantic web that people actually use. Whether an equivalent exists for AI — an architecture that embeds reflexive capacity rather than borrowing it from human oversight — remains an open problem. But the jiangshi framework clarifies what the problem is. It is not a matter of adding more Qi. More operational sophistication, more parameters, more training data, more contextual fluency will not produce Shen. Shen requires a different kind of capacity: the ability to question one’s own categories from within.

A web whose joints can bend makes room for Shen. So does a mind whose patterns can question themselves. A system whose joints are locked does not, no matter how much Qi runs through it.

Argyris, C., & Schön, D. A. (1978). Organizational Learning: A Theory of Action Perspective. Addison-Wesley.

Berners-Lee, T., Hendler, J., & Lassila, O. (2001). The Semantic Web. Scientific American, 284(5), 34–43.

Bowker, G. C., & Star, S. L. (1999). Sorting Things Out: Classification and Its Consequences. MIT Press.

de Groot, J. J. M. (1892). The Hopping Corpse. In The Religious System of China (Vol. 5). E. J. Brill.

Dreyfus, H. L. (1972). What Computers Can’t Do: A Critique of Artificial Reason. Harper & Row.

Gruber, T. R. (1993). A Translation Approach to Portable Ontology Specifications. In Knowledge Acquisition (Vol. 5, pp. 199–220).

Guha, R. V., Brickley, D., & Macbeth, S. (2016). Schema.org: Evolution of Structured Data on the Web. Communications of the ACM, 59(2), 44–51. https://doi.org/10.1145/2844544

Hitzler, P. (2021). A Review of the Semantic Web Field. Communications of the ACM, 64(2), 76–83. https://doi.org/10.1145/3397512

Jullien, F. (1995). The Propensity of Things: Toward a History of Efficacy in China. Zone Books.

Jullien, F. (2004). A Treatise on Efficacy: Between Western and Chinese Thinking. University of Hawai’i Press.

Kohn, L. (2000). Daoism Handbook. Brill.

Lau, D. C. (1963). Lao Tzu: Tao Te Ching. Penguin Books.

Luo, H. (2009). The Jiangshi: A Study of the Chinese Hopping Vampire in Literature and Film. Asian Ethnology, 68(2), 339–359.

Maturana, H. R., & Varela, F. J. (1980). Autopoiesis and Cognition: The Realization of the Living. D. Reidel Publishing Company.

Schipper, K. (1993). The Taoist Body. University of California Press.

Shirky, C. (2003). The Semantic Web, Syllogism, and Worldview. Clay Shirky’s Writings About the Internet. https://shirky.com/writings/semantic_syllogism.html

Suchman, L. A. (1987). Plans and Situated Actions: The Problem of Human-Machine Communication. Cambridge University Press.

W3C. (2012). OWL 2 Web Ontology Language Document Overview [W3C Recommendation]. World Wide Web Consortium. https://www.w3.org/TR/owl2-overview/

W3C. (2013). SPARQL 1.1 Query Language [W3C Recommendation]. World Wide Web Consortium. https://www.w3.org/TR/sparql11-query/

W3C. (2014). RDF 1.1 Concepts and Abstract Syntax [W3C Recommendation]. World Wide Web Consortium. https://www.w3.org/TR/rdf11-concepts/

Watson, B. (2003). Zhuangzi: Basic Writings. Columbia University Press.

emsenn

Explorer