<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Survey on emsenn.net</title>
    <link>https://emsenn.net/tags/survey/</link>
    <description>Recent content in Survey on emsenn.net</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Sun, 08 Mar 2026 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://emsenn.net/tags/survey/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>NPU LLM Inference</title>
      <link>https://emsenn.net/library/domains/engineering/domains/tech/domains/computing/domains/on-device-inference/npu-llm-inference/</link>
      <pubDate>Sun, 08 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://emsenn.net/library/domains/engineering/domains/tech/domains/computing/domains/on-device-inference/npu-llm-inference/</guid>
      <description>&lt;p&gt;An assessment of running large language models on &lt;a href=&#34;../../terms/neural-processing-unit.md&#34; class=&#34;link-internal&#34;&gt;neural processing units&lt;/a&gt;, with particular attention to the Qualcomm Snapdragon X Elite (45 &lt;a href=&#34;terms/tops.md&#34; class=&#34;link-internal&#34;&gt;TOPS&lt;/a&gt;), as of early 2026.&lt;/p&gt;&#xA;&lt;h2 id=&#34;the-fundamental-tension&#34;&gt;The fundamental tension&lt;/h2&gt;&#xA;&lt;p&gt;NPUs were designed for convolutional neural network (CNN) inference: image classification, object detection, segmentation. These workloads have regular, predictable compute patterns that map cleanly onto fixed-point arithmetic and static data flow. Transformer-based language models are architecturally different: the attention mechanism involves softmax and exponentiation that create extreme dynamic ranges, and autoregressive token generation is inherently sequential. This mismatch is the root of most current limitations.&lt;/p&gt;</description>
    </item>
    <item>
      <title>On-Device Inference Frameworks</title>
      <link>https://emsenn.net/library/domains/engineering/domains/tech/domains/computing/domains/on-device-inference/on-device-inference-frameworks/</link>
      <pubDate>Sun, 08 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://emsenn.net/library/domains/engineering/domains/tech/domains/computing/domains/on-device-inference/on-device-inference-frameworks/</guid>
      <description>&lt;p&gt;A survey of software frameworks for running machine learning models locally on consumer hardware, focusing on Windows and &lt;a href=&#34;../../terms/neural-processing-unit.md&#34; class=&#34;link-internal&#34;&gt;NPU&lt;/a&gt; support as of early 2026.&lt;/p&gt;&#xA;&lt;h2 id=&#34;onnx-runtime&#34;&gt;ONNX Runtime&lt;/h2&gt;&#xA;&lt;p&gt;Microsoft&amp;rsquo;s ONNX Runtime is the primary framework for NPU inference on Windows. Its QNN execution provider integrates Qualcomm&amp;rsquo;s AI Engine Direct SDK, routing &lt;a href=&#34;terms/onnx.md&#34; class=&#34;link-internal&#34;&gt;ONNX&lt;/a&gt; model operations to the Hexagon NPU. Since version 1.18.0, prebuilt packages include the QNN dependencies. Microsoft distributes updates to the QNN execution provider through Windows Update. This is the path jointly recommended by Microsoft and Qualcomm.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Structured-work paradigms: a cross-disciplinary survey</title>
      <link>https://emsenn.net/library/domains/engineering/domains/tech/domains/computing/domains/software-engineering/domains/structured-work-paradigms/texts/cross-disciplinary-survey-2026/</link>
      <pubDate>Sat, 07 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://emsenn.net/library/domains/engineering/domains/tech/domains/computing/domains/software-engineering/domains/structured-work-paradigms/texts/cross-disciplinary-survey-2026/</guid>
      <description>&lt;p&gt;This text surveys structured-work methodologies across three&#xA;disciplines — business/management, government/military, and&#xA;scientific/academic research — to identify patterns that inform&#xA;the design of agent workflows in the&#xA;&lt;a href=&#34;../../../../../../specifications/agential-semioverse-repository/index.md&#34; class=&#34;link-internal&#34;&gt;agential semioverse repository&lt;/a&gt;.&lt;/p&gt;&#xA;&lt;p&gt;The survey was conducted via web research in March 2026. Each&#xA;domain was researched independently to avoid cross-contamination&#xA;of framing. Citations are provided per finding.&lt;/p&gt;&#xA;&lt;h2 id=&#34;cross-cutting-patterns&#34;&gt;Cross-cutting patterns&lt;/h2&gt;&#xA;&lt;p&gt;Five patterns recur across all three domains. These are the&#xA;findings most relevant to the emsemioverse&amp;rsquo;s workflow design.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
