Making Sense of Web3’s Burgeoning AI Ecosystem

Although the intersection of Web3 and AI has great potential, there is a lot of confusion about this emerging technology in the market today. Mapping out the GPU supply chain, layers of the tech stack, and various competitive landscapes can help investors better understand the ecosystem and make more informed investment decisions, says David Attermann, at M31 Capital.

AccessTimeIconJan 10, 2024 at 4:45 p.m. UTC
Updated Mar 8, 2024 at 7:38 p.m. UTC
AccessTimeIconJan 10, 2024 at 4:45 p.m. UTCUpdated Mar 8, 2024 at 7:38 p.m. UTC
AccessTimeIconJan 10, 2024 at 4:45 p.m. UTCUpdated Mar 8, 2024 at 7:38 p.m. UTC

In just over a year since ChatGPT’s debut release, generative AI has arguably become the most influential global narrative today. OpenAI’s early success drove a surge in investor interest for large language models (LLMs) and AI applications, attracting $25 billion in funding in 2023 (up 5x YoY!), in pursuit of the potential multi-trillion-dollar market opportunity.

You're reading Crypto Long & Short, our weekly newsletter featuring insights, news and analysis for the professional investor. Sign up here to get it in your inbox every Wednesday.

Chart

As I’ve previously written, AI and crypto technologies complement each other well, so it’s not surprising to see a growing AI ecosystem emerging within Web3. Despite all the attention, I’ve noticed a lot of confusion about what these protocols do, what’s hype vs. real, and how they all fit together. This report will map out the Web3 AI supply chain, define each layer in the tech stack, and explore the various competitive landscapes. By the end you should have a basic understanding of how the ecosystem works and what to look out for next.

Web3’s AI tech stack

Chart
Description

Infrastructure Layer

GPU Aggregator

Generative AI is powered by LLMs, which run on high-performance GPUs. LLMs have three main workloads: training (model creation), fine-tuning (sector/topic specialization) and inference (running the model). I’ve segmented this layer into general-purpose GPU, ML-specific GPU, and GPU aggregators, which are characterized by their different workload capabilities and use-cases. These P2P marketplaces are crypto-incentivized to ensure secure decentralization, but it’s important to note the actual GPU processing occurs off-chain.

  • General-purpose GPU: Crypto-incentivized (decentralized) marketplaces for GPU computing power which can be used for any type of application. Given its general-purpose nature, the computing resource is best suited for model inference only (the most used LLM workload). Early category leaders include Akash and Render, but, with many new entrants emerging, it’s unclear how protocol differentiation will play out. Although compute is technically a commodity, Web3 demand for permissionless, GPU-specific compute should continue to grow exponentially over the next decade-plus as we integrate AI more into our daily lives. Key long-term differentiators will be distribution and network effects.
  • ML-specific GPU: These marketplaces are more specific to machine learning (ML) applications and can therefore be used for model training, fine-tuning, and inference. Unlike general-purpose marketplaces, these protocols can better differentiate through the overlay of ML-specific software, but distribution and network effects will also be key. Bittensor has an early lead, but many projects are launching soon.
  • GPU Aggregators: These marketplaces aggregate GPU supply from the previous two categories, abstract away networking orchestration, and overlay with ML-specific software. They are like Web2 VARs (valued-added resellers) and can be thought of as product distributors. These protocols offer more complete GPU solutions that can run model training, fine-tuning, and inference. Io.net is the first protocol to emerge in the category, but I expect more competitors to emerge given the need for more consolidated GPU distribution.
General-Purpose GPU

Middleware Layer

Zero-knowledge inference

The previous layer enables permissionless access to GPUs, but middleware is needed to connect this computing resource to on-chain smart contracts in a trust-minimized manner (i.e., for use by Web3 applications). Enter zero-knowledge proofs (ZKPs), a cryptographic method by which one party (prover) can prove to another party (verifier) that a given statement is true, while avoiding conveying to the verifier any information beyond the fact of the statement’s truth. In our case, the “statement” is the LLMs output given specific input.

  • Zero-Knowledge (ZK) Inference Verification: Decentralized marketplaces for ZKP verifiers to bid on the opportunity to verify (for compensation) that inference outputs are accurately produced by the desired LLM (while keeping the data and model parameters private). Although ZK technology has come a long way, ZK for machine-learning (zkML) is still early days and must get cheaper and faster to be practical. When it does, it has the potential to dramatically open the Web3 and AI design space, by allowing smart contracts to access LLMs in a decentralized manner. Although still early, =nil;, Giza, and RISC Zero lead developer activity on GitHub. Protocols like Blockless are well positioned whichever ZKP providers win since they act as aggregation & abstraction layers (ZKP distribution).
  • Developer Tooling & Application Hubs: In addition to ZKPs, Web3 developers require tooling, software development kits (SDKs) and services to efficiently build applications like AI agents (software entities that carry out operations on behalf of a user or another program with some degree of autonomy, employing representation of the user's goals) and AI-powered automated trading strategies. Many of these protocols also double as application hubs, where users can directly access finished applications that were built on their platforms (application distribution). Early leaders include Bittensor, which currently hosts 32 different “subnets” (AI applications), and Fetch.ai, which offers a full-service platform for developing enterprise-grade AI agents.
ZK Inference

Application Layer

AI Applications

And finally, at the top of the tech stack, we have user-interfacing applications that leverage Web3’s permissionless AI processing power (enabled by the previous two layers) to complete specific tasks for a variety of use-cases. This portion of the market is still nascent, and still relies on centralized infrastructure, but early examples include smart contract auditing, blockchain-specific chatbots, metaverse gaming, image generation, and trading and risk-management platforms. As the underlying infrastructure continues to advance, and ZKPs mature, next-gen AI applications will emerge with functionality that’s difficult to imagine today. It’s unclear if early entrants will be able to keep up or if new leaders will emerge in 2024 and beyond.

Applications

Investor outlook: While I’m bullish on the whole AI tech stack, I believe infrastructure and middleware protocols are better investments today given the uncertainty in how AI functionality will evolve over time. However it does evolve, Web3 AI applications will no doubt require massive GPU power, ZKP technology, and developer tooling and services (i.e. infrastructure & middleware).

Disclosure: M31 Capital has positions in several tokens mentioned in this article.

Edited by Benjamin Schiller.


Learn more about Consensus 2024, CoinDesk's longest-running and most influential event that brings together all sides of crypto, blockchain and Web3. Head to consensus.coindesk.com to register and buy your pass now.


Disclosure

Please note that our privacy policy, terms of use, cookies, and do not sell my personal information has been updated.

CoinDesk is an award-winning media outlet that covers the cryptocurrency industry. Its journalists abide by a strict set of editorial policies. In November 2023, CoinDesk was acquired by the Bullish group, owner of Bullish, a regulated, digital assets exchange. The Bullish group is majority-owned by Block.one; both companies have interests in a variety of blockchain and digital asset businesses and significant holdings of digital assets, including bitcoin. CoinDesk operates as an independent subsidiary with an editorial committee to protect journalistic independence. CoinDesk employees, including journalists, may receive options in the Bullish group as part of their compensation.

David Attermann

David is a Sr. Portfolio Manager at M31 Capital.