"token" changed its default meaning from crypto to AI
The word 'token' originally meant a sign or marker. It was adopted by cryptocurrency for blockchain-based digital assets, and later by AI for the smallest unit of text processing. Over the past few years, the dominant mental association has shifted from crypto to AI, reflecting broader technological trends.
dd_688
Jun 07, 2026
If you’ve ever used a large language model — ChatGPT, Gemini, Claude, DeepSeek — you’ve probably encountered the word token. It shows up in your usage dashboard, in model spec sheets, or in some explainer about how AI actually works. If you’ve had any exposure to cryptocurrency, you met it earlier. Token: the thing in your wallet or exchange account that you buy, hold, and transfer.
The same English word carries two entirely different meanings in two technology domains. In one, it is the smallest unit a machine processes when it reads or generates language. In the other, it is a transferable claim to value on a blockchain. Most people have never stopped to consider that these two senses are even related — much less noticed that, over the past few years, the image that first lights up in their mind when they hear token has quietly shifted.
This article traces the word back to its roots, examines how it was successively adopted by cryptocurrency and artificial intelligence — two defining technology waves of the past fifteen years — and tracks how its center of gravity shifted between them.
The Word’s Origins: An Ancient Marker
Start with what token originally means in English.
The Oxford English Dictionary lists 29 senses for the noun [1]. In everyday usage, they boil down to three main clusters: a substitute coin (a round piece of metal or plastic used instead of money), a symbol or keepsake (something that represents a feeling, a fact, or an event), and a technical term in linguistics and computer science — the smallest meaningful unit extracted from a sequence of data. The word also works as an adjective meaning “symbolic, perfunctory” — a token gesture, a token fee.
These senses look scattered, but they share a single etymological root. Token descends from Old English tācen (mark, sign, evidence), which is cognate with German Zeichen (sign). Further back, it traces to the Proto-Indo-European root *deik- (to show, to point), the same root that gave Latin index and English diction [2]. The word’s deep logic has always been the same: it is not the thing itself; it points elsewhere, standing in for something else.
Every subsequent meaning the word acquired is a variation on this logic. In the Middle Ages it meant a keepsake — standing in for a bond of affection. Later it meant a pledge exchanged at the sealing of a contract — standing in for a promise. By the late sixteenth century it had added “a metal disc resembling a coin,” its face value set by whoever issued it — standing in for real money. The token you drop into a subway turnstile or an arcade machine descends from this branch. Its value is determined by what it can be exchanged for.
From substitute coin to keepsake to contractual pledge, token has always done the same thing: standing in for something else. It is not the destination; what it points to is. This logic determined what kind of new concepts would come looking for it.
“Token” — The Crypto Coin
In the blockchain and cryptocurrency world, token underwent a clear process of semantic differentiation, and its starting point was a different word: coin.
In 2008, Satoshi Nakamoto published the Bitcoin white paper, subtitled “A Peer-to-Peer Electronic Cash System.” In 2009, the Bitcoin network went live [3]. Bitcoin’s core technology is a cryptographic distributed ledger: transaction records are not stored on a central server but jointly maintained and verified by every node in the network — hence the term “chain.” The native currency running on this chain was called bitcoin, and since its stated purpose was electronic cash, calling it a coin was natural. Subsequent blockchain projects — Litecoin, Dogecoin, and many others — followed the same convention: each had its own chain, its own ledger, and its own coin serving as the chain’s native currency.
Before Ethereum, the embryonic form of on-chain assets had already appeared in the Bitcoin ecosystem. In 2012–2013, a group of developers proposed the concept of “colored coins”: marking Bitcoin’s smallest units to represent off-chain assets — equity, bonds, even physical commodities. In these discussions, token was already being used to describe “an on-chain marker that represents something else.” This usage was directly inherited by Ethereum’s founder, Vitalik Buterin — he had participated in the colored coins project before creating Ethereum, and one of his motivations for building Ethereum was precisely that what colored coins could do on Bitcoin was too limited.
Ethereum launched in 2015, bringing a key innovation: the smart contract. Smart contracts allowed people to issue assets directly at the code level of Ethereum’s blockchain, without building a chain from scratch. In November of that year, the ERC-20 standard formalized this process [4]: anyone who wrote a contract conforming to ERC-20 could mint their own asset on Ethereum. The colored coins vision finally had truly usable infrastructure.
These assets, parasitic on Ethereum’s chain, needed a name to distinguish them from coins with chains of their own — and the name required almost no debate. Linguistics has a branch called onomasiology, which studies why a new concept ends up landing on a particular word [5]. Here, coin didn’t fit, because it implies “money,” while what ERC-20 produced was, more often than not, a claim on a project’s future — closer to an equity certificate than to cash. Token‘s core meaning — a thing that represents something else — covered this “equity” layer neatly. Vitalik and the early Ethereum community used the word from the start; the ERC-20 standard’s official title is simply “Token Standard.” The derivative concepts that followed — token economics, token rights, tokenization — all grew from the token root. A 2023 linguistic study found that token ranked seventh among high-frequency cryptocurrency terms (behind exchange, crypto, trading, blockchain, asset, and cryptocurrency), defined as “a digital representation of an asset value” [6] — a definition that reads almost like a modern translation of the word’s original meaning.
With the name settled, the numbers exploded. In the years following Ethereum’s launch, ERC-20 tokens proliferated rapidly across a wide range of uses: dollar-pegged stablecoins (USDT migrated to Ethereum in 2017, followed by USDC and DAI), exchange platform tokens, utility tokens for decentralized applications, and vast numbers of projects raising money from the public on the strength of nothing more than a white paper. During the ICO (Initial Coin Offering) frenzy of 2017–2018, thousands of projects raised funds by issuing ERC-20 tokens, with cumulative fundraising exceeding $13 billion over two years [7]. The word token surged in usage within the crypto community, becoming the industry’s standard term for non-coin on-chain assets — and eventually came to be used almost interchangeably with coin.
It was precisely this “equity certificate” connotation that got token into legal trouble. The central regulatory dispute around crypto has always been whether a given token constitutes a security. The SEC’s primary analytical tool for this question is the Howey Test: if an asset qualifies as an “investment contract,” it is a security and falls under the Securities Act. In 2017, the SEC’s investigation report on The DAO was the first to conclude that a digital asset constituted a security; the press release title read: “DAO Tokens, a Digital Asset, Were Securities” [8]. Two years later, the SEC’s analytical framework placed “virtual currencies, coins, and tokens” collectively under the heading of “digital asset” [9]. The debate remains unresolved. The SEC’s concern is economic substance — whether an asset constitutes an investment contract — not etymology. But the fact that token has leaned toward “claim” rather than “currency” since the day it was chosen has, objectively, placed it under the regulatory spotlight.
As token became bound to crypto, the industry’s scale continued to grow. After DeFi (decentralized finance) took off, total value locked rose from roughly $1 billion in early 2020 to approximately $260 billion by December 2021. By 2024, mainstream price-tracking sites listed over ten thousand crypto assets; then, in the second half of the year, meme coin platforms like Pump.fun pushed the number of newly minted on-chain tokens into the millions, the vast majority lacking any sustained liquidity [10]. Each incremental rise in the industry’s scale tightened the association between token and “crypto.”
For people outside the crypto industry, many of blockchain’s core concepts — decentralization, consensus mechanisms, smart contracts — remain, seventeen years after Bitcoin’s birth, too abstract for ordinary people to grasp intuitively. Out of the mountain of crypto-native slang (HODL, whale, shitcoin, to the moon), virtually the only term that broke through into mainstream language was token. Perhaps precisely because the word is plain enough, everyday enough — a coin substitute, a voucher — it became the nearest point of entry for ordinary people trying to understand what this industry actually does.
“Token” — AI’s Building Block
During the years when crypto was busy developing tokens and attaching new meaning to the word, another sense of token had been sitting quietly in the dictionary all along.
In computer science, token has always meant the smallest processing unit extracted from a text. Compilers analyzing code, programs parsing natural language — for decades, the word has been used in this way. Going further back, the logician C.S. Peirce drew the distinction between type and token in 1906: a page with twenty printed instances of the contains twenty tokens, but only one type [11]. This lineage runs straight to today’s AI. It is far older than Bitcoin.
In the context of today’s large language models, token can be understood as follows: before the model processes text, it breaks the input into small fragments — these fragments are tokens. One token corresponds roughly to one English word or a short stretch of punctuation and whitespace. Everything the model does — understanding your question, generating a response — happens token by token. How much you say to the AI and how much it says back ultimately reduces to a token count.
So strictly speaking, AI did not “steal” this word from the blockchain industry, nor did it change its meaning. What happened is subtler: the definition stayed the same, but the thing that “smallest processing unit” now processes has become incomparable to what it once was.
In November 2022, OpenAI released ChatGPT, and a large language model entered ordinary people’s daily lives for the first time. GPT-3 had already stunned the industry, but its usage barrier confined it to the developer community; ChatGPT’s conversational interface removed that barrier, reaching 100 million users within two months [12]. In 2023, GPT-4 introduced multimodal capabilities, extending token from pure text to image comprehension. The industry began to recognize that scaling laws — bigger models, more data, better performance — were not an academic hypothesis but a viable engineering bet. A global model arms race followed: Google’s Gemini, Anthropic’s Claude, X’s Grok, and from China, Kimi, DeepSeek, Qwen, Doubao, and Pangu. In early 2025, DeepSeek used an efficient architecture to push inference costs dramatically lower, triggering an industry-wide price war — the unit price of a token was collapsing, but total call volume was exploding [13]. By 2026, multimodal models and AI agents had become mainstream: a single AI-generated video might consume tens of millions of tokens; an autonomous agent completing a workflow might consume hundreds of thousands [14].
The definition of “smallest processing unit” remains unchanged, but what it carries is unrecognizable. A decade ago, token was a footnote in a compiler manual. Today, it
[truncated for AI cost control]