From the clay tablets of Uruk to the syllabaries of Linear B, from Champollion's bilingual to Hangul's letterforms designed by a king. The independent inventions of writing and the histories of decipherment.
Language is universal among humans; writing is not. Writing has been independently invented at most five times in the history of the species, and most peoples have lived and died without it.
The four (perhaps five) independent inventions: Sumerian cuneiform in Mesopotamia (~3200 BCE), Egyptian hieroglyphs (~3100 BCE, possibly inspired by cuneiform), Chinese oracle-bone script (~1200 BCE, by current evidence), Mesoamerican Zapotec/Maya script (~600 BCE), and contested cases like the Indus script.
This deck surveys the major scripts, the structural types (logographic, syllabic, alphabetic, abugidic, abjadic), the great decipherments โ Champollion on hieroglyphs, Ventris on Linear B, the running argument over the Indus โ and the role of writing in stratifying knowledge.
The earliest unambiguous writing comes from late-fourth-millennium Uruk, in southern Mesopotamia. The signs are pictographic at first โ a head of grain, a fish, a foot โ pressed with a reed stylus into wet clay. By ~2900 BCE the pictograms had abstracted into the wedge-shaped strokes that give cuneiform its name (cuneus, Latin for wedge).
The earliest tablets are administrative โ receipts for grain, rations for workers, livestock counts. Literature comes later. Sumerian is the first written language; cuneiform was adopted to write Akkadian, then Assyrian, Babylonian, Hittite, Hurrian, Elamite, Old Persian. It served as the cosmopolitan script of the ancient Near East for three millennia.
The decipherment was largely the work of Henry Rawlinson, whose 1830sโ40s copying of the Behistun trilingual inscription โ Old Persian, Elamite, Babylonian โ did for cuneiform what the Rosetta Stone did for hieroglyphs.
Egyptian hieroglyphic writing appears around 3100 BCE โ close enough to Sumerian cuneiform that the question of independence is open, though Egyptologists generally treat it as native development with possibly stimulus diffusion of the idea of writing. The script remained in continuous use for over three thousand years, the longest-lived writing system in history.
Egyptian was three scripts in one: hieroglyphic (the formal monumental script, with each sign carved as a discrete picture), hieratic (a cursive form for ink on papyrus), and demotic (a still more cursive script for everyday use). All three encoded the same language with different graphic conventions.
The system mixed logograms (sign = word), phonograms (sign = sound), and determinatives (silent classifiers indicating semantic category). A single text used all three classes simultaneously โ which is what made decipherment so hard, until the Rosetta Stone provided the bilingual key.
The stone was found in 1799 by French soldiers digging fortifications at Rashid (Rosetta) in the Nile Delta. Carved 196 BCE, it bears a decree of Ptolemy V in three scripts โ hieroglyphic, demotic, Greek. The Greek text was readable. Match the names โ and there were Ptolemys and Cleopatras in cartouches โ and the cipher cracks open.
Two scholars raced. The English polymath Thomas Young correctly identified phonetic values for several signs in cartouche-bound royal names (1814โ19). The French linguist Jean-Franรงois Champollion, drawing on his fluent Coptic โ the late Egyptian language preserved in the Christian liturgy โ saw further. His 1822 letter to the Acadรฉmie des Inscriptions, the Lettre ร M. Dacier, established that the script was not purely symbolic but encoded the sounds of an Afroasiatic language.
The Champollion breakthrough is the founding event of modern Egyptology. He died in 1832, aged 41, before completing his grammar. His brother published it posthumously.
The Phoenician script โ twenty-two consonant signs, no vowels โ emerged on the Levantine coast in the late second millennium BCE, derived from earlier Proto-Sinaitic experiments by Semitic-speaking miners in the Egyptian-controlled Sinai. Its signs were acrophonic: each sign began with the sound it represented (๐ค 'aleph, "ox"; ๐ค bet, "house"; ๐ค gimel, "camel").
The Phoenician achievement was the alphabetic principle. Where cuneiform and hieroglyphs required hundreds or thousands of signs, Phoenician got the job done with twenty-two. The system was efficient enough that any literate adult could remember it. Writing, for the first time, escaped the scribal class.
Every alphabetic script in use today โ Greek, Latin, Cyrillic, Hebrew, Arabic, Devanagari and the Brahmic family more generally โ descends from Phoenician through one route or another. The Latin alphabet you are reading is the great-great-grandchild of those twenty-two consonants.
The Greeks adopted the Phoenician alphabet around 800 BCE. They had no use for several Semitic consonants Greek did not have โ 'aleph, he, yod, ayin, waw โ and repurposed them as vowel signs: alpha, epsilon, iota, omicron, upsilon. The first true alphabet โ full representation of consonants and vowels โ was a Greek innovation by adaptation.
The earliest Greek inscriptions are in regional variants โ Euboean, Attic, Ionic, Corinthian โ that take some decades to converge. The Ionic version, formalised at Athens in 403 BCE, became the standard. The same script, with minor adjustments, served Etruscan, then Latin, then most of the languages of Europe.
The Greek innovation mattered for a structural reason. With consonants alone, writing is good for languages where word-roots are predictable consonant patterns (as in Semitic). For Indo-European languages, where vowel choice carries grammatical information (Greek lรฉgล, "I say"; lรฉgeis, "you say"), vowels must be written.
The Romans adopted a variant of Greek through the Etruscans around the seventh century BCE. The classical Latin alphabet of twenty-three letters (no J, U, or W; I and V served double duty) was largely fixed by the late Republic. J and U emerged in the late medieval period; W in the Old English transition.
Latin script became the writing of Western Christianity and through it the writing of European modernity. It now serves more languages than any other script โ every European language west of the Cyrillic line, most African languages with a written tradition, Vietnamese (replacing Chinese characters in the 17thโ20th centuries), Turkish (replacing Arabic in 1928), Indonesian, Malay, Tagalog, and many Indigenous American languages.
The aesthetic conventions โ capitals derived from monumental Roman inscription, lowercase from the Carolingian minuscule of the 8thโ9th centuries, italic from the Italian Renaissance chancery hand โ accumulated over two millennia. The fonts you read are stratified historical artefacts.
Chinese writing in its earliest known form is the oracle-bone script of the late Shang dynasty, about 1200 BCE โ divinatory inscriptions on heated turtle plastrons and ox scapulae. The script was already complex by then, suggesting an earlier development now lost.
The structural principle is logographic with phonetic component. Most characters are ๅฝขๅฃฐๅญ โ radical (semantic key) plus phonetic. The radical ๆจ (tree) plus the phonetic ๅ (tรณng) gives ๆก, a kind of tree pronounced tรณng. About 80% of modern characters are built this way; the remaining 20% are pictographs, ideographs, and compound ideographs of various ages.
Modern Chinese writing uses about 3,000โ4,000 characters for full literacy, with some 7,000 in regular educated use. The system has been continuously legible for over three thousand years โ a Tang-dynasty poem (~750 CE) is readable to a modern educated speaker with little more difficulty than Shakespeare to an English reader.
The Maya script, in continuous use from about 300 BCE to the Spanish conquest, is the most fully understood writing of pre-Columbian America. It is a logosyllabic system โ some signs represent words, others syllables, often combined within a single block.
The decipherment is largely the work of three figures. The Soviet linguist Yuri Knorozov (1922โ1999), reading on a Leningrad tram, recognised in the 1950s that Diego de Landa's mid-16th-century alphabet โ long dismissed as bogus โ preserved syllabic values, not alphabetic ones. Tatiana Proskouriakoff at Harvard (1909โ1985) showed in 1960 that Maya inscriptions recorded political history with names, dates, and dynastic events, not just calendrical and astronomical material. David Stuart, working from the 1980s onward, completed the syllabic-logographic synthesis.
The script is now perhaps 80% readable. The Maya wrote dynastic chronicles, mythology, and ritual โ not, despite earlier fantasies, the secret of an alien lost civilisation, but the records of recognisable human polities arguing about land and lineage.
Sir Arthur Evans excavated the Bronze Age palace at Knossos on Crete from 1900 onward, recovering thousands of clay tablets in two undeciphered scripts. He named them Linear A and Linear B. Linear A, used by the Minoans (~1800โ1450 BCE), remains undeciphered. Linear B, used in the later Mycenaean period (~1450โ1200 BCE), was cracked in 1952.
The decipherer was Michael Ventris, an English architect with no formal classical training, working from his apartment with stunning intellectual energy. The 1952 BBC Third Programme broadcast in which he announced that Linear B encoded an early form of Greek is one of the great moments in the humanities of the twentieth century. He died in a car accident in 1956, aged 34.
The breakthrough came from a grid: tabulating which signs occurred in which positions, hypothesising a syllabic script of CV (consonant-vowel) combinations, then testing which language fit. Greek did. The Linear B tablets โ palace inventories, weapons lists, rosters of slaves โ gave us five hundred years of pre-Homeric Greek civilisation we had not previously been able to read.
Hangul is unique among the world's scripts: it is the only major writing system whose origin is not lost in prehistory but documented in detail. King Sejong the Great of Joseon Korea promulgated it in 1446 with the treatise Hunminjeongeum ("The proper sounds for the instruction of the people").
The system is featural. The basic consonant shapes diagram the position of the speech organs producing them: ใฑ (g/k) shows the back of the tongue against the soft palate; ใด (n) shows the tongue tip against the alveolar ridge; ใ (m) shows the closed lips; ใ (s) shows the teeth; ใ (ng/silent) shows the open throat. Vowels are built from heaven (โข), earth (ใ ก), and human (ใ ฃ). The result is a script linguistically explicit at a level no other writing system attempts.
Hangul was suppressed by the Confucian literati, who preferred Chinese characters. It survived in vernacular use among women and the lower classes; it became the official script after 1945. Sejong's design has held up: Hangul is the most efficient writing system ever invented for the language it serves.
The Brahmi script, attested from the third century BCE in the Ashokan inscriptions across India, is the parent of the Brahmic family โ Devanagari, Bengali, Tamil, Kannada, Telugu, Sinhala, Burmese, Khmer, Thai, Lao, Tibetan, Javanese, Balinese, and dozens more. It is by descendants the most successful script-family after Latin.
Its structural type is the abugida (the term coined by Peter Daniels from Ge'ez): a writing system in which consonant-vowel sequences are written as single units, with an inherent vowel modified by diacritics. เค in Devanagari is "ka"; เคเคพ is "kฤ"; เคเคฟ is "ki"; เคเฅ is "ke". The default vowel is silent in some traditions, voiced in others.
The abugida is the third great structural type of writing โ distinct from the alphabet (consonants and vowels equal), the abjad (consonants only, vowels optional), and the syllabary (each syllable an independent sign). Roughly two billion people read scripts of the Brahmic family today.
The Arabic script, derived from the Nabataean-Aramaic line of the Phoenician tradition, became the script of an empire and a religion in the seventh century. Where Greek added vowels to the Phoenician consonant alphabet, Arabic kept them off the line: an abjad in the strict sense, with optional diacritics for vowels (used in Quran editions and beginner texts, omitted in adult prose).
The script is cursive in design โ each letter has up to four positional forms (initial, medial, final, isolated). Calligraphic traditions โ Kufic, Naskh, Thuluth, Diwani, Ta'liq โ developed across the Islamic world as the highest visual art form, in part because figural representation was theologically constrained.
The script now serves Arabic, Persian, Urdu, Pashto, Sindhi, Uyghur, Kurdish (in some regions), Sorani, and dozens of other languages โ second only to Latin in the number of languages it writes. Several Turkic languages used it until the 20th-century romanisations (Turkish 1928, Azeri repeatedly).
Cyrillic is named for the ninth-century Byzantine missionary Cyril, though strictly the script he and his brother Methodius designed for Slavic-speaking converts was Glagolitic, a now-vanished cousin. Cyrillic, developed by their disciples in the First Bulgarian Empire, took the dominant Greek capitals as its base and added letters for Slavic sounds without Greek equivalents (ะ for "zh", ะจ for "sh", ะฉ for "shch", ะช and ะฌ for hard and soft signs).
The script serves Russian, Bulgarian, Serbian (alternating with Latin), Ukrainian, Belarusian, Macedonian, Mongolian (since 1941), Kazakh and Kyrgyz (with ongoing transitions to Latin), and several languages of the Caucasus and Siberia.
Peter the Great's 1708 reform of Russian Cyrillic โ adopting Latin-style letterforms for civil typography while preserving older shapes for ecclesiastical use โ is the parent of the modern face. The Soviet 1918 reform dropped four letters; the post-1991 transitions in Central Asia have moved several languages off Cyrillic onto Latin.
The Hebrew alphabet of the Hebrew Bible was originally written in the Paleo-Hebrew script, a regional variant of Phoenician. After the Babylonian Exile (~600 BCE), Jewish communities adopted the Aramaic square script, which became standard for Hebrew by the late Second Temple period. The Dead Sea Scrolls (~250 BCEโ70 CE) are mostly in this square script.
Like Arabic, Hebrew is an abjad โ twenty-two consonant signs, vowels indicated by optional diacritical points (the niqqud system formalised by the Tiberian Masoretes in the 7thโ10th centuries CE). Modern Israeli Hebrew is normally written without points; the tradition of Masoretic vocalisation is preserved in religious texts.
The script's revival โ Eliezer Ben-Yehuda's project of remaking Hebrew as a spoken vernacular in late-Ottoman Palestine โ is one of the only successful resurrections of a language as a daily medium after centuries of liturgical-only use. By 1948 Hebrew had native speakers again.
Japanese is normally written with three scripts simultaneously. Kanji โ Chinese characters borrowed from the 5th century onward, used for native and Sino-Japanese vocabulary. Hiragana โ a cursive syllabary developed by Heian-period women writers (Murasaki Shikibu wrote Genji in it, around 1010), now used for grammatical particles and inflectional endings. Katakana โ a more angular syllabary developed by Buddhist monks for annotating Chinese texts, now used for foreign loanwords and emphasis.
The system is more efficient than its complexity suggests. The mixed-script convention disambiguates the deeply homophonous Japanese vocabulary โ many Sino-Japanese words sound identical and would be ambiguous in pure phonetic spelling. Kลen can be ๅ ฌๅ (park), ่ฌๆผ (lecture), ๅพๆด (support), or ้ซ้ (lofty); the kanji disambiguate.
Japanese-script literacy requires learning roughly 2,136 jลyล kanji (the postwar government list), the two syllabaries, and the conventions for switching between them. Few learners outside Japan get all the way there; most get further than they expect.
The Indus Valley civilisation (~2600โ1900 BCE), centred at Harappa and Mohenjo-daro in present-day Pakistan, left around four thousand inscribed objects โ most of them seals, with brief inscriptions averaging five signs. About 400 distinct signs have been catalogued. The script has not been read.
The obstacles are severe. Inscriptions are very short. There is no bilingual. The underlying language is unknown โ proposed candidates include Dravidian (the favoured hypothesis among Asko Parpola and others), Indo-Aryan, an isolate, or a non-linguistic system. Steve Farmer, Richard Sproat, and Michael Witzel argued in 2004 that the Indus signs do not have the statistical properties of true writing โ that they may be a non-linguistic symbol system.
The Farmer-Sproat-Witzel argument has been contested by Rao et al. (2009) using conditional-entropy analysis. The dispute continues. The honest answer: nobody can read it, and there is genuine disagreement about whether anyone ever will.
One of the only writing systems known to have been invented by a single individual whose name we have, working without literacy in any other language. Sequoyah (~1770โ1843), a Cherokee man who could not read English but had observed white people reading, designed a syllabary of 86 characters between 1809 and 1821.
The Cherokee Nation adopted it officially in 1825. By 1830, Cherokee literacy rates exceeded white literacy rates in the surrounding states. The first newspaper printed in an Indigenous American language, the Cherokee Phoenix, began publishing in 1828 in both English and Sequoyah's script.
The achievement is extraordinary. Sequoyah did not borrow the alphabetic principle; he invented a syllabic principle independently, proceeded by trial and error, and produced a working system in roughly a decade. The forms of the characters resemble Latin and Greek letters in some cases but the assignments are arbitrary โ Sequoyah saw the shapes in mission books and reused them as raw material, with no relation to their original phonetic values.
The Hindu-Arabic numeral system โ the digits 0โ9 and the place-value notation we use today โ is Indian by origin and Arabic by transmission. The Bakhshali manuscript (perhaps 3rdโ4th century CE), the Brahmi-derived Nagari numerals of the early medieval period, the formalisation of zero in Brahmagupta's seventh-century treatise Brahmasphuแนญasiddhฤnta.
The numerals reached Baghdad in the eighth and ninth centuries โ al-Khwarizmi's 825 CE work on calculation gave us the word "algorithm" from his name. They reached medieval Europe through the translations of Toledo and through Leonardo of Pisa's 1202 Liber Abaci, which introduced "Indian numerals" to Italian merchants. Adoption was contested for centuries โ the abacists vs the algorists fought into the sixteenth century โ before Hindu-Arabic numerals displaced Roman numerals for general arithmetic.
The decisive innovation was zero as both placeholder and number. Roman numerals have no zero and no place-value structure; division and long arithmetic are essentially impossible without them.
Block printing in China dates from the seventh or eighth century. Movable type was invented by Bi Sheng around 1040 โ clay characters for woodblock arrangement. The technique did not displace block printing in China because Chinese script is too character-rich to make small movable type efficient at scale.
Korea developed metal movable type in the 13th century โ the Jikji, printed in 1377, predates Gutenberg by nearly eighty years. The Korean innovation, however, did not produce a Korean print revolution.
Johannes Gutenberg's mid-fifteenth-century Mainz workshop combined movable metal type, oil-based inks, the press, and a paper supply, into a system that could be replicated. The 42-line Bible (~1455) is the conventional dating point. Within fifty years there were printing presses in over 250 European cities and millions of books in circulation. The democratisation of writing โ already begun with the alphabet โ was completed by the press.
Print created the first incentive for standardised spelling. Manuscript culture had tolerated wide variation; print culture punished it. The first English dictionaries โ Robert Cawdrey 1604, Samuel Johnson 1755 โ codified spelling conventions that had been emerging for two centuries.
The spelling reformers have always been a minority. Noah Webster, in his 1828 American Dictionary of the English Language, established the divergences that distinguish American from British orthography (color/colour, theater/theatre, defense/defence). George Bernard Shaw's alphabet, designed to replace the Latin alphabet for English, was funded by his estate and produced a 1962 Shavian edition of Androcles and the Lion; nobody adopted it.
The Acadรฉmie franรงaise has issued spelling reforms since 1635; the German Rechtschreibreform of 1996 was bitterly contested. Successful systematic reform tends to come from above (Atatรผrk's 1928 romanisation of Turkish) rather than from grassroots petition.
The Unicode Standard, first published in 1991, set out to assign a unique number to every character in every script ever used. The current version (15.x as of 2023) covers roughly 150,000 characters across 161 scripts, plus thousands of historical glyphs, mathematical symbols, dingbats, and the controversial emoji.
Before Unicode, writing on a computer required choosing among dozens of incompatible encodings โ ASCII for English, ISO 8859 variants for European languages, Shift-JIS for Japanese, Big5 for Traditional Chinese, GB for Simplified, and so on. Documents in mixed scripts were essentially impossible. The web in non-Latin scripts was crippled until Unicode and UTF-8 became universal in the early 2000s.
Unicode is now the foundation of multilingual computing. Its inclusivity has inflected the politics of script โ minority and endangered scripts have campaigned for inclusion (the Tifinagh of the Berber speakers, Adlam of Fulani West Africa, Osage). Inclusion in Unicode is now a milestone of script revitalisation.
Most scripts run left-to-right (LTR) โ Latin, Greek, Cyrillic, Brahmic, Hangul, modern Chinese. A minority run right-to-left (RTL) โ Hebrew, Arabic, Syriac, N'Ko. A few historical scripts ran boustrophedon โ alternating direction line by line, as the ox plows (Greek of the 7thโ6th century BCE).
Vertical writing โ top-to-bottom, columns running right-to-left โ is traditional in Chinese, Japanese, and Korean, and remains in use for newspaper headlines, signage, and book titles. Mongolian script, derived from Uyghur, runs top-to-bottom in left-to-right columns and is unique among major scripts in that orientation.
The directionality of a script is not arbitrary. The right-handed scribe writing in stone preferred to chip away from the supporting hand โ this has been argued for the original right-to-left orientation of Phoenician. The Greeks reversed direction probably under Etruscan influence, or for ease of writing on papyrus with the right hand without smudging. Once set, direction is hard to change.
Three calligraphic traditions stand out as the highest visual art forms of their cultures.
Chinese calligraphy. Five canonical script styles โ seal (็ฏ), clerical (้ธ), regular (ๆฅท), running (่ก), cursive (่). Wang Xizhi (303โ361), Yan Zhenqing (709โ785), Su Shi (1037โ1101), Zhao Mengfu (1254โ1322) โ calligraphers who are also among the most important figures in Chinese cultural history.
Islamic calligraphy. Six classical hands โ Naskh, Thuluth, Muhaqqaq, Rayhani, Tawqi', Riqa' โ formalised by Ibn Muqla (~885โ940) and Yaqut al-Musta'simi (d. 1298). The Quran was the central object; calligraphy carried theological weight as the visible form of revelation.
Japanese shodล, the Japanese calligraphic tradition, derived from Chinese practice but elaborated through the kana scripts and the aesthetic of wabi-sabi. The hiragana cursive of the Heian-period women โ Sei Shลnagon, Murasaki Shikibu โ is itself a calligraphic system.
Beyond the Indus, several scripts remain partially or wholly opaque.
Linear A (Minoan Crete, ~1800โ1450 BCE). The parent of Linear B; the underlying language is unknown and almost certainly not Greek. Without a bilingual or a candidate language, decipherment may be permanent.
Rongorongo (Easter Island). Wooden tablets with carved hieroglyphs, perhaps an indigenous invention or perhaps a response to seeing European writing. Most tablets were burned by missionaries; about 25 survive. The script is not read.
Etruscan is partly readable โ the script is alphabetic and similar enough to Greek to be transcribable โ but the Etruscan language is an isolate and the inscriptions are mostly funerary, with limited content.
Olmec and Cascajal signs โ predecessors of the Mesoamerican script tradition, now known mostly from a few inscribed stones. Whether they constitute writing in the technical sense remains contested.
โ Extra History ยท The History of Writing โ Where the Story Begins
Watch ยท Treasures of the UCLA Library โ Cuneiform Tablets
Watch ยท Why was the Rosetta Stone so important? โ TED-Ed
Walter Ong's Orality and Literacy (1982) and Jack Goody's The Domestication of the Savage Mind (1977) made the case that writing is not merely a record of speech but a cognitive technology that restructures thought. Lists, tables, footnotes, indexes, archives, calendars โ none possible in pure orality.
Eric Havelock's Preface to Plato (1963) argued that the rise of literate philosophy in classical Greece was not coincidental: the alphabet's compact efficiency enabled lay literacy, which enabled abstract argument detached from the social occasion of recitation. Plato distrusted writing (in the Phaedrus) precisely because he understood what it would do.
The historical record is consistent with these arguments. Where writing arrived, accounting, administration, codified law, and accumulated science arrived with it. The bureaucratic state, the monotheistic religion of the book, and the experimental science of Galileo are all literacies in some sense. The case is not that writing causes these things โ it does not โ but that they are not possible without it.
Three trends shape the next century of writing.
Latinisation continues. Vietnamese, Indonesian, Tagalog, Turkish are settled. Kazakh announced its transition to Latin in 2017; Uzbek's transition has been ongoing since 1993. The motivations are usually combinations of integration with global digital systems, distancing from a previous imperial power, and pedagogical efficiency.
Script revitalisation works in the other direction. The Cherokee syllabary, the Vai script of Liberia, Adlam (Fulani West Africa, designed by the Barry brothers in the 1980s), Osage (designed in 2006 for use by the Osage Nation), Tifinagh (the Berber script, given official status in Morocco in 2003). New scripts are still being invented and adopted.
Digital pressure accumulates. Emoji are not writing in the technical sense, but they have started to function as a writing-adjacent system; their global standardisation, governed by Unicode's emoji subcommittee, is the closest thing in modern writing to a deliberately designed shared symbol set. Whether they evolve further toward writing โ picking up grammar, productive composition, conventional spelling โ remains to be seen.
Writing Systems โ Volume XVI, Deck 2 of The Deck Catalog. Set in Cardo. Papyrus #efe4cc; lapis and rust accents.
Thirty-one leaves on the technology that made history possible. From clay tablet to UTF-8.
โ Vol. XVI ยท Lang. ยท Deck 2