The smartest design decision in Morse code is that the more frequent the letter, the shorter its code. The story behind it is famous: to estimate how often each letter was used, Samuel Morse's partner Alfred Vail visited the local newspaper office in Morristown, New Jersey, and counted the pieces of movable type in the printers' cases — print shops stocked more type for common letters and less for rare ones, a ready-made frequency table. The Library of Congress's historical essay on the telegraph records this episode of corpus linguistics by type case.
The result is the alphabet we use today: E, the most common letter, is a single dot (.); T, the runner-up, a single dash (-); while rare Q (--.-) and J (.---) stretch to four elements.
Modern corpora confirm Vail's type-case survey was remarkably accurate. Peter Norvig, Google's director of research, computed English letter frequencies over trillions of characters from the Google Books corpus. Set those against each letter's Morse duration (dot = 1 unit, dash = 3, gap between elements = 1):
| Letter | Corpus frequency | Code | Units |
|---|---|---|---|
| E | 12.49% | . | 1 |
| T | 9.28% | - | 3 |
| A | 8.04% | .- | 5 |
| O | 7.64% | --- | 11 |
| Q | 0.12% | --.- | 13 |
| J | 0.16% | .--- | 13 |
Frequency and length are roughly inversely related. The mapping is not perfect — O is longer than the more frequent N deserves, for instance — but for 1838 tools and knowledge, it is an impressively good approximation of an optimal code.
"Short codes for common symbols" is exactly the core idea information theory would formalize a century later. Claude Shannon's 1951 paper "Prediction and Entropy of Printed English" quantified the statistical structure and redundancy of English, proving how compressible text really is; Huffman's 1952 algorithm then constructed provably optimal variable-length codes — on the very principle Morse and Vail had used: rank by frequency, assign the short codes first. In a real sense, the telegraph wires of the 1840s ran a compression scheme that mathematics only caught up with a hundred years later.
This design is good news if you are learning: master the short, high-frequency letters E, T, A, O, I, N first, and you can already hear nearly half the characters in typical English text. Open our Morse code translator, type an English sentence and press play — listen for those fleeting short tones. Their rhythm is the echo of a type case counted 180 years ago.
Try the Morse Code Translator Now