A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to waldo in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
waldo (0) - 1 freq
wald (1) - 2 freq
aldo (1) - 266 freq
waldy (1) - 1 freq
bardo (2) - 1 freq
sado (2) - 1 freq
wall- (2) - 1 freq
wal (2) - 24 freq
wild (2) - 236 freq
wido (2) - 2 freq
ballo (2) - 1 freq
walee (2) - 1 freq
aldi (2) - 9 freq
warlds (2) - 40 freq
walkn (2) - 1 freq
wilno (2) - 2 freq
swald (2) - 2 freq
wuld (2) - 26 freq
laldy (2) - 42 freq
walop (2) - 1 freq
pallo (2) - 1 freq
nald (2) - 1 freq
wld (2) - 1 freq
watld (2) - 1 freq
swallo (2) - 1 freq
Double Levenshtein
waldo (0) - 1 freq
waldy (1) - 1 freq
wald (1) - 2 freq
wulda (2) - 1 freq
wuld (2) - 26 freq
wld (2) - 1 freq
wouldo (2) - 1 freq
waled (2) - 29 freq
wyld (2) - 6 freq
awald (2) - 3 freq
weld (2) - 1 freq
wilde (2) - 3 freq
aldo (2) - 266 freq
wold (2) - 1 freq
wild (2) - 236 freq
wyled (3) - 2 freq
ald (3) - 14 freq
waid (3) - 1 freq
woulda (3) - 4 freq
walin (3) - 39 freq
wauled (3) - 1 freq
waldit (3) - 1 freq
wall (3) - 76 freq
waly (3) - 27 freq
baldy (3) - 28 freq
SoundEx code - W430
wild - 236 freq
would - 690 freq
waled - 29 freq
woulda - 4 freq
wauled - 1 freq
wilt - 3 freq
walthie - 6 freq
walth - 35 freq
wheeled - 14 freq
weel-ah-wat - 1 freq
wealth - 25 freq
wealthy - 11 freq
wailed - 4 freq
wellwood - 13 freq
walthy - 10 freq
wuld - 26 freq
wealthie - 1 freq
'wuld - 1 freq
wulda - 1 freq
walit - 11 freq
wheelt - 5 freq
wheel't - 1 freq
wallowed - 1 freq
walt - 10 freq
wallet - 23 freq
willed - 4 freq
'wealth - 2 freq
wielt - 3 freq
waaled - 2 freq
walled - 2 freq
wull't - 2 freq
wilde - 3 freq
would' - 2 freq
waylaid - 2 freq
wyld - 6 freq
waallet - 1 freq
wiled - 1 freq
weld - 1 freq
wullt - 1 freq
wallit - 1 freq
waeled - 5 freq
wailit - 1 freq
wealt - 1 freq
willit - 1 freq
wyled - 2 freq
wold - 1 freq
welt - 1 freq
'wild - 1 freq
waelit - 1 freq
weylaid - 1 freq
wald - 2 freq
waldy - 1 freq
weel-looed - 1 freq
weel-lo'ed - 1 freq
waldo - 1 freq
wield - 2 freq
wouldo - 1 freq
wheelied - 1 freq
“would - 1 freq
wuild - 4 freq
weild - 1 freq
wailth - 1 freq
wld - 1 freq
wheeld - 1 freq
MetaPhone code - WLT
wild - 236 freq
would - 690 freq
waled - 29 freq
woulda - 4 freq
wauled - 1 freq
wilt - 3 freq
wheeled - 14 freq
wailed - 4 freq
wuld - 26 freq
'wuld - 1 freq
wulda - 1 freq
walit - 11 freq
wheelt - 5 freq
wheel't - 1 freq
walt - 10 freq
wallet - 23 freq
willed - 4 freq
wielt - 3 freq
waaled - 2 freq
walled - 2 freq
wull't - 2 freq
wilde - 3 freq
would' - 2 freq
waylaid - 2 freq
waallet - 1 freq
wiled - 1 freq
weld - 1 freq
wullt - 1 freq
wallit - 1 freq
waeled - 5 freq
wailit - 1 freq
wealt - 1 freq
willit - 1 freq
wold - 1 freq
welt - 1 freq
'wild - 1 freq
waelit - 1 freq
weylaid - 1 freq
wald - 2 freq
waldy - 1 freq
waldo - 1 freq
wield - 2 freq
wouldo - 1 freq
wheelied - 1 freq
“would - 1 freq
wuild - 4 freq
weild - 1 freq
wheeld - 1 freq
Manually curated
WALDO
Time to execute Levenshtein function - 0.198526 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
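As a rough illustration of that definition, here is a minimal Python sketch of the classic dynamic-programming edit distance. The function name `levenshtein` is chosen for illustration only; the site's own implementation is not shown here and may differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn a into b."""
    # Row 0: turning the empty string into the first j characters of b
    # takes j insertions.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        # Column 0: turning the first i characters of a into the empty
        # string takes i deletions.
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep a match
            ))
        previous = current
    return previous[-1]


for word in ("waldo", "wald", "aldo", "wild"):
    print(word, levenshtein("waldo", word))
```

For these four words the sketch gives 0, 1, 1 and 2, which agrees with the distances in the Levenshtein listing.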
Time to execute Double Levenshtein function - 0.372835 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
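The idea can be sketched in Python as below. This is an illustration under assumptions, not the site's code: the names `strip_vowels` and `double_levenshtein` are made up here, and treating `y` as a vowel is an inference from the listing (for example, waldy scores 1 and wyld scores 2, which only works out if `y` is dropped along with a, e, i, o, u).

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,
                current[j - 1] + 1,
                previous[j - 1] + (ca != cb),
            ))
        previous = current
    return previous[-1]


# Assumption: y counts as a vowel (inferred from the listing, not documented).
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Return the word with its vowels removed, keeping consonant order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("waldy", "aldo", "baldy"):
    print(word, double_levenshtein("waldo", word))
```

Under these assumptions, aldo moves from distance 1 to 2 because it loses a consonant, while waldy stays at 1 because only a vowel changes.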
Time to execute SoundEx function - 0.028936 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
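For readers who want to see how such a code is built, here is a minimal Python sketch of the standard American Soundex rules: keep the first letter, map the following consonants to digits, skip vowels, collapse adjacent repeats, and pad to four characters. The function name `soundex` and the handling of non-letters (they are simply ignored) are assumptions of this sketch; the site's own function may differ in edge cases.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if there are no letters."""
    # Assumption: punctuation, digits and non-ASCII characters are ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        # Emit a digit only when it differs from the previous coded letter.
        if digit and digit != previous:
            code += digit
        # h and w do not separate two letters with the same code; vowels do.
        if ch not in "hw":
            previous = digit
        if len(code) == 4:
            break
    return code.ljust(4, "0")


for word in ("waldo", "would", "wealth", "wheeled"):
    print(word, soundex(word))
```

All four words come out as W430, the code shown for waldo in the SoundEx listing.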
Time to execute MetaPhone function - 0.039274 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000897 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
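A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is hypothetical: the names `LEXICON` and `lemma_for` are invented, the waldo entry mirrors the WALDO result shown above on the assumption that it is the lemma, and the other entries are made up purely for illustration.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
LEXICON = {
    "waldo": "WALDO",   # mirrors the result shown above (assumed to be the lemma)
    "wuld": "WOULD",    # invented entry, for illustration only
    "woulda": "WOULD",  # invented entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("waldo"))
print(lemma_for("ablow"))
```

A dictionary lookup does not scan the vocabulary at all, which would be consistent with this method having the smallest of the five timings. Words missing from the table simply return `None`, matching the note that not all words are covered.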