A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to window in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
window (0) - 92 freq
widow (1) - 13 freq
windows (1) - 26 freq
windoo (1) - 1 freq
windsor (2) - 8 freq
windi (2) - 2 freq
'widow (2) - 1 freq
windie (2) - 4 freq
mindo (2) - 1 freq
willow (2) - 15 freq
wino (2) - 2 freq
kindo (2) - 32 freq
widows (2) - 4 freq
winder (2) - 112 freq
kindon (2) - 1 freq
weedow (2) - 2 freq
windea (2) - 2 freq
wind' (2) - 1 freq
windes (2) - 1 freq
endow (2) - 2 freq
winds (2) - 59 freq
wisdom (2) - 43 freq
windin (2) - 28 freq
windee (2) - 3 freq
wido (2) - 2 freq
window (0) - 92 freq
windows (2) - 26 freq
windoo (2) - 1 freq
widow (2) - 13 freq
windoes (3) - 1 freq
winda (3) - 47 freq
windup (3) - 1 freq
winds (3) - 59 freq
winded (3) - 2 freq
windin (3) - 28 freq
windee (3) - 3 freq
windze (3) - 1 freq
windy (3) - 35 freq
wind (3) - 469 freq
windis (3) - 2 freq
windae (3) - 537 freq
windit (3) - 5 freq
endow (3) - 2 freq
windle (3) - 1 freq
windas (3) - 2 freq
winder (3) - 112 freq
windes (3) - 1 freq
windi (3) - 2 freq
weedow (3) - 2 freq
windie (3) - 4 freq
SoundEx code - W530
went - 1912 freq
windae - 537 freq
want - 1616 freq
wind - 469 freq
window - 92 freq
wund - 104 freq
wynd - 14 freq
wint - 628 freq
win't - 4 freq
wound - 29 freq
wean-the - 1 freq
won't - 55 freq
wantae - 14 freq
winnd - 3 freq
wanty - 4 freq
whined - 3 freq
waant - 93 freq
winda - 47 freq
'want - 2 freq
waned - 6 freq
wand - 14 freq
whinnied - 3 freq
windy - 35 freq
wunnet - 1 freq
wont - 31 freq
whinniet - 1 freq
wanit - 1 freq
windie - 4 freq
wendy - 14 freq
wunt - 4 freq
weynd - 1 freq
whun-hud - 1 freq
'went - 1 freq
wanwit - 1 freq
wun't - 11 freq
wunnd - 1 freq
'wanty - 1 freq
whant - 15 freq
wined - 3 freq
windee - 3 freq
wunda - 8 freq
wundae - 14 freq
wundie - 1 freq
'want' - 2 freq
'wahnt' - 1 freq
'windy' - 1 freq
wend - 1 freq
wan-eyed - 1 freq
waand - 2 freq
wiind - 1 freq
wawn't - 1 freq
wonned - 1 freq
woont - 1 freq
woond - 5 freq
windea - 2 freq
wind' - 1 freq
windoo - 1 freq
weanhood - 2 freq
€œwant - 1 freq
wamth - 1 freq
whinneyied - 1 freq
€˜want - 1 freq
whinned - 1 freq
weened - 1 freq
wannt - 1 freq
windi - 2 freq
wmd - 1 freq
windae- - 1 freq
waand' - 1 freq
€œwint - 4 freq
wanda - 1 freq
wahnt - 1 freq
€™want - 1 freq
wonÂ’t - 5 freq
weehendo - 1 freq
weemowdie - 32 freq
wmt - 1 freq
wind” - 1 freq
weant - 1 freq
MetaPhone code - WNT
went - 1912 freq
windae - 537 freq
want - 1616 freq
wind - 469 freq
window - 92 freq
wund - 104 freq
wint - 628 freq
win't - 4 freq
wound - 29 freq
won't - 55 freq
wantae - 14 freq
winnd - 3 freq
wanty - 4 freq
whined - 3 freq
waant - 93 freq
winda - 47 freq
'want - 2 freq
waned - 6 freq
wand - 14 freq
whinnied - 3 freq
windy - 35 freq
wunnet - 1 freq
wont - 31 freq
whinniet - 1 freq
wanit - 1 freq
windie - 4 freq
wendy - 14 freq
wunt - 4 freq
weynd - 1 freq
'went - 1 freq
wun't - 11 freq
wunnd - 1 freq
'wanty - 1 freq
whant - 15 freq
wined - 3 freq
windee - 3 freq
wunda - 8 freq
wundae - 14 freq
wundie - 1 freq
'want' - 2 freq
'wahnt' - 1 freq
'windy' - 1 freq
wend - 1 freq
waand - 2 freq
wiind - 1 freq
wawn't - 1 freq
wonned - 1 freq
woont - 1 freq
woond - 5 freq
windea - 2 freq
wind' - 1 freq
windoo - 1 freq
€œwant - 1 freq
€˜want - 1 freq
whinned - 1 freq
weened - 1 freq
wannt - 1 freq
windi - 2 freq
windae- - 1 freq
waand' - 1 freq
€œwint - 4 freq
wanda - 1 freq
wahnt - 1 freq
€™want - 1 freq
wonÂ’t - 5 freq
wind” - 1 freq
weant - 1 freq
WINDOW
Time to execute Levenshtein function - 0.206798 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.330420 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027119 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036841 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000847 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.