A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to wadja in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
wadja (0) - 1 freq
wad'a (1) - 4 freq
wada (1) - 1 freq
nadja (1) - 1 freq
wadna (1) - 228 freq
wanna (2) - 17 freq
padua (2) - 1 freq
adjf (2) - 1 freq
wdjj (2) - 1 freq
wadno (2) - 3 freq
widaa (2) - 1 freq
waoa (2) - 1 freq
dadda (2) - 1 freq
wades (2) - 1 freq
waded (2) - 6 freq
wade (2) - 9 freq
wadit (2) - 1 freq
pada (2) - 1 freq
wadge (2) - 5 freq
wads (2) - 2 freq
wdma (2) - 1 freq
wudna (2) - 9 freq
baja (2) - 1 freq
ada (2) - 4 freq
adj (2) - 1 freq
wadja (0) - 1 freq
wadna (2) - 228 freq
nadja (2) - 1 freq
wada (2) - 1 freq
wad'a (2) - 4 freq
wud'a (3) - 1 freq
wadin (3) - 2 freq
wads (3) - 2 freq
wddj (3) - 1 freq
wdma (3) - 1 freq
wudna (3) - 9 freq
widda (3) - 36 freq
wudda (3) - 2 freq
wida (3) - 5 freq
wad (3) - 2263 freq
wadge (3) - 5 freq
widna (3) - 443 freq
wadnae (3) - 60 freq
adj (3) - 1 freq
wadno (3) - 3 freq
wades (3) - 1 freq
wdjj (3) - 1 freq
waded (3) - 6 freq
widaa (3) - 1 freq
wadit (3) - 1 freq
SoundEx code - W320
widds - 35 freq
whit's - 517 freq
withies - 1 freq
watch - 675 freq
'whit's - 76 freq
wits - 36 freq
-watch - 1 freq
wids - 75 freq
what's - 116 freq
widows - 4 freq
weeds - 56 freq
widdies - 14 freq
waatch - 67 freq
wit's - 11 freq
'wit's - 10 freq
'what's - 5 freq
'watch - 12 freq
wuts - 7 freq
woods - 35 freq
wits' - 3 freq
witch - 96 freq
'whits - 2 freq
whits - 47 freq
wha-haits - 1 freq
white's - 4 freq
wuids - 21 freq
wedge - 8 freq
witchy - 4 freq
wattie's - 2 freq
whites - 10 freq
whitehaugh - 1 freq
watk - 2 freq
wuds - 5 freq
whit''s - 2 freq
wit''s - 1 freq
wuty's - 1 freq
whitch - 1 freq
waits - 27 freq
widow's - 1 freq
whitehouse - 2 freq
whut's - 37 freq
wa-heids - 1 freq
whutch - 7 freq
wadja - 1 freq
whitewash - 2 freq
wads - 2 freq
wut's - 1 freq
widdas - 1 freq
whuts - 11 freq
wodehouse - 1 freq
white-ies - 1 freq
weedas - 1 freq
'whut's - 1 freq
wüt's - 1 freq
weedows - 1 freq
wytes - 3 freq
wid's - 2 freq
whets - 2 freq
weedoo's - 1 freq
watchie - 1 freq
whut''s - 2 freq
'whit''s - 1 freq
wutts - 1 freq
widd's - 1 freq
watt's - 4 freq
weedgie - 1 freq
wie'it's - 1 freq
widge - 1 freq
weethick - 1 freq
witchie - 3 freq
widda's - 3 freq
wadge - 5 freq
wades - 1 freq
whyte's - 2 freq
whitz - 3 freq
woodwick - 1 freq
wade's - 1 freq
waa-heids - 1 freq
weidaes - 2 freq
whyt-waash - 2 freq
€˜whyt-waash - 1 freq
waatchie - 1 freq
€œwhits - 1 freq
wudds - 12 freq
€œwatch - 1 freq
wide-os - 1 freq
€˜watch - 1 freq
whitshe - 1 freq
€˜witchie - 1 freq
€œwhit's - 1 freq
€˜wits - 1 freq
€˜witch - 1 freq
waaata's - 1 freq
waatÂ’ch - 1 freq
wots - 3 freq
whats - 18 freq
whitÂ’s - 12 freq
wdjj - 1 freq
wtoegq - 1 freq
whatÂ’s - 4 freq
“whit’s - 2 freq
wwdtk - 1 freq
wddj - 1 freq
widsaaay - 1 freq
weets - 1 freq
wtg - 1 freq
woodÂ’s - 2 freq
MetaPhone code - WTJ
wadja - 1 freq
WADJA
Time to execute Levenshtein function - 0.398017 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.517879 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028488 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039157 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000870 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.