A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to watk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
watk (0) - 2 freq
waak (1) - 54 freq
wakk (1) - 3 freq
wak (1) - 53 freq
wat' (1) - 1 freq
wat (1) - 52 freq
wauk (1) - 40 freq
waik (1) - 28 freq
wark (1) - 903 freq
watt (1) - 27 freq
datk (1) - 1 freq
waek (1) - 7 freq
wank (1) - 8 freq
watl (1) - 1 freq
waty (1) - 1 freq
walk (1) - 483 freq
watp (1) - 1 freq
bat (2) - 50 freq
waalk (2) - 5 freq
waer (2) - 2 freq
wauks (2) - 5 freq
sati (2) - 1 freq
waark (2) - 110 freq
warth (2) - 64 freq
dak (2) - 1 freq
Double Levenshtein
watk (0) - 2 freq
watl (2) - 1 freq
waek (2) - 7 freq
datk (2) - 1 freq
waty (2) - 1 freq
walk (2) - 483 freq
uwtk (2) - 1 freq
watp (2) - 1 freq
watt (2) - 27 freq
wank (2) - 8 freq
wark (2) - 903 freq
wakk (2) - 3 freq
waak (2) - 54 freq
wak (2) - 53 freq
wat' (2) - 1 freq
waik (2) - 28 freq
wauk (2) - 40 freq
wat (2) - 52 freq
awk (3) - 1 freq
wits (3) - 36 freq
wutt (3) - 1 freq
water (3) - 258 freq
weat (3) - 1 freq
waut (3) - 1 freq
wi'k (3) - 1 freq
SoundEx code - W320
widds - 35 freq
whit's - 517 freq
withies - 1 freq
watch - 675 freq
'whit's - 76 freq
wits - 36 freq
-watch - 1 freq
wids - 75 freq
what's - 116 freq
widows - 4 freq
weeds - 56 freq
widdies - 14 freq
waatch - 67 freq
wit's - 11 freq
'wit's - 10 freq
'what's - 5 freq
'watch - 12 freq
wuts - 7 freq
woods - 35 freq
wits' - 3 freq
witch - 96 freq
'whits - 2 freq
whits - 47 freq
wha-haits - 1 freq
white's - 4 freq
wuids - 21 freq
wedge - 8 freq
witchy - 4 freq
wattie's - 2 freq
whites - 10 freq
whitehaugh - 1 freq
watk - 2 freq
wuds - 5 freq
whit''s - 2 freq
wit''s - 1 freq
wuty's - 1 freq
whitch - 1 freq
waits - 27 freq
widow's - 1 freq
whitehouse - 2 freq
whut's - 37 freq
wa-heids - 1 freq
whutch - 7 freq
wadja - 1 freq
whitewash - 2 freq
wads - 2 freq
wut's - 1 freq
widdas - 1 freq
whuts - 11 freq
wodehouse - 1 freq
white-ies - 1 freq
weedas - 1 freq
'whut's - 1 freq
wüt's - 1 freq
weedows - 1 freq
wytes - 3 freq
wid's - 2 freq
whets - 2 freq
weedoo's - 1 freq
watchie - 1 freq
whut''s - 2 freq
'whit''s - 1 freq
wutts - 1 freq
widd's - 1 freq
watt's - 4 freq
weedgie - 1 freq
wie'it's - 1 freq
widge - 1 freq
weethick - 1 freq
witchie - 3 freq
widda's - 3 freq
wadge - 5 freq
wades - 1 freq
whyte's - 2 freq
whitz - 3 freq
woodwick - 1 freq
wade's - 1 freq
waa-heids - 1 freq
weidaes - 2 freq
whyt-waash - 2 freq
‘whyt-waash - 1 freq
waatchie - 1 freq
“whits - 1 freq
wudds - 12 freq
“watch - 1 freq
wide-os - 1 freq
‘watch - 1 freq
whitshe - 1 freq
‘witchie - 1 freq
“whit's - 1 freq
‘wits - 1 freq
‘witch - 1 freq
waaata's - 1 freq
waat’ch - 1 freq
wots - 3 freq
whats - 18 freq
whit’s - 12 freq
wdjj - 1 freq
wtoegq - 1 freq
what’s - 4 freq
“whit’s - 2 freq
wwdtk - 1 freq
wddj - 1 freq
widsaaay - 1 freq
weets - 1 freq
wtg - 1 freq
wood’s - 2 freq
MetaPhone code - WTK
watk - 2 freq
Manually curated
WATK
Time to execute Levenshtein function - 0.237934 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
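As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. The function name `levenshtein` is chosen for this example, and the site's own implementation may differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("watk", "wark"))  # 1: replace t with r
print(levenshtein("watk", "bat"))   # 2: replace w with b, delete k
```

These two values agree with the distances listed for wark and bat in the results above.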
Time to execute Double Levenshtein function - 0.360682 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
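The idea can be sketched in Python as follows. This is an illustration under assumptions, not the site's code: the vowel set is taken to be a, e, i, o, u, every other character (including apostrophes) is kept, and the names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: the site's vowel list may differ


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keep consonants and any other characters."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("watk", "wark"))   # 2 = 1 (watk/wark) + 1 (wtk/wrk)
print(double_levenshtein("watk", "water"))  # 3 = 2 (watk/water) + 1 (wtk/wtr)
```

Under these assumptions the sketch reproduces the scores shown for wark (2) and water (3) in the Double Levenshtein results.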
Time to execute SoundEx function - 0.030831 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
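To show how a word such as watk ends up with the code W320, here is a minimal Python sketch of the standard American Soundex rules. It is an assumption that the site follows these exact rules; the handling of non-letters (dropped here) and the name `soundex` are choices made for this example.

```python
# Letter groups of standard American Soundex.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: apostrophes, hyphens and non-ASCII characters are ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for c in letters[1:]:
        digit = SOUNDEX_CODES.get(c, "")
        if digit:
            if digit != last:   # adjacent letters with the same code count once
                code += digit
            last = digit
        elif c not in "hw":     # vowels and y break a run; h and w do not
            last = ""
        if len(code) == 4:
            break
    return code.ljust(4, "0")   # pad short codes with zeros


print(soundex("watk"))   # W320
print(soundex("wedge"))  # W320
```

Both words reduce to W, then 3 (t or d), then 2 (k or g), which is why wedge appears in the same SoundEx group as watk.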
Time to execute MetaPhone function - 0.048958 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001106 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
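A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are hypothetical placeholders invented for the example, not taken from the corpus's actual lexicon, and the names `LEXICON` and `lemma_for` are likewise assumptions.

```python
from typing import Optional

# Hypothetical entries for illustration only: variant spelling -> lemma.
LEXICON = {
    "waatch": "watch",
    "whut's": "what's",
    "whit's": "what's",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Waatch"))  # watch
print(lemma_for("watk"))    # None: not every word is in the table
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having the shortest execution time reported above.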