A Corpus of 21st Century Scots Texts

Levenshtein Distance

Similar words to waitin’ in Corpus

Levenshtein
waitin' (3) - 4 freq
waitin (3) - 411 freq
waiting (3) - 49 freq
waitin’ (3) - 3 freq
wanting (4) - 19 freq
writing (4) - 95 freq
maitin (4) - 1 freq
writing' (4) - 1 freq
whitins (4) - 1 freq
baitin (4) - 1 freq
wailin' (4) - 2 freq
whitin (4) - 25 freq
wailin (4) - 11 freq
waftin (4) - 8 freq
wantin (4) - 326 freq
whiting (4) - 1 freq
waitan (4) - 27 freq
waiten (4) - 1 freq
wastin (4) - 32 freq
laitin (4) - 19 freq
writin's (4) - 8 freq
wartin (4) - 1 freq
waytin (4) - 1 freq
wattin (4) - 2 freq
writin (4) - 375 freq
Double Levenshtein
waitin' (6) - 4 freq
waiting (6) - 49 freq
waitin’ (6) - 3 freq
waitin (6) - 411 freq
waytin (7) - 1 freq
awaitin (7) - 6 freq
waiten (7) - 1 freq
awaiting (7) - 2 freq
waitn (7) - 1 freq
witin (7) - 1 freq
waitan (7) - 27 freq
saitin (8) - 3 freq
haitin (8) - 7 freq
wantin' (8) - 7 freq
wasting (8) - 6 freq
writins (8) - 9 freq
wairin (8) - 1 freq
wailing (8) - 1 freq
wytin (8) - 66 freq
waetan (8) - 3 freq
witness (8) - 89 freq
weetin (8) - 2 freq
wtin (8) - 1 freq
weitnes (8) - 2 freq
writin' (8) - 2 freq
SoundEx code - W350
waitin - 411 freq
widna - 443 freq
widden - 134 freq
waddin - 96 freq
wadnae - 60 freq
widnae - 435 freq
whit'm - 3 freq
whitin - 25 freq
wadna - 228 freq
woudna - 1 freq
within - 189 freq
whitna - 42 freq
whudney - 1 freq
wytin - 66 freq
widen - 13 freq
'whitna - 1 freq
wooden - 37 freq
waitin' - 4 freq
'wait'n - 1 freq
wudnae - 101 freq
weedin - 7 freq
waitan - 27 freq
whatna - 21 freq
weddin - 40 freq
wuiden - 24 freq
wheaten - 4 freq
wudna - 9 freq
wadin - 2 freq
wadin' - 1 freq
withen - 5 freq
weyten - 4 freq
wudden - 14 freq
weytin - 3 freq
wittin - 30 freq
weethin - 28 freq
wettin - 2 freq
waitn - 1 freq
wide-an - 1 freq
wattenn - 1 freq
wuidnae - 33 freq
widno - 1 freq
whittin - 8 freq
watna - 1 freq
wid'nae - 1 freq
wydin - 2 freq
'wytin - 1 freq
whitten - 11 freq
wid'n - 2 freq
whiteen - 7 freq
'widny' - 1 freq
weddeen - 2 freq
wi-outen - 1 freq
wuidna - 25 freq
wiouten - 5 freq
wadno - 3 freq
'widden - 1 freq
whatno - 2 freq
widn - 4 freq
whatn - 1 freq
wetten - 1 freq
waddeen - 2 freq
wettan - 4 freq
waetan - 3 freq
whitema - 2 freq
wadeen - 1 freq
widdin - 4 freq
whit'n - 1 freq
wattin - 2 freq
within-a - 1 freq
whitan - 9 freq
waittan - 1 freq
wuidin - 24 freq
witten - 1 freq
woudnae - 9 freq
wuiddin - 1 freq
whiten - 1 freq
'waddin - 1 freq
‘widnae - 2 freq
whiddin - 1 freq
whatten - 1 freq
weetin - 2 freq
weddin' - 2 freq
widni - 3 freq
wthin - 1 freq
“whitna - 1 freq
“whitten - 1 freq
witin - 1 freq
widney - 5 freq
widny - 4 freq
wiooten - 2 freq
wuidn - 1 freq
“wadna - 1 freq
whitane - 1 freq
whitna' - 1 freq
wdma - 1 freq
waadin - 1 freq
weddin’ - 1 freq
wiidawn - 1 freq
wudni - 4 freq
wtin - 1 freq
waytin - 1 freq
waiten - 1 freq
waitin’ - 3 freq
whitemaa - 1 freq
withma - 1 freq
MetaPhone code - WTN
waitin - 411 freq
widna - 443 freq
widden - 134 freq
waddin - 96 freq
wadnae - 60 freq
widnae - 435 freq
whitin - 25 freq
wadna - 228 freq
woudna - 1 freq
whitna - 42 freq
whudney - 1 freq
widen - 13 freq
'whitna - 1 freq
wooden - 37 freq
waitin' - 4 freq
'wait'n - 1 freq
wudnae - 101 freq
weedin - 7 freq
waitan - 27 freq
whatna - 21 freq
weddin - 40 freq
wuiden - 24 freq
wheaten - 4 freq
wudna - 9 freq
wadin - 2 freq
wadin' - 1 freq
weyten - 4 freq
wudden - 14 freq
weytin - 3 freq
wittin - 30 freq
wettin - 2 freq
waitn - 1 freq
wide-an - 1 freq
wattenn - 1 freq
wuidnae - 33 freq
widno - 1 freq
whittin - 8 freq
watna - 1 freq
wid'nae - 1 freq
whitten - 11 freq
wid'n - 2 freq
whiteen - 7 freq
'widny' - 1 freq
weddeen - 2 freq
wi-outen - 1 freq
wuidna - 25 freq
wiouten - 5 freq
wadno - 3 freq
'widden - 1 freq
whatno - 2 freq
widn - 4 freq
whatn - 1 freq
wetten - 1 freq
waddeen - 2 freq
wettan - 4 freq
waetan - 3 freq
wadeen - 1 freq
widdin - 4 freq
whit'n - 1 freq
wattin - 2 freq
whitan - 9 freq
waittan - 1 freq
wuidin - 24 freq
witten - 1 freq
woudnae - 9 freq
wuiddin - 1 freq
whiten - 1 freq
'waddin - 1 freq
‘widnae - 2 freq
whiddin - 1 freq
whatten - 1 freq
weetin - 2 freq
weddin' - 2 freq
widni - 3 freq
“whitna - 1 freq
“whitten - 1 freq
witin - 1 freq
widney - 5 freq
widny - 4 freq
wiooten - 2 freq
wuidn - 1 freq
“wadna - 1 freq
whitane - 1 freq
whitna' - 1 freq
waadin - 1 freq
weddin’ - 1 freq
wiidawn - 1 freq
wudni - 4 freq
waytin - 1 freq
waiten - 1 freq
waitin’ - 3 freq
Manually curated
WAITIN’
Time to execute Levenshtein function - 0.217936 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
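As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs. It counts Unicode characters, which is an assumption, and an implementation that counts bytes would give larger distances for words containing characters such as a curly apostrophe.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character edits that turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    # previous[j] is the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for other in ("waiting", "wantin", "writin"):
        print(other, levenshtein("waitin", other))
```

Only two rows of the table are kept at a time, so memory use grows with the length of the shorter word rather than with the product of both lengths.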
Time to execute Double Levenshtein function - 0.382534 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
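The idea can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the code behind this page: the names `strip_vowels` and `double_levenshtein` are hypothetical, and the set of letters treated as vowels (`aeiou` here) is a guess, since the page does not say whether `y` or accented letters count.

```python
VOWELS = set("aeiou")  # assumption: the page does not list its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, counted in Unicode characters."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for other in ("waiten", "waiting", "wastin"):
        print(other, double_levenshtein("waitin", other))
```

A vowel change such as waitin to waiten costs 1 in the first pass and nothing in the second, while a consonant change such as waitin to wastin is counted in both passes, which is where the double weight comes from.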
Time to execute SoundEx function - 0.027839 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
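For readers who want to see how such a code is built, the following Python sketch implements the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters with the same digit, and pad to four characters. It is a sketch rather than the function used by this page, and non-alphabetic characters such as apostrophes are simply ignored, which is an assumption.

```python
SOUNDEX_DIGITS = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = SOUNDEX_DIGITS.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        digit = SOUNDEX_DIGITS.get(ch, "")
        if digit and digit != previous:
            code += digit
            if len(code) == 4:
                break
        previous = digit  # a vowel resets this, so a repeated digit is kept
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("waitin", "widna", "within", "wheaten"):
        print(word, soundex(word))
```

All four words in the usage lines come out as W350, the code shown above for waitin, which illustrates why the SoundEx list is so much longer and looser than the Levenshtein lists.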
Time to execute MetaPhone function - 0.039134 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000867 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
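A lookup of this kind can be pictured as a plain dictionary. The sketch below is illustrative only: the `LEXICON` entries and the `lemma_for` helper are hypothetical and are not taken from the corpus's actual lexicon.

```python
from typing import Optional

# Hypothetical entries for illustration; the real lexicon is curated by hand.
LEXICON = {
    "waitin": "wait",
    "waitin'": "wait",
    "waiting": "wait",
    "waitan": "wait",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


if __name__ == "__main__":
    for word in ("Waitin", "waitan", "ablow"):
        print(word, lemma_for(word))
```

A dictionary lookup does no computation on the word itself, which fits the very small execution time reported for this method, and returning `None` for uncovered words reflects the note that not all words are covered.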