A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to taught in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
taught (0) - 73 freq
laught (1) - 1 freq
faught (1) - 2 freq
eaught (1) - 1 freq
'taught (1) - 1 freq
aught (1) - 4 freq
tought (1) - 3 freq
saught (1) - 1 freq
caught (1) - 191 freq
naught (1) - 27 freq
taucht (1) - 46 freq
laughit (2) - 1 freq
thight (2) - 2 freq
thoght (2) - 1 freq
faucht (2) - 3 freq
toght (2) - 1 freq
caucht (2) - 32 freq
tight (2) - 75 freq
aucht (2) - 33 freq
tough (2) - 43 freq
target (2) - 33 freq
oaght (2) - 1 freq
straught (2) - 1 freq
taupit (2) - 1 freq
dought (2) - 1 freq
taught (0) - 73 freq
tought (1) - 3 freq
taucht (2) - 46 freq
laught (2) - 1 freq
toght (2) - 1 freq
tight (2) - 75 freq
caught (2) - 191 freq
naught (2) - 27 freq
'taught (2) - 1 freq
saught (2) - 1 freq
eaught (2) - 1 freq
faught (2) - 2 freq
aught (2) - 4 freq
haughty (3) - 3 freq
rought (3) - 1 freq
tasht (3) - 1 freq
nought (3) - 32 freq
staight (3) - 1 freq
fought (3) - 19 freq
toucht (3) - 13 freq
teucht (3) - 1 freq
taiglt (3) - 2 freq
teugh (3) - 1 freq
sought (3) - 10 freq
bought (3) - 83 freq
SoundEx code - T230
thocht - 2668 freq
toast - 90 freq
touched - 75 freq
tsead - 3 freq
ticht - 228 freq
twist - 48 freq
taste - 200 freq
tigged - 2 freq
tucked - 36 freq
tysday - 8 freq
ticket - 172 freq
text - 170 freq
thought - 508 freq
taught - 73 freq
tight - 75 freq
taucht - 46 freq
thochtie - 56 freq
touchit - 1 freq
tuesday - 161 freq
tuxedo - 4 freq
'thought - 2 freq
test - 146 freq
toucht - 13 freq
taskit - 1 freq
toasty - 3 freq
ticked - 5 freq
tasty - 45 freq
theikit - 6 freq
tuckt - 3 freq
tackety - 16 freq
tackity - 2 freq
toosied - 1 freq
tossed - 20 freq
tuggit - 3 freq
thackit - 3 freq
thoucht - 119 freq
taxed - 4 freq
thochty - 7 freq
taised - 1 freq
teased - 6 freq
techt - 2 freq
taketh - 1 freq
tousit - 1 freq
tugged - 4 freq
teuched - 4 freq
teucht - 1 freq
teuked - 1 freq
thicket - 1 freq
taistie - 1 freq
thoct - 7 freq
toastie - 8 freq
tost - 2 freq
tasht - 1 freq
tastie - 2 freq
teachit - 2 freq
tiched - 2 freq
tweaked - 2 freq
tecth - 2 freq
takked - 1 freq
twixt - 3 freq
twisty - 2 freq
ticket'' - 1 freq
'taste - 1 freq
tickit - 3 freq
thegithe - 1 freq
togged - 1 freq
taist - 2 freq
taakt - 10 freq
tist - 1 freq
tukt - 2 freq
twisitt - 1 freq
tacked - 2 freq
thougty - 1 freq
ticed - 2 freq
tackett - 1 freq
taicht - 3 freq
taakit - 6 freq
twiced - 13 freq
tak'ed - 1 freq
tuckit - 10 freq
twigged - 3 freq
teacheth - 1 freq
toked - 1 freq
toast' - 1 freq
tocht - 166 freq
tact - 8 freq
tized - 5 freq
taekit - 4 freq
'taught - 1 freq
twice't - 1 freq
taaked - 13 freq
thocht-' - 1 freq
thusgate - 1 freq
tashit - 2 freq
thight - 2 freq
teuchit - 3 freq
taeset - 2 freq
tacketie - 2 freq
tised - 1 freq
'tuesday - 1 freq
thoecht - 2 freq
tacket - 1 freq
tocht-du - 1 freq
t'kut - 1 freq
toght - 1 freq
twiggit - 2 freq
twicet - 1 freq
taoist - 2 freq
taest - 1 freq
tiggit - 2 freq
thougt - 1 freq
ticketie - 1 freq
toushtie - 1 freq
tacit - 1 freq
takkity - 1 freq
thought' - 2 freq
theik't - 1 freq
tackit - 3 freq
tyesday - 1 freq
tycht - 1 freq
twyst - 1 freq
tasty' - 1 freq
€˜thocht - 3 freq
thacket - 1 freq
thigged - 1 freq
theekit - 2 freq
teached - 8 freq
tosst - 1 freq
teuchat - 1 freq
thusday - 1 freq
toucheth - 1 freq
thisgied - 1 freq
taxt - 3 freq
tashed - 2 freq
takkit - 2 freq
twycet - 1 freq
toukit - 2 freq
tickity - 1 freq
€œtouched - 1 freq
tekked - 1 freq
tought - 3 freq
t-o-c-h-t - 1 freq
€˜tocht - 1 freq
this'd - 1 freq
tasked- - 1 freq
tackitie - 1 freq
testy - 1 freq
taked - 1 freq
takd - 1 freq
thowcht - 1 freq
thoctie - 1 freq
tystie - 1 freq
tayside - 2 freq
tyseday - 1 freq
txt - 8 freq
thoght - 1 freq
txhudd - 1 freq
tasked - 1 freq
tozdee - 1 freq
tooskit - 1 freq
tesswhite - 1 freq
thiught - 1 freq
ticket' - 4 freq
tickety - 5 freq
ticketty - 1 freq
toocute - 1 freq
teeside - 1 freq
toost - 1 freq
MetaPhone code - TFT
daft - 444 freq
daftie - 23 freq
dauvit - 95 freq
taught - 73 freq
tight - 75 freq
devoid - 7 freq
deaved - 13 freq
divvied - 2 freq
dafty - 29 freq
'daft - 2 freq
david - 234 freq
doft - 1 freq
defeat - 30 freq
divide - 24 freq
divot - 13 freq
daavit - 25 freq
'daavit - 1 freq
defait - 7 freq
defaut - 20 freq
dived - 21 freq
duvet - 20 freq
daffed - 3 freq
tuft - 1 freq
defied - 5 freq
'dauvit - 2 freq
divid - 10 freq
dvd - 11 freq
dafft - 2 freq
dayvideee - 2 freq
deeved - 6 freq
taft - 1 freq
tufty - 1 freq
devout - 4 freq
daivit - 4 freq
devide - 1 freq
dighty - 2 freq
davit - 23 freq
divïd - 8 freq
taffeta - 1 freq
'taught - 1 freq
dight - 3 freq
toffed - 1 freq
doffed - 3 freq
tift - 4 freq
tuffet - 2 freq
tuffed - 1 freq
'david - 2 freq
deft - 4 freq
toght - 1 freq
deived - 1 freq
divvy-oot - 1 freq
€œdauvit - 3 freq
€œdavid - 1 freq
tuftie - 1 freq
toved - 1 freq
€˜devout - 1 freq
tought - 3 freq
deavit - 1 freq
daavid - 12 freq
daavd - 1 freq
teviot - 2 freq
€œdaft - 1 freq
tufd - 1 freq
dvd' - 1 freq
“daft - 1 freq
toft - 1 freq
'dived' - 3 freq
TAUGHT
Time to execute Levenshtein function - 0.576712 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.910211 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.091684 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.095633 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000856 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.