A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to toukit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
toukit (0) - 2 freq
soukit (1) - 10 freq
tousit (1) - 1 freq
boukit (1) - 5 freq
poukit (1) - 7 freq
tourit (1) - 2 freq
doukit (1) - 3 freq
roukit (1) - 1 freq
joukit (1) - 15 freq
coukit (1) - 1 freq
poakit (2) - 23 freq
tuckit (2) - 10 freq
mockit (2) - 11 freq
howkit (2) - 28 freq
cowkit (2) - 1 freq
tackit (2) - 3 freq
feukit (2) - 1 freq
yokkit (2) - 4 freq
taukin (2) - 3 freq
tickit (2) - 3 freq
waukit (2) - 4 freq
taskit (2) - 1 freq
stockit (2) - 4 freq
stokit (2) - 1 freq
tousilt (2) - 1 freq
toukit (0) - 2 freq
joukit (2) - 15 freq
taakit (2) - 6 freq
tukt (2) - 2 freq
taekit (2) - 4 freq
roukit (2) - 1 freq
coukit (2) - 1 freq
doukit (2) - 3 freq
soukit (2) - 10 freq
boukit (2) - 5 freq
tousit (2) - 1 freq
poukit (2) - 7 freq
tourit (2) - 2 freq
leukit (3) - 301 freq
trokit (3) - 4 freq
cookit (3) - 3 freq
pookit (3) - 5 freq
bookit (3) - 3 freq
soukt (3) - 3 freq
lookit (3) - 338 freq
tooskit (3) - 1 freq
doukt (3) - 3 freq
keukit (3) - 1 freq
talkit (3) - 2 freq
stookit (3) - 1 freq
SoundEx code - T230
thocht - 2645 freq
toast - 86 freq
touched - 75 freq
tsead - 3 freq
ticht - 223 freq
twist - 45 freq
taste - 197 freq
tigged - 2 freq
tucked - 34 freq
tysday - 8 freq
ticket - 170 freq
text - 168 freq
thought - 490 freq
taught - 66 freq
tight - 71 freq
taucht - 46 freq
thochtie - 55 freq
touchit - 1 freq
tuesday - 162 freq
tuxedo - 4 freq
'thought - 2 freq
test - 145 freq
toucht - 13 freq
taskit - 1 freq
toasty - 3 freq
ticked - 5 freq
tasty - 45 freq
theikit - 6 freq
tuckt - 3 freq
tackety - 16 freq
tackity - 2 freq
toosied - 1 freq
tossed - 20 freq
tuggit - 3 freq
thackit - 3 freq
thoucht - 119 freq
taxed - 4 freq
thochty - 7 freq
taised - 1 freq
teased - 5 freq
techt - 2 freq
taketh - 1 freq
tousit - 1 freq
tugged - 4 freq
teuched - 4 freq
teucht - 1 freq
teuked - 1 freq
thicket - 1 freq
taistie - 1 freq
thoct - 7 freq
toastie - 8 freq
tost - 2 freq
tasht - 1 freq
tastie - 2 freq
teachit - 2 freq
tiched - 2 freq
takked - 1 freq
twixt - 3 freq
twisty - 2 freq
ticket'' - 1 freq
'taste - 1 freq
tickit - 3 freq
thegithe - 1 freq
togged - 1 freq
taist - 2 freq
taakt - 10 freq
tist - 1 freq
tukt - 2 freq
tecth - 1 freq
twisitt - 1 freq
tacked - 2 freq
thougty - 1 freq
ticed - 2 freq
tackett - 1 freq
taicht - 3 freq
taakit - 6 freq
twiced - 13 freq
tak'ed - 1 freq
tuckit - 10 freq
twigged - 3 freq
teacheth - 1 freq
toked - 1 freq
toast' - 1 freq
tocht - 166 freq
tact - 8 freq
tized - 5 freq
taekit - 4 freq
'taught - 1 freq
twice't - 1 freq
taaked - 13 freq
thocht-' - 1 freq
thusgate - 1 freq
tashit - 2 freq
thight - 2 freq
teuchit - 3 freq
taeset - 2 freq
tacketie - 2 freq
tised - 1 freq
'tuesday - 1 freq
thoecht - 2 freq
tacket - 1 freq
tocht-du - 1 freq
t'kut - 1 freq
toght - 1 freq
twiggit - 2 freq
twicet - 1 freq
taoist - 2 freq
taest - 1 freq
tiggit - 2 freq
thougt - 1 freq
ticketie - 1 freq
toushtie - 1 freq
tacit - 1 freq
takkity - 1 freq
thought' - 2 freq
theik't - 1 freq
tackit - 3 freq
tyesday - 1 freq
tycht - 1 freq
twyst - 1 freq
tasty' - 1 freq
€˜thocht - 3 freq
thacket - 1 freq
thigged - 1 freq
theekit - 2 freq
teached - 8 freq
tosst - 1 freq
teuchat - 1 freq
thusday - 1 freq
toucheth - 1 freq
thisgied - 1 freq
taxt - 3 freq
tashed - 2 freq
takkit - 2 freq
twycet - 1 freq
toukit - 2 freq
tickity - 1 freq
€œtouched - 1 freq
tekked - 1 freq
tought - 3 freq
t-o-c-h-t - 1 freq
€˜tocht - 1 freq
this'd - 1 freq
tasked- - 1 freq
tackitie - 1 freq
testy - 1 freq
taked - 1 freq
takd - 1 freq
thowcht - 1 freq
thoctie - 1 freq
tystie - 1 freq
tayside - 2 freq
tyseday - 1 freq
tweaked - 1 freq
txt - 8 freq
thoght - 1 freq
txhudd - 1 freq
tasked - 1 freq
tozdee - 1 freq
tooskit - 1 freq
tesswhite - 1 freq
thiught - 1 freq
ticket' - 4 freq
tickety - 5 freq
ticketty - 1 freq
toocute - 1 freq
teeside - 1 freq
toost - 1 freq
MetaPhone code - TKT
dooked - 12 freq
tigged - 2 freq
doocot - 28 freq
tucked - 34 freq
ticket - 170 freq
douked - 4 freq
decade - 30 freq
dogged - 3 freq
ticked - 5 freq
doukt - 3 freq
tuckt - 3 freq
dookit - 10 freq
tackety - 16 freq
tackity - 2 freq
dick'd - 1 freq
tuggit - 3 freq
tugged - 4 freq
teuked - 1 freq
doukit - 3 freq
dukket - 1 freq
duct - 4 freq
docket - 4 freq
takked - 1 freq
decked - 8 freq
ticket'' - 1 freq
doacked - 1 freq
tickit - 3 freq
togged - 1 freq
ducked - 4 freq
taakt - 10 freq
tukt - 2 freq
duckit - 1 freq
deckit - 6 freq
tacked - 2 freq
tackett - 1 freq
taakit - 6 freq
tak'ed - 1 freq
tuckit - 10 freq
toked - 1 freq
deckt - 3 freq
tact - 8 freq
taekit - 4 freq
decode - 2 freq
dekkid - 1 freq
taaked - 13 freq
dakota - 1 freq
tacketie - 2 freq
duggid - 1 freq
dae-guid - 2 freq
tacket - 1 freq
dockit - 4 freq
t'kut - 1 freq
decait - 2 freq
tiggit - 2 freq
ticketie - 1 freq
takkity - 1 freq
tackit - 3 freq
doocoot - 1 freq
duguid - 36 freq
takkit - 2 freq
toukit - 2 freq
tickity - 1 freq
doo-cot - 1 freq
tekked - 1 freq
dukit - 1 freq
t-o-c-h-t - 1 freq
docked - 2 freq
tackitie - 1 freq
taked - 1 freq
takd - 1 freq
dugged - 1 freq
‘dogged’ - 1 freq
ticket' - 4 freq
dquyda - 1 freq
dhgate - 1 freq
tickety - 5 freq
ticketty - 1 freq
toocute - 1 freq
ytgt - 1 freq
TOUKIT
Time to execute Levenshtein function - 0.342167 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.494301 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033537 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.052672 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001095 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.