A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to tight in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
tight (0) - 71 freq
light (1) - 285 freq
tights (1) - 15 freq
sight (1) - 134 freq
might (1) - 411 freq
ticht (1) - 223 freq
wight (1) - 1 freq
hight (1) - 2 freq
gight (1) - 3 freq
fight (1) - 94 freq
toght (1) - 1 freq
ight (1) - 1 freq
right (1) - 1385 freq
night (1) - 923 freq
thight (1) - 2 freq
eight (1) - 66 freq
dight (1) - 3 freq
aright (2) - 7 freq
tih (2) - 7 freq
high' (2) - 1 freq
night' (2) - 3 freq
dights (2) - 5 freq
dwight (2) - 1 freq
height (2) - 45 freq
nght (2) - 2 freq
Double Levenshtein (distance in brackets)
tight (0) - 71 freq
toght (1) - 1 freq
right (2) - 1385 freq
ight (2) - 1 freq
night (2) - 923 freq
dight (2) - 3 freq
tought (2) - 3 freq
taught (2) - 66 freq
thight (2) - 2 freq
eight (2) - 66 freq
light (2) - 285 freq
fight (2) - 94 freq
sight (2) - 134 freq
tights (2) - 15 freq
might (2) - 411 freq
hight (2) - 2 freq
wight (2) - 1 freq
gight (2) - 3 freq
ticht (2) - 223 freq
dighty (3) - 2 freq
tocht (3) - 166 freq
staight (3) - 1 freq
tonight (3) - 59 freq
tighten (3) - 4 freq
eighty (3) - 12 freq
SoundEx code - T230
thocht - 2645 freq
toast - 86 freq
touched - 75 freq
tsead - 3 freq
ticht - 223 freq
twist - 45 freq
taste - 197 freq
tigged - 2 freq
tucked - 34 freq
tysday - 8 freq
ticket - 170 freq
text - 168 freq
thought - 490 freq
taught - 66 freq
tight - 71 freq
taucht - 46 freq
thochtie - 55 freq
touchit - 1 freq
tuesday - 162 freq
tuxedo - 4 freq
'thought - 2 freq
test - 145 freq
toucht - 13 freq
taskit - 1 freq
toasty - 3 freq
ticked - 5 freq
tasty - 45 freq
theikit - 6 freq
tuckt - 3 freq
tackety - 16 freq
tackity - 2 freq
toosied - 1 freq
tossed - 20 freq
tuggit - 3 freq
thackit - 3 freq
thoucht - 119 freq
taxed - 4 freq
thochty - 7 freq
taised - 1 freq
teased - 5 freq
techt - 2 freq
taketh - 1 freq
tousit - 1 freq
tugged - 4 freq
teuched - 4 freq
teucht - 1 freq
teuked - 1 freq
thicket - 1 freq
taistie - 1 freq
thoct - 7 freq
toastie - 8 freq
tost - 2 freq
tasht - 1 freq
tastie - 2 freq
teachit - 2 freq
tiched - 2 freq
takked - 1 freq
twixt - 3 freq
twisty - 2 freq
ticket'' - 1 freq
'taste - 1 freq
tickit - 3 freq
thegithe - 1 freq
togged - 1 freq
taist - 2 freq
taakt - 10 freq
tist - 1 freq
tukt - 2 freq
tecth - 1 freq
twisitt - 1 freq
tacked - 2 freq
thougty - 1 freq
ticed - 2 freq
tackett - 1 freq
taicht - 3 freq
taakit - 6 freq
twiced - 13 freq
tak'ed - 1 freq
tuckit - 10 freq
twigged - 3 freq
teacheth - 1 freq
toked - 1 freq
toast' - 1 freq
tocht - 166 freq
tact - 8 freq
tized - 5 freq
taekit - 4 freq
'taught - 1 freq
twice't - 1 freq
taaked - 13 freq
thocht-' - 1 freq
thusgate - 1 freq
tashit - 2 freq
thight - 2 freq
teuchit - 3 freq
taeset - 2 freq
tacketie - 2 freq
tised - 1 freq
'tuesday - 1 freq
thoecht - 2 freq
tacket - 1 freq
tocht-du - 1 freq
t'kut - 1 freq
toght - 1 freq
twiggit - 2 freq
twicet - 1 freq
taoist - 2 freq
taest - 1 freq
tiggit - 2 freq
thougt - 1 freq
ticketie - 1 freq
toushtie - 1 freq
tacit - 1 freq
takkity - 1 freq
thought' - 2 freq
theik't - 1 freq
tackit - 3 freq
tyesday - 1 freq
tycht - 1 freq
twyst - 1 freq
tasty' - 1 freq
‘thocht - 3 freq
thacket - 1 freq
thigged - 1 freq
theekit - 2 freq
teached - 8 freq
tosst - 1 freq
teuchat - 1 freq
thusday - 1 freq
toucheth - 1 freq
thisgied - 1 freq
taxt - 3 freq
tashed - 2 freq
takkit - 2 freq
twycet - 1 freq
toukit - 2 freq
tickity - 1 freq
“touched - 1 freq
tekked - 1 freq
tought - 3 freq
t-o-c-h-t - 1 freq
‘tocht - 1 freq
this'd - 1 freq
tasked- - 1 freq
tackitie - 1 freq
testy - 1 freq
taked - 1 freq
takd - 1 freq
thowcht - 1 freq
thoctie - 1 freq
tystie - 1 freq
tayside - 2 freq
tyseday - 1 freq
tweaked - 1 freq
txt - 8 freq
thoght - 1 freq
txhudd - 1 freq
tasked - 1 freq
tozdee - 1 freq
tooskit - 1 freq
tesswhite - 1 freq
thiught - 1 freq
ticket' - 4 freq
tickety - 5 freq
ticketty - 1 freq
toocute - 1 freq
teeside - 1 freq
toost - 1 freq
MetaPhone code - TFT
daft - 436 freq
daftie - 23 freq
dauvit - 95 freq
taught - 66 freq
tight - 71 freq
devoid - 7 freq
deaved - 13 freq
divvied - 2 freq
dafty - 29 freq
'daft - 2 freq
david - 230 freq
doft - 1 freq
defeat - 30 freq
divide - 24 freq
divot - 13 freq
daavit - 25 freq
'daavit - 1 freq
defait - 7 freq
defaut - 20 freq
dived - 21 freq
duvet - 20 freq
daffed - 3 freq
tuft - 1 freq
defied - 5 freq
'dauvit - 2 freq
divid - 10 freq
dvd - 11 freq
dafft - 2 freq
dayvideee - 2 freq
deeved - 6 freq
taft - 1 freq
devout - 4 freq
daivit - 4 freq
devide - 1 freq
dighty - 2 freq
davit - 23 freq
divïd - 8 freq
taffeta - 1 freq
'taught - 1 freq
dight - 3 freq
toffed - 1 freq
doffed - 3 freq
tift - 4 freq
tuffet - 2 freq
tuffed - 1 freq
'david - 2 freq
deft - 4 freq
toght - 1 freq
deived - 1 freq
divvy-oot - 1 freq
“dauvit - 3 freq
“david - 1 freq
tuftie - 1 freq
toved - 1 freq
‘devout - 1 freq
tought - 3 freq
deavit - 1 freq
daavid - 12 freq
daavd - 1 freq
teviot - 2 freq
“daft - 1 freq
tufd - 1 freq
dvd' - 1 freq
“daft - 1 freq
toft - 1 freq
'dived' - 3 freq
Manually curated - TIGHT
Time to execute Levenshtein function - 0.193854 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
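The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this page runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the prefix of a seen so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


# Distances from "tight" to a few of the words listed above.
for word in ["tight", "ticht", "tights", "height"]:
    print(word, levenshtein("tight", word))
```

For the four words shown, this gives 0, 1, 1 and 2, matching the bracketed distances in the Levenshtein list.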
Time to execute Double Levenshtein function - 0.366372 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
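A rough sketch of that idea follows. The page does not say which letters count as vowels; the set used here (a, e, i, o, u, y) is an assumption, chosen because it reproduces the scores shown above for words such as eighty and dighty. The names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = set("aeiouy")  # assumption: the page does not document its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,
                current[j - 1] + 1,
                previous[j - 1] + (ca != cb),
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character that is in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["toght", "ticht", "taught", "tocht", "eighty"]:
    print(word, double_levenshtein("tight", word))
```

A word that differs from tight only in its vowels, such as toght or taught, keeps its plain Levenshtein score, while a consonant change such as ticht is counted twice.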
Time to execute SoundEx function - 0.029331 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
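To show how such an encoding is built, here is a minimal sketch of classic American Soundex. It is an illustration only; the implementation behind this page is not shown, and the function name `soundex` and the handling of non-letters (they are simply dropped) are choices made for this example.

```python
# Consonant groups that share a Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same code
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated code can be emitted
    return (result + "000")[:4]  # pad with zeros, keep four characters


for word in ["tight", "thocht", "ticket", "tuesday"]:
    print(word, soundex(word))
```

All four words print T230, the code shown for tight above, which is why they land in the same SoundEx group despite sounding quite different.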
Time to execute MetaPhone function - 0.040858 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000846 milliseconds
Manual curation uses a lookup table (lexicon), created by hand, which links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
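In code, a lookup of this kind amounts to a dictionary from word form to lemma. The sketch below is purely illustrative: the entries in `LEXICON` and the function name `lemma` are hypothetical and are not taken from the actual lexicon behind this page.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real table is hand-built
# and its contents are not shown on this page.
LEXICON = {
    "tight": "TIGHT",
    "ticht": "TIGHT",
}


def lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma("ticht"))  # covered in this toy table
print(lemma("ablow"))  # not covered here, so None
```

A dictionary lookup takes constant time regardless of the size of the corpus, which is consistent with this method being the fastest of the five timings reported above.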