A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to thought in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
thought (0) - 490 freq
'thought (1) - 2 freq
though (1) - 1167 freq
thoucht (1) - 119 freq
throught (1) - 1 freq
thiught (1) - 1 freq
thoght (1) - 1 freq
thought' (1) - 2 freq
tought (1) - 3 freq
thoughts (1) - 56 freq
thougt (1) - 1 freq
thougty (2) - 1 freq
wrought (2) - 8 freq
frought (2) - 1 freq
brought (2) - 88 freq
troughs (2) - 4 freq
fought (2) - 19 freq
thouch (2) - 12 freq
ought (2) - 11 freq
hough (2) - 4 freq
thoecht (2) - 2 freq
thight (2) - 2 freq
thoights (2) - 1 freq
through' (2) - 1 freq
bought (2) - 80 freq
thought (0) - 490 freq
thiught (1) - 1 freq
thoght (1) - 1 freq
thoughts (2) - 56 freq
thougt (2) - 1 freq
thight (2) - 2 freq
thought' (2) - 2 freq
tought (2) - 3 freq
'thought (2) - 2 freq
though (2) - 1167 freq
thoucht (2) - 119 freq
throught (2) - 1 freq
thoecht (3) - 2 freq
thoights (3) - 1 freq
thooht (3) - 1 freq
thoeht (3) - 4 freq
toght (3) - 1 freq
thocht (3) - 2645 freq
thougty (3) - 1 freq
taught (3) - 66 freq
athough (3) - 2 freq
thowcht (4) - 1 freq
hough' (4) - 1 freq
thout (4) - 2 freq
height (4) - 45 freq
SoundEx code - T230
thocht - 2645 freq
toast - 86 freq
touched - 75 freq
tsead - 3 freq
ticht - 223 freq
twist - 45 freq
taste - 197 freq
tigged - 2 freq
tucked - 34 freq
tysday - 8 freq
ticket - 170 freq
text - 168 freq
thought - 490 freq
taught - 66 freq
tight - 71 freq
taucht - 46 freq
thochtie - 55 freq
touchit - 1 freq
tuesday - 162 freq
tuxedo - 4 freq
'thought - 2 freq
test - 145 freq
toucht - 13 freq
taskit - 1 freq
toasty - 3 freq
ticked - 5 freq
tasty - 45 freq
theikit - 6 freq
tuckt - 3 freq
tackety - 16 freq
tackity - 2 freq
toosied - 1 freq
tossed - 20 freq
tuggit - 3 freq
thackit - 3 freq
thoucht - 119 freq
taxed - 4 freq
thochty - 7 freq
taised - 1 freq
teased - 5 freq
techt - 2 freq
taketh - 1 freq
tousit - 1 freq
tugged - 4 freq
teuched - 4 freq
teucht - 1 freq
teuked - 1 freq
thicket - 1 freq
taistie - 1 freq
thoct - 7 freq
toastie - 8 freq
tost - 2 freq
tasht - 1 freq
tastie - 2 freq
teachit - 2 freq
tiched - 2 freq
takked - 1 freq
twixt - 3 freq
twisty - 2 freq
ticket'' - 1 freq
'taste - 1 freq
tickit - 3 freq
thegithe - 1 freq
togged - 1 freq
taist - 2 freq
taakt - 10 freq
tist - 1 freq
tukt - 2 freq
tecth - 1 freq
twisitt - 1 freq
tacked - 2 freq
thougty - 1 freq
ticed - 2 freq
tackett - 1 freq
taicht - 3 freq
taakit - 6 freq
twiced - 13 freq
tak'ed - 1 freq
tuckit - 10 freq
twigged - 3 freq
teacheth - 1 freq
toked - 1 freq
toast' - 1 freq
tocht - 166 freq
tact - 8 freq
tized - 5 freq
taekit - 4 freq
'taught - 1 freq
twice't - 1 freq
taaked - 13 freq
thocht-' - 1 freq
thusgate - 1 freq
tashit - 2 freq
thight - 2 freq
teuchit - 3 freq
taeset - 2 freq
tacketie - 2 freq
tised - 1 freq
'tuesday - 1 freq
thoecht - 2 freq
tacket - 1 freq
tocht-du - 1 freq
t'kut - 1 freq
toght - 1 freq
twiggit - 2 freq
twicet - 1 freq
taoist - 2 freq
taest - 1 freq
tiggit - 2 freq
thougt - 1 freq
ticketie - 1 freq
toushtie - 1 freq
tacit - 1 freq
takkity - 1 freq
thought' - 2 freq
theik't - 1 freq
tackit - 3 freq
tyesday - 1 freq
tycht - 1 freq
twyst - 1 freq
tasty' - 1 freq
thocht - 3 freq
thacket - 1 freq
thigged - 1 freq
theekit - 2 freq
teached - 8 freq
tosst - 1 freq
teuchat - 1 freq
thusday - 1 freq
toucheth - 1 freq
thisgied - 1 freq
taxt - 3 freq
tashed - 2 freq
takkit - 2 freq
twycet - 1 freq
toukit - 2 freq
tickity - 1 freq
touched - 1 freq
tekked - 1 freq
tought - 3 freq
t-o-c-h-t - 1 freq
tocht - 1 freq
this'd - 1 freq
tasked- - 1 freq
tackitie - 1 freq
testy - 1 freq
taked - 1 freq
takd - 1 freq
thowcht - 1 freq
thoctie - 1 freq
tystie - 1 freq
tayside - 2 freq
tyseday - 1 freq
tweaked - 1 freq
txt - 8 freq
thoght - 1 freq
txhudd - 1 freq
tasked - 1 freq
tozdee - 1 freq
tooskit - 1 freq
tesswhite - 1 freq
thiught - 1 freq
ticket' - 4 freq
tickety - 5 freq
ticketty - 1 freq
toocute - 1 freq
teeside - 1 freq
toost - 1 freq
MetaPhone code - 0T
that - 26604 freq
'that - 72 freq
the-day - 32 freq
thit - 566 freq
they'd - 430 freq
'that' - 4 freq
thud - 17 freq
thought - 490 freq
that-ah - 1 freq
thoat - 103 freq
'thought - 2 freq
that' - 19 freq
thoeht - 4 freq
-that - 1 freq
''that - 1 freq
thay'd - 6 freq
thet - 9 freq
theday - 57 freq
theit - 2 freq
thowt - 97 freq
thate - 1 freq
thuddy - 1 freq
theyd - 10 freq
thaot - 1 freq
thid - 1 freq
th'day - 4 freq
thout - 2 freq
that'ah - 1 freq
the'd - 4 freq
'they'd - 2 freq
thut - 29 freq
thoo'd - 2 freq
thed - 1 freq
they'ed - 1 freq
that- - 1 freq
thooht - 1 freq
theat - 1 freq
thai'd - 5 freq
thought' - 2 freq
thae'd - 1 freq
that - 52 freq
that - 79 freq
they'd - 1 freq
thote - 4 freq
thud - 1 freq
that - 2 freq
that - 1 freq
that - 17 freq
thot - 21 freq
they’d - 3 freq
‘that - 1 freq
that“ - 1 freq
thd - 1 freq
thait - 3 freq
thiught - 1 freq
th’day - 1 freq
THOUGHT
think - 3026 freq
hink - 410 freq
thinks - 203 freq
hinks - 19 freq
thought - 490 freq
thoughts - 56 freq
thocht - 2645 freq
thochts - 324 freq
thinkin - 574 freq
thinking - 85 freq
thinkan - 52 freq
thïnk - 24 freq
thinkin' - 22 freq
thïnkin - 7 freq
thinkers - 5 freq
thinker - 4 freq
Time to execute Levenshtein function - 0.208748 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.370330 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029885 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040405 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000878 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.