A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to train in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
train (0) - 146 freq
traan (1) - 1 freq
tradin (1) - 9 freq
thrain (1) - 1 freq
train' (1) - 1 freq
trains (1) - 27 freq
rain (1) - 672 freq
traint (1) - 1 freq
thain (1) - 6 freq
brain (1) - 151 freq
tain (1) - 42 freq
traid (1) - 3 freq
strain (1) - 22 freq
trail (1) - 57 freq
twain (1) - 1 freq
tracin (1) - 8 freq
tryin (1) - 688 freq
trein (1) - 1 freq
drain (1) - 30 freq
traik (1) - 10 freq
trait (1) - 11 freq
trais (1) - 1 freq
orain (1) - 1 freq
trvin (1) - 1 freq
grain (1) - 77 freq
Double Levenshtein
train (0) - 146 freq
tryin (1) - 688 freq
trein (1) - 1 freq
traan (1) - 1 freq
trainie (2) - 3 freq
tyran (2) - 1 freq
trainee (2) - 3 freq
trvin (2) - 1 freq
tron (2) - 11 freq
tryan (2) - 32 freq
tarin (2) - 1 freq
tryen (2) - 21 freq
triyin (2) - 2 freq
trooin (2) - 1 freq
trine (2) - 2 freq
tirin (2) - 4 freq
toarn (2) - 1 freq
orain (2) - 1 freq
tearin (2) - 10 freq
treen (2) - 4 freq
troon (2) - 5 freq
grain (2) - 77 freq
strain (2) - 22 freq
trais (2) - 1 freq
trains (2) - 27 freq
SoundEx code - T650
tryin - 688 freq
term - 98 freq
turn - 793 freq
thrawin - 17 freq
thorn - 9 freq
tirrin - 3 freq
trym - 1 freq
tearin - 10 freq
train - 146 freq
throwin - 31 freq
tryin- - 3 freq
thrown - 43 freq
tureen - 4 freq
throne - 45 freq
tirryin - 1 freq
torn - 50 freq
trenn - 1 freq
thrawn - 108 freq
tearoom - 3 freq
terrain - 10 freq
tyranny - 9 freq
thraan - 12 freq
taurn - 1 freq
tirn - 35 freq
trim - 15 freq
trauma - 8 freq
trooin - 1 freq
tram - 16 freq
teerin - 5 freq
tirin - 4 freq
tryin'ae - 1 freq
teirin - 6 freq
tourin - 5 freq
tierin - 2 freq
thrym - 10 freq
treen - 4 freq
troon - 5 freq
thrum - 12 freq
trewin - 1 freq
tryen - 21 freq
throwen - 1 freq
tryin' - 8 freq
thoarn - 11 freq
tramway - 1 freq
tron - 11 freq
thereon - 1 freq
throen - 1 freq
towerin - 12 freq
thran - 22 freq
turn' - 1 freq
trauma' - 1 freq
trehin - 7 freq
thorny - 2 freq
tryan - 32 freq
thern-wi - 1 freq
turne - 1 freq
thorn' - 1 freq
'turn - 1 freq
thraain - 1 freq
thron - 1 freq
tooerin - 1 freq
tirn-wye - 1 freq
trönnie - 1 freq
tronn - 1 freq
throwan - 4 freq
tyrone - 6 freq
tea-room - 4 freq
trein - 1 freq
thern - 1 freq
toarn - 1 freq
theorem - 1 freq
trainee - 3 freq
thairm - 3 freq
tournie - 1 freq
train' - 1 freq
triyin - 2 freq
tryan- - 1 freq
trannie - 5 freq
thrummie - 1 freq
thairin - 1 freq
tairm - 13 freq
tyran - 1 freq
traan - 1 freq
turnie - 2 freq
throw-han - 2 freq
tarin - 1 freq
thrain - 1 freq
tryne - 5 freq
thare-in - 1 freq
tharein - 1 freq
touerin - 1 freq
“trum - 1 freq
trine - 2 freq
thairn - 1 freq
there-in - 1 freq
tern - 1 freq
‘train - 1 freq
tyrian - 1 freq
threne - 1 freq
three-han - 1 freq
therein - 3 freq
trainie - 3 freq
tarim - 1 freq
“tryin - 1 freq
tranny - 2 freq
tirannie - 1 freq
trone - 6 freq
traiän - 2 freq
terryann - 3 freq
try’n - 1 freq
tierney - 1 freq
tarrin - 1 freq
'thrawn - 2 freq
‘thrawn - 1 freq
'thrawn' - 1 freq
'thran' - 1 freq
MetaPhone code - TRN
turn - 793 freq
droon - 41 freq
drawn - 88 freq
tirrin - 3 freq
daurin - 10 freq
tearin - 10 freq
train - 146 freq
tureen - 4 freq
torn - 50 freq
droun - 7 freq
dern - 21 freq
trenn - 1 freq
drain - 30 freq
draan - 23 freq
daurna - 24 freq
durin - 181 freq
terrain - 10 freq
tyranny - 9 freq
taurn - 1 freq
darn - 3 freq
tirn - 35 freq
darnae - 5 freq
trooin - 1 freq
teerin - 5 freq
dreean - 1 freq
tirin - 4 freq
drewn - 2 freq
teirin - 6 freq
tourin - 5 freq
darin - 5 freq
daurnae - 4 freq
drone - 17 freq
tierin - 2 freq
treen - 4 freq
troon - 5 freq
darenae - 4 freq
tron - 11 freq
turn' - 1 freq
darien - 8 freq
draa'in - 1 freq
turne - 1 freq
draain - 15 freq
doreen - 3 freq
'turn - 1 freq
darren - 8 freq
tooerin - 1 freq
trönnie - 1 freq
tronn - 1 freq
dorian - 2 freq
tyrone - 6 freq
trein - 1 freq
toarn - 1 freq
draaeen - 4 freq
duran - 3 freq
darena - 3 freq
draawn - 3 freq
draa-an - 1 freq
durin' - 2 freq
trainee - 3 freq
tournie - 1 freq
drane - 1 freq
train' - 1 freq
trannie - 5 freq
drywyn - 1 freq
draen - 1 freq
hyterin - 3 freq
dauran - 1 freq
tyran - 1 freq
traan - 1 freq
d'aran - 1 freq
turnie - 2 freq
daarna - 1 freq
tarin - 1 freq
tryne - 5 freq
touerin - 1 freq
dreein - 1 freq
drown - 2 freq
trine - 2 freq
'drain - 1 freq
dorine - 2 freq
tern - 1 freq
‘train - 1 freq
tyrian - 1 freq
drien - 1 freq
trainie - 3 freq
daarin - 1 freq
tranny - 2 freq
tirannie - 1 freq
trone - 6 freq
traiän - 2 freq
try’n - 1 freq
tierney - 1 freq
dooron - 5 freq
dra’n - 1 freq
darrn - 1 freq
tarrin - 1 freq
durin’ - 1 freq
Manually curated lemma - TRAIN
train - 146 freq
trainin - 59 freq
trainers - 46 freq
trains - 27 freq
trained - 27 freq
trainin' - 13 freq
training - 17 freq
trainer - 10 freq
trainies - 4 freq
trainspotting - 8 freq
trainie - 3 freq
trainspottin - 3 freq
Time to execute Levenshtein function - 0.401856 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
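As a rough illustration of the definition above, the distance can be computed with the standard dynamic-programming recurrence. This is a minimal Python sketch, not the code this site runs; the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (free if equal)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["train", "trains", "rain", "brain", "tryin"]:
        print(word, levenshtein("train", word))
```

With this sketch, `trains`, `rain`, `brain` and `tryin` each come out one edit away from `train`, in line with the distances in the Levenshtein listing.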
Time to execute Double Levenshtein function - 0.704562 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
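The idea can be sketched in a few lines of Python. This is an illustrative reconstruction rather than the site's own function: the names `double_levenshtein` and `strip_vowels` are hypothetical, and treating `y` as a vowel is an assumption inferred from the listing, where `tryin` scores 1 against `train`.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["tryin", "trains", "grain", "tyran"]:
        print(word, double_levenshtein("train", word))
```

Under these assumptions a vowel-only change such as `train` to `tryin` costs 1, while a consonant change such as `train` to `grain` is counted in both passes and costs 2, which is how consonants end up with double weight.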
Time to execute SoundEx function - 0.068039 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
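To make the encoding concrete, here is a minimal Python sketch of the classic American Soundex rules (keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, pad to four characters). It is an assumption that the site's function follows these same rules; the names `CODES` and `soundex` are chosen for the example.

```python
# Consonant groups of classic Soundex; vowels, h, w and y have no code.
CODES = {}
for digit, group in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                     "4": "l", "5": "mn", "6": "r"}.items():
    for letter in group:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if word has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]  # drop ' - and digits
    if not letters:
        return ""
    result = letters[0].upper()
    last_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of identical codes
        code = CODES.get(ch, "")
        if code and code != last_code:
            result += code
        # Letters without a code (vowels, y, accented letters) reset last_code,
        # so the same consonant code on both sides of a vowel is kept twice.
        last_code = code
    return (result + "000")[:4]  # pad with zeros, truncate to four characters


if __name__ == "__main__":
    for word in ["train", "turn", "thrawn", "throne", "term"]:
        print(word, soundex(word))
```

Each of the five sample words encodes to T650 with this sketch, matching the code shown at the head of the SoundEx listing.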
Time to execute MetaPhone function - 0.040391 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001147 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
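A lookup of this kind amounts to a dictionary from surface form to lemma. The Python sketch below is hypothetical: the `LEXICON` excerpt simply reuses the forms listed under TRAIN, and the names `lemma_of` and `variants_of` are assumptions, not the site's actual data structure.

```python
from collections import defaultdict
from typing import Optional

# Hypothetical excerpt of a hand-built lexicon: surface form -> lemma.
LEXICON = {
    "train": "TRAIN",
    "trainin": "TRAIN",
    "trainin'": "TRAIN",
    "training": "TRAIN",
    "trains": "TRAIN",
    "trained": "TRAIN",
    "trainer": "TRAIN",
    "trainers": "TRAIN",
    "trainie": "TRAIN",
    "trainies": "TRAIN",
    "trainspottin": "TRAIN",
    "trainspotting": "TRAIN",
}

# Reverse index: lemma -> every surface form recorded for it.
VARIANTS = defaultdict(list)
for form, lemma in LEXICON.items():
    VARIANTS[lemma].append(form)


def lemma_of(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> list:
    """Return all curated forms of a lemma, sorted alphabetically."""
    return sorted(VARIANTS.get(lemma.upper(), []))


if __name__ == "__main__":
    print(lemma_of("trainin"))
    print(lemma_of("ahint"))  # not in this excerpt, so None
    print(variants_of("train"))
```

Because the table is a plain dictionary, a lookup is a single hash access, which would be consistent with this method having the shortest execution time reported above. Words missing from the table simply return no result.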