A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to terrain in Corpus

Levenshtein
terrain (0) - 10 freq
terrains (1) - 1 freq
terracin (1) - 2 freq
tarrin (2) - 1 freq
retrain (2) - 2 freq
kerryin (2) - 12 freq
errin (2) - 1 freq
serrin (2) - 5 freq
terrie (2) - 2 freq
pertain (2) - 1 freq
werain (2) - 1 freq
erran (2) - 3 freq
ferryin (2) - 2 freq
refrain (2) - 13 freq
merryin (2) - 3 freq
terrace (2) - 24 freq
herrin (2) - 24 freq
kerrjin (2) - 2 freq
teirin (2) - 6 freq
serrsin (2) - 1 freq
terryann (2) - 3 freq
terra (2) - 1 freq
thrain (2) - 1 freq
certain (2) - 150 freq
teerin (2) - 5 freq
Double Levenshtein
terrain (0) - 10 freq
tirryin (2) - 1 freq
tarrin (2) - 1 freq
tirrin (2) - 3 freq
terracin (2) - 2 freq
terrains (2) - 1 freq
sterrin (3) - 1 freq
thraain (3) - 1 freq
sterran (3) - 1 freq
teerin (3) - 5 freq
thrain (3) - 1 freq
merran (3) - 4 freq
werrin (3) - 3 freq
tertan (3) - 1 freq
train (3) - 147 freq
tearin (3) - 12 freq
terra (3) - 1 freq
lorrain (3) - 1 freq
merrin (3) - 1 freq
herryin (3) - 2 freq
perrin (3) - 2 freq
merryin (3) - 3 freq
terrie (3) - 2 freq
ferryin (3) - 2 freq
erran (3) - 3 freq
SoundEx code - T650
tryin - 702 freq
term - 99 freq
turn - 818 freq
thrawin - 17 freq
thorn - 9 freq
tirrin - 3 freq
trym - 1 freq
tearin - 12 freq
train - 147 freq
throwin - 34 freq
tryin- - 3 freq
thrown - 43 freq
tureen - 4 freq
throne - 45 freq
tirryin - 1 freq
torn - 51 freq
trenn - 1 freq
thrawn - 109 freq
tearoom - 3 freq
terrain - 10 freq
tyranny - 9 freq
thraan - 12 freq
taurn - 1 freq
tirn - 35 freq
trim - 16 freq
trauma - 9 freq
trooin - 1 freq
tram - 23 freq
teerin - 5 freq
tirin - 4 freq
tryin'ae - 3 freq
teirin - 6 freq
tourin - 5 freq
tierin - 2 freq
thrym - 10 freq
treen - 4 freq
troon - 5 freq
thrum - 12 freq
trewin - 1 freq
tryen - 21 freq
throwen - 1 freq
tryin' - 8 freq
thorny - 3 freq
try'nae - 3 freq
try'n - 2 freq
tryn'ae - 1 freq
thoarn - 11 freq
tramway - 1 freq
tron - 11 freq
thereon - 1 freq
throen - 1 freq
towerin - 12 freq
thran - 22 freq
turn' - 1 freq
trauma' - 1 freq
trehin - 7 freq
tryan - 32 freq
thern-wi - 1 freq
turne - 1 freq
thorn' - 1 freq
'turn - 1 freq
thraain - 1 freq
thron - 1 freq
tooerin - 1 freq
tirn-wye - 1 freq
trönnie - 1 freq
tronn - 1 freq
throwan - 4 freq
tyrone - 6 freq
tea-room - 4 freq
trein - 1 freq
thern - 1 freq
toarn - 1 freq
theorem - 1 freq
trainee - 3 freq
thairm - 3 freq
tournie - 1 freq
train' - 1 freq
triyin - 2 freq
tryan- - 1 freq
trannie - 5 freq
thrummie - 1 freq
thairin - 1 freq
tairm - 13 freq
tyran - 1 freq
traan - 1 freq
turnie - 2 freq
throw-han - 2 freq
tarin - 1 freq
thrain - 1 freq
tryne - 5 freq
thare-in - 1 freq
tharein - 1 freq
touerin - 1 freq
“trum - 1 freq
trine - 2 freq
thairn - 1 freq
there-in - 1 freq
tern - 1 freq
‘train - 1 freq
tyrian - 1 freq
threne - 1 freq
three-han - 1 freq
therein - 3 freq
trainie - 3 freq
tarim - 1 freq
“tryin - 1 freq
tranny - 2 freq
tirannie - 1 freq
trone - 6 freq
traiän - 2 freq
terryann - 3 freq
try’n - 1 freq
tierney - 1 freq
tarrin - 1 freq
'thrawn - 2 freq
‘thrawn - 1 freq
'thrawn' - 1 freq
'thran' - 1 freq
MetaPhone code - TRN
turn - 818 freq
droon - 42 freq
drawn - 90 freq
tirrin - 3 freq
daurin - 10 freq
tearin - 12 freq
train - 147 freq
tureen - 4 freq
torn - 51 freq
droun - 7 freq
dern - 21 freq
trenn - 1 freq
drain - 30 freq
draan - 23 freq
daurna - 24 freq
durin - 183 freq
terrain - 10 freq
tyranny - 9 freq
taurn - 1 freq
darn - 3 freq
tirn - 35 freq
darnae - 5 freq
trooin - 1 freq
teerin - 5 freq
dreean - 1 freq
tirin - 4 freq
drewn - 2 freq
teirin - 6 freq
tourin - 5 freq
darin - 5 freq
daurnae - 4 freq
drone - 19 freq
tierin - 2 freq
treen - 4 freq
troon - 5 freq
darenae - 4 freq
try'nae - 3 freq
try'n - 2 freq
droney - 2 freq
tryn'ae - 1 freq
'dorian - 1 freq
dorian - 32 freq
'dorian' - 1 freq
tron - 11 freq
turn' - 1 freq
darien - 8 freq
draa'in - 1 freq
turne - 1 freq
draain - 15 freq
doreen - 3 freq
'turn - 1 freq
darren - 8 freq
tooerin - 1 freq
trönnie - 1 freq
tronn - 1 freq
tyrone - 6 freq
trein - 1 freq
toarn - 1 freq
draaeen - 4 freq
duran - 3 freq
darena - 3 freq
draawn - 3 freq
draa-an - 1 freq
durin' - 2 freq
trainee - 3 freq
tournie - 1 freq
drane - 1 freq
train' - 1 freq
trannie - 5 freq
drywyn - 1 freq
draen - 1 freq
hyterin - 3 freq
dauran - 1 freq
tyran - 1 freq
traan - 1 freq
d'aran - 1 freq
turnie - 2 freq
daarna - 1 freq
tarin - 1 freq
tryne - 5 freq
touerin - 1 freq
dreein - 1 freq
drown - 2 freq
trine - 2 freq
'drain - 1 freq
dorine - 2 freq
tern - 1 freq
‘train - 1 freq
tyrian - 1 freq
drien - 1 freq
trainie - 3 freq
daarin - 1 freq
tranny - 2 freq
tirannie - 1 freq
trone - 6 freq
traiän - 2 freq
try’n - 1 freq
tierney - 1 freq
dooron - 5 freq
dra’n - 1 freq
darrn - 1 freq
tarrin - 1 freq
durin’ - 1 freq
Manually curated lemma - TERRAIN
Time to execute Levenshtein function - 0.203413 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
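The definition above can be sketched as a short dynamic-programming routine. This is a minimal illustration in Python, not the code that runs on this site; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1,          # delete ca
                           cur[j - 1] + 1,       # insert cb
                           prev[j - 1] + cost))  # replace ca with cb, or keep it
        prev = cur
    return prev[-1]


print(levenshtein("terrain", "terrains"))
print(levenshtein("terrain", "retrain"))
```

The two calls correspond to the entries terrains (1) and retrain (2) in the Levenshtein list above.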
Time to execute Double Levenshtein function - 0.363624 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
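A rough sketch of that idea in Python follows. The names `double_levenshtein` and `strip_vowels` are hypothetical, and the vowel set is an assumption: treating y as a vowel alongside a, e, i, o, u reproduces scores in the list above such as merryin (3), but the site's actual rule is not documented here.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost))
        prev = cur
    return prev[-1]


VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    a, b = a.lower(), b.lower()
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("terrain", "teerin"))
```

For teerin the full-word distance is 2 and the consonant skeletons trrn and trn differ by 1, which adds up to the 3 shown in the Double Levenshtein list.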
Time to execute SoundEx function - 0.030669 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
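To illustrate how a code such as T650 is arrived at, here is a minimal sketch of the classic American Soundex rules in Python. It is an illustration only and may differ in edge cases from the function used by this site; the name `soundex` and the decision to ignore non-ASCII letters and punctuation are assumptions made for the example.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Assumption: only ASCII letters take part; hyphens, quotes etc. are dropped.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "hw":
            continue  # h and w do not separate consonants with the same digit
        code = CODES.get(c, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated digit may follow it
    return (result + "000")[:4]  # pad with zeros, keep four characters


print(soundex("terrain"), soundex("train"), soundex("tearoom"))
```

For terrain the first letter T is kept, the double r collapses to a single 6, n gives 5, and the code is padded with a zero, which is why train and tearoom land in the same T650 group.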
Time to execute MetaPhone function - 0.037082 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000911 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
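A lookup of this kind can be sketched as a plain dictionary. The table below is a toy stand-in, not the site's lexicon: the terrain entry follows the result shown on this page, while the terrains entry and the names `LEXICON` and `lookup_lemma` are hypothetical.

```python
from typing import Optional

# Toy lexicon mapping a word form to its lemma.
LEXICON = {
    "terrain": "TERRAIN",   # pairing suggested by the result on this page
    "terrains": "TERRAIN",  # hypothetical entry, for illustration only
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the lemma for word, or None if the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("terrain"))
print(lookup_lemma("sonsie"))
```

A dictionary lookup involves no string comparison across the corpus, which fits the very small execution time reported for this method, and a missing word simply returns nothing.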