A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to taein in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in brackets)
taein (0) - 6 freq
daein (1) - 882 freq
takin (1) - 453 freq
taxin (1) - 8 freq
twein (1) - 1 freq
taeen (1) - 1 freq
trein (1) - 1 freq
ta'in (1) - 1 freq
laein (1) - 9 freq
thein (1) - 23 freq
haein (1) - 661 freq
taen (1) - 1001 freq
gaein (1) - 147 freq
teein (1) - 2 freq
naein (1) - 1 freq
tein (1) - 3 freq
baein (1) - 20 freq
tiein (1) - 1 freq
tain (1) - 42 freq
saein (1) - 3 freq
tae'n (1) - 1 freq
waein (1) - 1 freq
tarin (1) - 1 freq
tabie (2) - 1 freq
iain (2) - 49 freq
Double Levenshtein (distance in brackets)
taein (0) - 6 freq
tein (1) - 3 freq
tiein (1) - 1 freq
tain (1) - 42 freq
teein (1) - 2 freq
taen (1) - 1001 freq
taeen (1) - 1 freq
ten (2) - 638 freq
tien (2) - 3 freq
tuin (2) - 27 freq
tin (2) - 184 freq
tyin (2) - 13 freq
teen (2) - 461 freq
tyeen (2) - 2 freq
aetin (2) - 27 freq
atein (2) - 4 freq
toin (2) - 1 freq
eatin (2) - 154 freq
atin (2) - 3 freq
toyin (2) - 2 freq
aatin (2) - 1 freq
tooin (2) - 1 freq
tean (2) - 1 freq
taun (2) - 1 freq
tan (2) - 55 freq
SoundEx code - T500
them - 5422 freq
thon - 2542 freq
then - 4541 freq
thin - 323 freq
taen - 1001 freq
than - 2763 freq
time - 5974 freq
ten - 638 freq
toun - 379 freq
teen - 461 freq
'time - 4 freq
teem - 47 freq
thaim - 2522 freq
toon - 698 freq
tam - 522 freq
tom - 134 freq
tone - 157 freq
time- - 2 freq
teeny - 13 freq
tiny - 55 freq
tae-an - 2 freq
them- - 2 freq
then-ah - 3 freq
then- - 2 freq
them-aw - 1 freq
them-' - 2 freq
tea-nae - 1 freq
then--- - 1 freq
tin - 184 freq
tyne - 30 freq
tuin - 27 freq
tuwim - 2 freq
toom - 35 freq
tame - 23 freq
'tam - 14 freq
tane - 175 freq
tine - 4 freq
theme - 39 freq
tune - 167 freq
'tm - 2 freq
'then - 38 freq
thum - 463 freq
tan - 55 freq
thyme - 4 freq
tom' - 1 freq
'thon - 11 freq
th'ane - 1 freq
tame' - 3 freq
team - 306 freq
tinny - 14 freq
town - 51 freq
tam' - 1 freq
teeeeeam - 1 freq
teeeamy - 1 freq
thine - 15 freq
tuim - 103 freq
t'm - 3 freq
tae--an - 1 freq
tyme - 220 freq
thoum - 13 freq
thaim- - 1 freq
thaim-aa - 1 freq
tm - 7 freq
'tween - 1 freq
time' - 6 freq
team' - 2 freq
theem - 7 freq
theen - 2 freq
then' - 3 freq
ton - 17 freq
the-nou - 4 freq
ta'in - 1 freq
'ten - 3 freq
't'm - 3 freq
twin - 44 freq
'tom - 1 freq
ta'en - 36 freq
thawin - 3 freq
thoom - 12 freq
tham - 43 freq
tannoy - 3 freq
tina - 17 freq
timé - 1 freq
tawny - 3 freq
tain - 42 freq
tyan - 2 freq
thim - 193 freq
toin - 1 freq
taun - 1 freq
tne - 2 freq
thane - 3 freq
thin' - 1 freq
twine - 21 freq
thein - 23 freq
towen - 2 freq
theim - 12 freq
thain - 6 freq
thn - 1 freq
thone - 13 freq
tme - 3 freq
ttttaaaahhhhaaaaaam - 1 freq
taaahhmm - 1 freq
tyoon - 4 freq
toon' - 1 freq
tammie - 25 freq
tim - 47 freq
tyin - 13 freq
thonway - 2 freq
'than - 1 freq
them' - 5 freq
thun - 2 freq
thoan - 1 freq
they'm - 20 freq
thenow - 3 freq
tommy - 100 freq
thenoo - 14 freq
tony - 33 freq
thuma - 1 freq
tehome - 1 freq
thaem - 3 freq
tinnie - 7 freq
toon-haw - 1 freq
thon' - 4 freq
ttme - 1 freq
t'im - 1 freq
t'im' - 1 freq
tn - 8 freq
tan' - 1 freq
tum - 17 freq
thum- - 1 freq
thum-aw - 1 freq
thin-aye - 1 freq
thim- - 2 freq
thin-a - 1 freq
them'ii - 1 freq
tion - 1 freq
towin - 1 freq
tuna - 9 freq
ti-n - 1 freq
'tommy - 2 freq
thame - 27 freq
tim' - 43 freq
tun - 4 freq
thaun - 2 freq
tome - 4 freq
ten' - 1 freq
tae-no - 1 freq
'team'' - 1 freq
tammy - 48 freq
töm - 2 freq
tiein - 1 freq
tume - 14 freq
tym - 2 freq
ti'm - 1 freq
teum - 8 freq
tummy - 4 freq
toam - 1 freq
'tina - 1 freq
tween - 2 freq
tooin - 1 freq
thaen - 2 freq
toe-an - 1 freq
tae'n - 1 freq
twenny - 7 freq
twennie - 1 freq
t'dem - 1 freq
t'ane - 1 freq
tiym - 6 freq
tynni - 1 freq
to-an - 1 freq
tea-no - 1 freq
tøm - 1 freq
'tiny' - 1 freq
tae'm - 2 freq
thaum - 1 freq
toun' - 1 freq
taeen - 1 freq
thom - 11 freq
tein - 3 freq
twein - 1 freq
'thone - 1 freq
thayme - 1 freq
thyne - 1 freq
'twein - 1 freq
taein - 6 freq
twun - 1 freq
thaim' - 1 freq
towwin - 1 freq
teenie - 54 freq
toonie - 1 freq
thinn - 2 freq
then - 6 freq
than - 1 freq
tam - 6 freq
then - 1 freq
toune - 2 freq
thon - 15 freq
'teem - 1 freq
thon - 1 freq
twene - 3 freq
thon - 1 freq
tino - 2 freq
them - 4 freq
thaim - 2 freq
tewin - 1 freq
thine - 1 freq
them - 2 freq
tyeen - 2 freq
toyin - 2 freq
thon - 2 freq
time - 1 freq
theym - 27 freq
then - 1 freq
them - 1 freq
toamy - 12 freq
ten - 1 freq
tommy - 1 freq
tien - 3 freq
than - 3 freq
ten - 1 freq
thyn - 1 freq
twain - 1 freq
tyna - 1 freq
then - 2 freq
tein - 1 freq
tean - 1 freq
taim - 1 freq
teein - 2 freq
time - 1 freq
th’n - 4 freq
thi’n - 2 freq
thi'n - 1 freq
tanoy - 1 freq
th’in - 1 freq
teamie - 1 freq
te’en - 2 freq
te'en - 2 freq
timmy - 2 freq
‘thon - 1 freq
teamni - 1 freq
toney - 1 freq
thin’ - 1 freq
tommie - 1 freq
time’ - 1 freq
tanya - 1 freq
toni - 2 freq
'thom - 1 freq
tmi - 1 freq
tywan - 1 freq
“thaim - 4 freq
time” - 1 freq
thoumie - 1 freq
teena - 3 freq
tunnneeee - 1 freq
'taim' - 1 freq
toannwe - 1 freq
tomow - 1 freq
taeyma - 1 freq
MetaPhone code - TN
doon - 7067 freq
dinna - 1825 freq
taen - 1001 freq
done - 821 freq
dawn - 92 freq
down - 208 freq
daein - 882 freq
ten - 638 freq
dinnae - 1942 freq
deein - 295 freq
toun - 379 freq
den - 104 freq
'dinna - 52 freq
teen - 461 freq
deen - 289 freq
diane - 9 freq
toon - 698 freq
duin - 393 freq
doun - 1278 freq
dean - 18 freq
tone - 157 freq
dinah - 75 freq
teeny - 13 freq
tiny - 55 freq
dunno - 43 freq
doon' - 9 freq
tae-an - 2 freq
dinn- - 3 freq
dinn - 6 freq
tea-nae - 1 freq
deny - 53 freq
tin - 184 freq
din - 75 freq
dwyne - 23 freq
tyne - 30 freq
tuin - 27 freq
dinny - 10 freq
dune - 103 freq
dain - 144 freq
tane - 175 freq
tine - 4 freq
tune - 167 freq
tan - 55 freq
daen - 223 freq
daena - 26 freq
wytin - 66 freq
tinny - 14 freq
danny - 135 freq
'danny - 1 freq
town - 51 freq
dooin - 2 freq
doony - 2 freq
'dunno - 1 freq
doin - 75 freq
dae'in - 1 freq
dine - 26 freq
'dinnae - 63 freq
tae--an - 1 freq
dinn-- - 1 freq
downy - 1 freq
dinghy - 3 freq
dinae - 40 freq
daein' - 14 freq
ton - 17 freq
ta'in - 1 freq
'ten - 3 freq
'doon - 6 freq
'deein - 2 freq
don - 260 freq
ta'en - 36 freq
dein - 22 freq
dann - 9 freq
tannoy - 3 freq
tina - 17 freq
dana - 1 freq
dan - 472 freq
dun - 36 freq
tawny - 3 freq
tain - 42 freq
toin - 1 freq
dinnnae - 1 freq
day-in - 1 freq
deign - 1 freq
taun - 1 freq
tne - 2 freq
downie - 2 freq
duun - 1 freq
dina - 43 freq
da'en - 1 freq
hydin - 2 freq
ddnae - 1 freq
dno - 1 freq
toon' - 1 freq
dene - 3 freq
deean - 3 freq
daenae - 4 freq
'done - 4 freq
diine - 2 freq
dna - 16 freq
doenae - 1 freq
tony - 33 freq
dianae - 1 freq
tinnie - 7 freq
diana - 8 freq
tn - 8 freq
tan' - 1 freq
denn- - 1 freq
denn - 3 freq
deuan - 16 freq
deun - 31 freq
dane - 55 freq
dinah' - 1 freq
tuna - 9 freq
don' - 1 freq
ti-n - 1 freq
dïnnae - 124 freq
'dïnnae - 4 freq
dinnae' - 2 freq
done' - 1 freq
wydin - 2 freq
tun - 4 freq
'wytin - 1 freq
deane - 3 freq
daen' - 1 freq
dee'in - 3 freq
ten' - 1 freq
'daein - 2 freq
dönna - 4 freq
denee - 1 freq
dunna - 159 freq
döne - 6 freq
döin - 16 freq
tae-no - 1 freq
'dan - 2 freq
dön - 52 freq
'dunna - 4 freq
ddin - 5 freq
döön - 1 freq
down' - 3 freq
dae'n - 2 freq
tiein - 1 freq
dae-in - 3 freq
dinno - 16 freq
'deen' - 1 freq
dønna - 1 freq
'tina - 1 freq
daean - 8 freq
tooin - 1 freq
toe-an - 1 freq
tae'n - 1 freq
dona - 1 freq
doein - 1 freq
duina - 6 freq
t'ane - 1 freq
tynni - 1 freq
'dain - 1 freq
doan - 9 freq
to-an - 1 freq
tea-no - 1 freq
døn - 8 freq
døin - 5 freq
doen - 4 freq
'tiny' - 1 freq
dunnae - 3 freq
deen- - 1 freq
toun' - 1 freq
taeen - 1 freq
duana - 1 freq
dun' - 2 freq
tein - 3 freq
dàin - 1 freq
dyne - 2 freq
taein - 6 freq
dunn - 8 freq
towwin - 1 freq
dan - 1 freq
dione - 1 freq
diinne - 1 freq
teenie - 54 freq
donnie - 9 freq
toonie - 1 freq
don - 12 freq
dinna - 1 freq
douna - 1 freq
down - 1 freq
dinnae - 7 freq
dün - 1 freq
toune - 2 freq
dunnie - 6 freq
dane' - 1 freq
dinna - 34 freq
dinnae - 19 freq
tino - 2 freq
doon - 1 freq
doun - 1 freq
dunny - 1 freq
don - 10 freq
dinnna - 1 freq
dunna - 3 freq
dunna - 1 freq
dien - 1 freq
dunnoo - 2 freq
danny - 7 freq
doin' - 1 freq
'don - 1 freq
donna - 51 freq
diean - 1 freq
doon-o - 1 freq
dinna - 4 freq
daun - 1 freq
'daein' - 1 freq
daein - 1 freq
doune - 3 freq
dain' - 2 freq
dinni - 10 freq
dini - 9 freq
ten - 1 freq
doon - 2 freq
tien - 3 freq
daena - 3 freq
ten - 1 freq
diin - 2 freq
don - 1 freq
dùn - 1 freq
dinnae - 1 freq
tyna - 1 freq
done - 1 freq
tein - 1 freq
denie - 1 freq
tean - 1 freq
dinnae - 2 freq
teein - 2 freq
tanoy - 1 freq
dee’in - 1 freq
te’en - 2 freq
te'en - 2 freq
“dinna - 1 freq
duine - 1 freq
toney - 1 freq
‘done - 1 freq
downey - 1 freq
deano - 1 freq
wtin - 1 freq
toni - 2 freq
de'in - 4 freq
deein' - 1 freq
'dinna' - 1 freq
deain - 1 freq
denny - 2 freq
daein” - 1 freq
deaein - 1 freq
“daein - 1 freq
deen' - 1 freq
teena - 3 freq
“dunna - 2 freq
“dinnae - 1 freq
tunnneeee - 1 freq
ytno - 1 freq
donny - 1 freq
deena - 1 freq
deun' - 1 freq
dinnea - 4 freq
Manually curated - TAEIN
Time to execute Levenshtein function - 0.246743 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
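As a rough illustration of that definition, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs, and the function name `levenshtein` is only an assumption for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn a into b (textbook dynamic-programming version)."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("taein", "daein"))  # one replacement: t -> d
print(levenshtein("taein", "iain"))   # one replacement and one deletion
```

A distance of 0 means the two words are identical, which is why the search word itself heads the list.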
Time to execute Double Levenshtein function - 0.469273 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
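The idea can be sketched in a few lines of Python. This is an illustration under assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are made up for the example, and treating y as a vowel is a guess, chosen because it gives a distance of 2 for taein and tyin, matching the list above.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete)."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton (taein -> tn)."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("taein", "tein"))   # skeletons match, only the vowel edit counts
print(double_levenshtein("taein", "daein"))  # the t -> d change is counted twice
```

Because a changed consonant shows up in both passes, daein scores 2 here although its plain Levenshtein distance from taein is 1, while a vowel-only variant such as tein stays at 1.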
Time to execute SoundEx function - 0.031366 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
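For illustration, a small Python sketch of the classic American Soundex rules follows. It is an assumption that the site uses this variant, and skipping every character outside a-z (apostrophes, hyphens, accented letters) is a simplification made for the example.

```python
# Letter groups and their digits in classic Soundex.
CODES = {}
for digit, group in enumerate(("bfpv", "cgjkqsxz", "dt", "l", "mn", "r"), start=1):
    for letter in group:
        CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """Return the 4-character Soundex code, e.g. T500 for taein."""
    # Simplifying assumption: ignore anything that is not a plain a-z letter.
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "hw":
            continue  # h and w do not break a run of identical codes
        code = CODES.get(c, "")  # vowels and y have no code
        if code and code != previous:
            result += code
        previous = code  # a vowel resets the run, so a repeated code is kept
        if len(result) == 4:
            break
    return (result + "000")[:4]  # pad short codes with zeros


for w in ("taein", "them", "twenny", "thawin"):
    print(w, soundex(w))
```

Only the first letter and the consonant classes survive, which is why such a long and varied list of words shares the single code T500.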
Time to execute MetaPhone function - 0.076627 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000998 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
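A lookup of this kind might look like the Python sketch below. The real lexicon is not shown on this page, so `LEXICON` and `lemma_for` are hypothetical names, and every entry except taein (whose result TAEIN appears above) is invented for illustration.

```python
from typing import Optional

# Hand-built table mapping spellings to lemmas.
# Only the "taein" entry reflects the result shown on this page;
# the other entries are hypothetical examples.
LEXICON = {
    "taein": "TAEIN",
    "ta'in": "TAEIN",  # hypothetical entry
    "taeen": "TAEIN",  # hypothetical entry
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("taein"))  # TAEIN
print(lemma_for("xyz"))    # None, because the word is not in the table
```

A plain dictionary lookup does no distance calculation at all, which fits with this being the fastest of the five timings.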