A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to tym in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
tym (0) - 2 freq
tm (1) - 7 freq
hym (1) - 5 freq
tye (1) - 8 freq
cym (1) - 6 freq
tyd (1) - 4 freq
tyvm (1) - 1 freq
tim (1) - 46 freq
nym (1) - 1 freq
tyme (1) - 219 freq
trym (1) - 1 freq
ty (1) - 7 freq
iym (1) - 2 freq
tum (1) - 17 freq
tom (1) - 134 freq
tyr (1) - 1 freq
t'm (1) - 3 freq
tam (1) - 519 freq
ym (1) - 2 freq
tiym (1) - 6 freq
gym (1) - 40 freq
lym (1) - 1 freq
tuyu (2) - 1 freq
tex (2) - 25 freq
tis (2) - 38 freq
tym (0) - 2 freq
tim (1) - 46 freq
tyme (1) - 219 freq
tm (1) - 7 freq
tom (1) - 134 freq
tum (1) - 17 freq
tam (1) - 519 freq
tiym (1) - 6 freq
taim (2) - 1 freq
time (2) - 5878 freq
toom (2) - 35 freq
tame (2) - 23 freq
item (2) - 17 freq
atom (2) - 5 freq
tuim (2) - 102 freq
teem (2) - 46 freq
teum (2) - 8 freq
tome (2) - 4 freq
toam (2) - 1 freq
tme (2) - 3 freq
tume (2) - 14 freq
atm (2) - 3 freq
tmi (2) - 1 freq
team (2) - 292 freq
trym (2) - 1 freq
SoundEx code - T500
them - 5354 freq
thon - 2518 freq
then - 4451 freq
thin - 317 freq
taen - 999 freq
than - 2715 freq
time - 5878 freq
ten - 619 freq
toun - 379 freq
teen - 461 freq
'time - 4 freq
teem - 46 freq
thaim - 2515 freq
toon - 689 freq
tam - 519 freq
tom - 134 freq
tone - 153 freq
time- - 2 freq
teeny - 13 freq
tiny - 54 freq
tae-an - 2 freq
them- - 2 freq
then-ah - 3 freq
then- - 2 freq
them-aw - 1 freq
them-' - 2 freq
tea-nae - 1 freq
then--- - 1 freq
tin - 180 freq
tyne - 30 freq
tuin - 27 freq
tuwim - 2 freq
toom - 35 freq
tame - 23 freq
'tam - 14 freq
tane - 175 freq
tine - 4 freq
theme - 38 freq
tune - 146 freq
'tm - 1 freq
'then - 38 freq
thum - 416 freq
tan - 48 freq
thyme - 4 freq
tom' - 1 freq
'thon - 11 freq
th'ane - 1 freq
tame' - 3 freq
team - 292 freq
tinny - 14 freq
town - 51 freq
tam' - 1 freq
teeeeeam - 1 freq
teeeamy - 1 freq
thine - 15 freq
tuim - 102 freq
t'm - 3 freq
tae--an - 1 freq
tyme - 219 freq
thoum - 13 freq
thaim- - 1 freq
thaim-aa - 1 freq
tm - 7 freq
'tween - 1 freq
time' - 6 freq
team' - 2 freq
theem - 7 freq
theen - 2 freq
then' - 3 freq
ton - 14 freq
the-nou - 4 freq
ta'in - 1 freq
'ten - 3 freq
't'm - 3 freq
twin - 44 freq
'tom - 1 freq
ta'en - 36 freq
thawin - 2 freq
thoom - 12 freq
tham - 43 freq
tannoy - 3 freq
tina - 17 freq
timé - 1 freq
tawny - 3 freq
tain - 42 freq
tyan - 2 freq
thim - 193 freq
toin - 1 freq
taun - 1 freq
tne - 2 freq
thane - 2 freq
thin' - 1 freq
twine - 21 freq
thein - 23 freq
towen - 2 freq
theim - 12 freq
thain - 6 freq
thn - 1 freq
thone - 13 freq
tme - 3 freq
thonway - 2 freq
'than - 1 freq
them' - 5 freq
thun - 2 freq
tammie - 24 freq
thoan - 1 freq
they'm - 20 freq
thenow - 3 freq
tyin - 12 freq
tommy - 100 freq
thenoo - 14 freq
tony - 33 freq
thuma - 1 freq
tehome - 1 freq
thaem - 3 freq
tinnie - 7 freq
toon-haw - 1 freq
thon' - 4 freq
ttme - 1 freq
t'im - 1 freq
t'im' - 1 freq
tn - 8 freq
tan' - 1 freq
tum - 17 freq
thum- - 1 freq
thum-aw - 1 freq
tim - 46 freq
thin-aye - 1 freq
thim- - 2 freq
thin-a - 1 freq
them'ii - 1 freq
tion - 1 freq
towin - 1 freq
tuna - 9 freq
ti-n - 1 freq
'tommy - 2 freq
thame - 27 freq
tim' - 43 freq
tun - 4 freq
thaun - 2 freq
tome - 4 freq
ten' - 1 freq
tae-no - 1 freq
'team'' - 1 freq
tammy - 48 freq
töm - 2 freq
tiein - 1 freq
tume - 14 freq
tym - 2 freq
ti'm - 1 freq
teum - 8 freq
tummy - 4 freq
toam - 1 freq
'tina - 1 freq
tween - 2 freq
tooin - 1 freq
thaen - 2 freq
toe-an - 1 freq
tae'n - 1 freq
twenny - 7 freq
twennie - 1 freq
t'dem - 1 freq
t'ane - 1 freq
tiym - 6 freq
tynni - 1 freq
to-an - 1 freq
tea-no - 1 freq
tøm - 1 freq
'tiny' - 1 freq
tae'm - 2 freq
thaum - 1 freq
toun' - 1 freq
taeen - 1 freq
thom - 11 freq
tein - 3 freq
twein - 1 freq
'thone - 1 freq
thayme - 1 freq
thyne - 1 freq
'twein - 1 freq
taein - 6 freq
twun - 1 freq
thaim' - 1 freq
towwin - 1 freq
teenie - 54 freq
toonie - 1 freq
thinn - 2 freq
then - 6 freq
than - 1 freq
tam - 6 freq
then - 1 freq
toune - 2 freq
thon - 15 freq
'teem - 1 freq
thon - 1 freq
twene - 3 freq
thon - 1 freq
tino - 2 freq
them - 4 freq
thaim - 2 freq
tewin - 1 freq
thine - 1 freq
them - 2 freq
tyeen - 2 freq
toyin - 2 freq
thon - 2 freq
time - 1 freq
theym - 27 freq
then - 1 freq
them - 1 freq
toamy - 12 freq
ten - 1 freq
tommy - 1 freq
tien - 3 freq
than - 3 freq
ten - 1 freq
thyn - 1 freq
twain - 1 freq
tyna - 1 freq
then - 2 freq
tein - 1 freq
tean - 1 freq
taim - 1 freq
teein - 2 freq
time - 1 freq
th’n - 4 freq
thi’n - 2 freq
thi'n - 1 freq
tanoy - 1 freq
th’in - 1 freq
teamie - 1 freq
te’en - 2 freq
te'en - 2 freq
timmy - 2 freq
‘thon - 1 freq
teamni - 1 freq
toney - 1 freq
thin’ - 1 freq
tommie - 1 freq
time’ - 1 freq
tanya - 1 freq
toni - 2 freq
'thom - 1 freq
tmi - 1 freq
tywan - 1 freq
“thaim - 4 freq
time” - 1 freq
thoumie - 1 freq
teena - 3 freq
tunnneeee - 1 freq
tyoon - 1 freq
'taim' - 1 freq
toannwe - 1 freq
tomow - 1 freq
taeyma - 1 freq
MetaPhone code - TM
time - 5878 freq
'time - 4 freq
teem - 46 freq
tam - 519 freq
tom - 134 freq
time- - 2 freq
toom - 35 freq
tame - 23 freq
'tam - 14 freq
domme - 1 freq
'tm - 1 freq
tom' - 1 freq
deem - 19 freq
doom - 34 freq
tame' - 3 freq
team - 292 freq
tam' - 1 freq
teeeeeam - 1 freq
teeeamy - 1 freq
tuim - 102 freq
t'm - 3 freq
tyme - 219 freq
tm - 7 freq
time' - 6 freq
team' - 2 freq
'dam - 1 freq
tomb - 31 freq
dumb - 38 freq
't'm - 3 freq
'tom - 1 freq
dummy - 27 freq
dam - 28 freq
dem - 722 freq
demi - 1 freq
tomboy - 1 freq
timé - 1 freq
'dame - 1 freq
dim - 50 freq
dome - 21 freq
demo - 3 freq
tme - 3 freq
dame - 16 freq
tammie - 24 freq
tommy - 100 freq
ttme - 1 freq
t'im - 1 freq
t'im' - 1 freq
tum - 17 freq
tim - 46 freq
htm - 17 freq
'tommy - 2 freq
dum - 14 freq
daem - 1 freq
tim' - 43 freq
'dumb - 2 freq
tome - 4 freq
dem-aa - 1 freq
dem- - 1 freq
dom - 3 freq
'team'' - 1 freq
'dom - 2 freq
tammy - 48 freq
töm - 2 freq
tume - 14 freq
tym - 2 freq
ti'm - 1 freq
teum - 8 freq
domm - 1 freq
dumbo - 2 freq
tummy - 4 freq
toam - 1 freq
doomy - 1 freq
duma - 1 freq
tiym - 6 freq
-dom - 2 freq
tøm - 1 freq
tumb - 1 freq
dima-' - 1 freq
tae'm - 2 freq
deemie - 3 freq
duim - 1 freq
dummie - 1 freq
tombaugh - 1 freq
tam - 6 freq
'teem - 1 freq
doom - 1 freq
diem - 2 freq
dem - 4 freq
dumba - 1 freq
time - 1 freq
damb - 1 freq
toamy - 12 freq
deemy - 1 freq
dumb - 1 freq
tommy - 1 freq
damo - 10 freq
taim - 1 freq
time - 1 freq
wdma - 1 freq
teamie - 1 freq
dm - 12 freq
timmy - 2 freq
tommie - 1 freq
time’ - 1 freq
dmi - 1 freq
tmi - 1 freq
tomb” - 1 freq
time” - 1 freq
ydm - 1 freq
'taim' - 1 freq
damoa - 7 freq
tomow - 1 freq
taeyma - 1 freq
TYM
Time to execute Levenshtein function - 0.192743 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.333153 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029227 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037605 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000793 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.