A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to toss in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
toss (0) - 27 freq
goss (1) - 1 freq
tosh (1) - 7 freq
soss (1) - 15 freq
tosst (1) - 1 freq
tobs (1) - 1 freq
toks (1) - 1 freq
tos (1) - 3 freq
boss (1) - 94 freq
tss (1) - 2 freq
tods (1) - 48 freq
tots (1) - 4 freq
loss (1) - 200 freq
tors (1) - 2 freq
toes (1) - 48 freq
togs (1) - 1 freq
toass (1) - 2 freq
moss (1) - 99 freq
toys (1) - 45 freq
joss (1) - 2 freq
doss (1) - 5 freq
tess (1) - 1 freq
tost (1) - 2 freq
hoss (1) - 1 freq
tows (1) - 2 freq
toss (0) - 27 freq
toass (1) - 2 freq
tess (1) - 1 freq
tss (1) - 2 freq
tass (1) - 1 freq
hoss (2) - 1 freq
tows (2) - 2 freq
tost (2) - 2 freq
tosh (2) - 7 freq
doss (2) - 5 freq
voss (2) - 10 freq
poss (2) - 5 freq
tobs (2) - 1 freq
goss (2) - 1 freq
ross (2) - 102 freq
toms (2) - 1 freq
tops (2) - 21 freq
joss (2) - 2 freq
tons (2) - 10 freq
tosst (2) - 1 freq
tots (2) - 4 freq
boss (2) - 94 freq
toys (2) - 45 freq
toks (2) - 1 freq
tos (2) - 3 freq
SoundEx code - T200
this - 11303 freq
took - 1049 freq
'this - 73 freq
touch - 257 freq
though - 1213 freq
teach - 86 freq
those - 296 freq
these - 1129 freq
tak - 2834 freq
taks - 458 freq
thick - 227 freq
taes - 107 freq
take - 706 freq
tick - 69 freq
ties - 31 freq
task - 57 freq
twice - 188 freq
thase - 13 freq
teuk - 221 freq
tosh - 7 freq
tousie - 11 freq
teuch - 31 freq
take- - 1 freq
toays - 1 freq
this-ah - 1 freq
tea's - 4 freq
ts - 44 freq
tack - 43 freq
this- - 1 freq
twos - 4 freq
'tis - 9 freq
tac - 22 freq
twice- - 3 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
tuik - 313 freq
tyke - 27 freq
tassie - 37 freq
thus - 40 freq
twiggy - 1 freq
tsk - 2 freq
tackie - 2 freq
this' - 3 freq
tesco - 58 freq
tosie - 4 freq
taak - 94 freq
'tak - 24 freq
'these - 4 freq
tag - 32 freq
twoz - 1 freq
tuck - 26 freq
toi's - 1 freq
taz - 21 freq
taz's - 3 freq
'taz - 1 freq
taxi - 85 freq
tech - 8 freq
tissue - 6 freq
tease - 10 freq
tak' - 81 freq
twas - 16 freq
techie - 4 freq
tawse - 14 freq
toys - 45 freq
tazzy - 1 freq
twaes - 2 freq
tka - 1 freq
tusk - 5 freq
tizz - 1 freq
tizzy - 2 freq
t'wis - 2 freq
tawsie - 1 freq
'twis - 32 freq
tough - 43 freq
twis - 13 freq
teas - 3 freq
ticky - 2 freq
theek - 7 freq
thes - 12 freq
theez - 2 freq
thees - 4 freq
twig - 18 freq
thig - 3 freq
touzie - 3 freq
tacks - 3 freq
'tough - 1 freq
touchy - 9 freq
tusks - 7 freq
toc - 2 freq
toosie - 2 freq
tags - 6 freq
tchau - 1 freq
tax - 61 freq
t'was - 2 freq
togs - 1 freq
theik - 8 freq
takk - 89 freq
teeicks - 1 freq
tuk - 227 freq
tug - 11 freq
tucks - 6 freq
thigh - 17 freq
t's - 9 freq
towsie - 3 freq
thack - 2 freq
tig - 21 freq
tozie - 3 freq
tigs - 3 freq
tike - 2 freq
twyse - 1 freq
tae's - 5 freq
theis - 19 freq
tak's - 4 freq
teesh - 5 freq
tik - 4 freq
taka - 2 freq
thj - 1 freq
'tt's - 1 freq
ticks - 13 freq
tocks - 2 freq
ths - 5 freq
tae-us - 1 freq
tiks - 1 freq
toch - 1 freq
thusa - 1 freq
tashy - 2 freq
tash - 4 freq
takkke - 1 freq
tock - 18 freq
thak - 1 freq
teckie - 1 freq
twigs - 12 freq
'twa's - 1 freq
tke - 2 freq
tacki' - 1 freq
takeaway - 10 freq
tuco - 1 freq
tess - 1 freq
tic - 12 freq
'teach - 1 freq
'twes - 2 freq
twa's - 6 freq
thou's - 1 freq
t'is - 6 freq
tasks - 13 freq
teachee - 3 freq
thiz - 1 freq
tweak - 2 freq
thug - 6 freq
t--s - 1 freq
toass - 2 freq
thoch - 2 freq
toshie - 15 freq
tucky - 12 freq
tej - 1 freq
'tes - 1 freq
thïs - 233 freq
taich - 6 freq
ïts - 16 freq
tex - 25 freq
ït's - 16 freq
°tak - 1 freq
tyes - 1 freq
°these - 1 freq
thïs' - 1 freq
'thïs - 2 freq
taaks - 10 freq
thas - 2 freq
ta'k - 1 freq
taks' - 2 freq
thochy - 1 freq
tokyo - 2 freq
t'wus - 7 freq
tig's - 1 freq
taws - 1 freq
'take - 1 freq
tyaach - 1 freq
tees - 3 freq
t'sae - 1 freq
tagsy - 1 freq
tukk - 1 freq
twise - 11 freq
taeks - 1 freq
'this' - 2 freq
'those' - 1 freq
'teach' - 2 freq
'twas - 11 freq
tice - 3 freq
'though - 2 freq
thoche - 1 freq
'ts - 3 freq
towes - 6 freq
tiche - 1 freq
this-' - 1 freq
tise - 3 freq
tagsie - 1 freq
these' - 1 freq
'those - 1 freq
toogs - 6 freq
tö-tak - 1 freq
twaese - 3 freq
té-tak - 1 freq
takkaway - 2 freq
tæs - 1 freq
tuiks - 1 freq
tiso - 1 freq
tuaks - 1 freq
teeick - 2 freq
touchie - 30 freq
tch - 3 freq
taj - 2 freq
tuke - 1 freq
tass - 1 freq
toque - 1 freq
touk - 7 freq
thouch - 12 freq
tyeuch - 1 freq
tok - 2 freq
toks - 1 freq
thows - 1 freq
tweesh - 1 freq
tds - 3 freq
tss - 2 freq
td's - 3 freq
taik - 4 freq
't's - 2 freq
tøk - 3 freq
tize - 4 freq
tyoch - 1 freq
taas - 1 freq
tyeuk - 21 freq
toga - 2 freq
teech - 2 freq
thugs - 4 freq
taigs - 2 freq
Ötzi - 1 freq
tiggs - 1 freq
tush - 1 freq
tows - 2 freq
€˜tak - 3 freq
tikk - 1 freq
teck - 1 freq
tycho - 2 freq
€˜twas - 1 freq
€˜this - 22 freq
tea-hoose - 1 freq
€™tis - 1 freq
tauk - 2 freq
thys - 1 freq
thik - 3 freq
€œthis - 38 freq
takks - 4 freq
takk-aa - 1 freq
thaws - 1 freq
'takk - 1 freq
''twis - 1 freq
tich - 2 freq
thies - 1 freq
€œtak - 10 freq
tyse - 1 freq
teugs - 1 freq
twa-wyes - 1 freq
€˜those - 6 freq
tyso - 1 freq
€˜tax - 1 freq
ttish - 1 freq
€˜these - 1 freq
tawk - 4 freq
take-away - 1 freq
tweaks - 1 freq
takawa - 2 freq
toke - 2 freq
€œtake - 7 freq
€˜take - 2 freq
toiys - 1 freq
texi - 1 freq
thooasa - 1 freq
tesk - 1 freq
tuek - 1 freq
tos - 3 freq
taki - 1 freq
€˜taxi - 1 freq
€œthough - 1 freq
tea-houss - 1 freq
tuggs - 1 freq
tugs - 1 freq
€œtuck - 1 freq
€œtakk - 1 freq
€™this - 2 freq
tashi - 8 freq
€œthese - 2 freq
€œthose - 1 freq
€œtaks - 1 freq
€”though - 1 freq
teugh - 1 freq
€œtis - 2 freq
thae's - 1 freq
tues - 6 freq
tushie - 4 freq
€œtushie - 4 freq
tuc - 1 freq
ts'e - 3 freq
€™twes - 1 freq
taech - 1 freq
tgi - 1 freq
tuggy - 1 freq
€™take - 1 freq
texhii - 2 freq
tckckw - 1 freq
taco - 1 freq
tc - 3 freq
tgzg - 1 freq
tjo - 2 freq
tju - 1 freq
tx - 3 freq
“this - 3 freq
tg - 7 freq
tjw - 1 freq
tq - 2 freq
tache - 1 freq
tux - 1 freq
thuck - 1 freq
thake - 1 freq
ti's - 1 freq
thsh - 1 freq
tickie - 1 freq
tikka - 2 freq
tiz - 3 freq
takÂ’s - 1 freq
tezza - 1 freq
tackawa - 1 freq
tzx - 1 freq
tgsu - 1 freq
tkee - 1 freq
tk - 3 freq
tooÂ’s - 1 freq
‘twas - 1 freq
tcq - 1 freq
tuoj - 1 freq
tcg - 1 freq
thc - 3 freq
tcsoa - 1 freq
teuchie - 1 freq
twix - 1 freq
tachy - 1 freq
tauq - 1 freq
ttdku - 1 freq
tqj - 1 freq
toq - 1 freq
tdzcqkx - 1 freq
thos - 1 freq
tha's - 3 freq
tzu - 1 freq
thzu - 1 freq
tooj - 1 freq
twqkz - 1 freq
tkz - 1 freq
tes - 2 freq
tihz - 1 freq
taes' - 1 freq
ttkq - 1 freq
ttj - 1 freq
tac' - 1 freq
teache - 1 freq
thicko - 1 freq
tj - 3 freq
tokie - 1 freq
”this - 1 freq
tdx - 1 freq
thcki - 1 freq
tyqq - 1 freq
MetaPhone code - TS
doos - 101 freq
does - 385 freq
days - 1574 freq
taes - 107 freq
ties - 31 freq
douce - 128 freq
dis - 1552 freq
daes - 213 freq
tousie - 11 freq
daisy - 62 freq
toays - 1 freq
diz - 55 freq
tea's - 4 freq
ts - 44 freq
doze - 10 freq
'tis - 9 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
daise - 2 freq
day's - 105 freq
dizzy - 19 freq
tassie - 37 freq
dose - 45 freq
tosie - 4 freq
das - 23 freq
toi's - 1 freq
taz - 21 freq
'taz - 1 freq
tissue - 6 freq
days' - 12 freq
tease - 10 freq
tawse - 14 freq
dy's - 2 freq
dozy - 10 freq
toys - 45 freq
tazzy - 1 freq
tizz - 1 freq
tizzy - 2 freq
tawsie - 1 freq
teas - 3 freq
dozie - 5 freq
touzie - 3 freq
daws - 5 freq
da's - 67 freq
douse - 3 freq
toosie - 2 freq
duis - 23 freq
dos - 4 freq
dees - 41 freq
'daes - 1 freq
dies - 7 freq
dues - 10 freq
deys - 9 freq
t's - 9 freq
daw's - 1 freq
towsie - 3 freq
doss - 5 freq
tozie - 3 freq
twyse - 1 freq
tae's - 5 freq
'tt's - 1 freq
'daisy - 2 freq
tae-us - 1 freq
deus - 18 freq
'does - 5 freq
douze - 1 freq
da''s - 1 freq
hts - 5 freq
dehs - 1 freq
dice - 13 freq
daze - 7 freq
tess - 1 freq
t'is - 6 freq
dyce - 5 freq
dae's - 3 freq
doozie - 1 freq
dows - 1 freq
t--s - 1 freq
toass - 2 freq
dess - 11 freq
'das - 1 freq
'tes - 1 freq
ïts - 16 freq
hïts - 1 freq
ït's - 16 freq
wüt's - 1 freq
daiys - 1 freq
di's - 1 freq
taws - 1 freq
doos' - 1 freq
tees - 3 freq
wytes - 3 freq
'da's - 2 freq
t'sae - 1 freq
doose - 6 freq
'du's - 4 freq
du's - 113 freq
'dice' - 1 freq
dis' - 3 freq
dus - 24 freq
'dis - 10 freq
des - 23 freq
dozey - 1 freq
tice - 3 freq
ds - 9 freq
dese - 6 freq
'ts - 3 freq
dous - 6 freq
tise - 3 freq
do's - 2 freq
dee's - 2 freq
doo's - 4 freq
dæs - 2 freq
tæs - 1 freq
tiso - 1 freq
dowse - 2 freq
dace - 4 freq
tass - 1 freq
dös - 1 freq
tss - 2 freq
deece - 1 freq
't's - 2 freq
tize - 4 freq
taas - 1 freq
dicey - 1 freq
deizie - 4 freq
daies - 2 freq
deezie - 1 freq
Ötzi - 1 freq
dss - 2 freq
daa's - 2 freq
tows - 2 freq
€™duys - 1 freq
duys - 1 freq
€˜dis - 3 freq
deas - 1 freq
dasy - 1 freq
€™tis - 1 freq
€”dees - 1 freq
dois - 3 freq
dow's - 2 freq
tyse - 1 freq
tyso - 1 freq
dys- - 1 freq
disa- - 1 freq
daiss - 2 freq
dizzie - 2 freq
toiys - 1 freq
dosy - 1 freq
'daes' - 1 freq
hyde's - 1 freq
tos - 3 freq
die's - 1 freq
daz - 1 freq
€œdis - 17 freq
doozy - 1 freq
€œtis - 2 freq
tues - 6 freq
€œdoes - 5 freq
ts'e - 3 freq
dooÂ’s - 3 freq
dayÂ’s - 2 freq
dsa - 1 freq
dzyy - 1 freq
dazzy - 1 freq
dis- - 1 freq
ti's - 1 freq
tiz - 3 freq
tezza - 1 freq
dz - 3 freq
tooÂ’s - 1 freq
deuce - 3 freq
daÂ’s - 2 freq
desi - 1 freq
deis - 1 freq
doozo - 1 freq
doucie - 1 freq
ytzo - 1 freq
dsi - 1 freq
duÂ’s - 1 freq
dissae - 1 freq
tzu - 1 freq
tes - 2 freq
tihz - 1 freq
dze - 1 freq
taes' - 1 freq
daysÂ’ - 1 freq
TOSS
Time to execute Levenshtein function - 0.609798 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.101931 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.092883 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.109518 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001279 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.