A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to tax in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
tax (0) - 60 freq
ta (1) - 2534 freq
ta- (1) - 1 freq
tat (1) - 16 freq
tzx (1) - 1 freq
tao (1) - 2 freq
ax (1) - 76 freq
tab (1) - 6 freq
tar (1) - 26 freq
lax (1) - 2 freq
taa (1) - 3 freq
tak (1) - 2818 freq
taj (1) - 2 freq
tdx (1) - 1 freq
tux (1) - 1 freq
rax (1) - 79 freq
taxt (1) - 3 freq
tay (1) - 185 freq
tal (1) - 6 freq
taf (1) - 1 freq
pax (1) - 3 freq
tau (1) - 2 freq
tag (1) - 32 freq
bax (1) - 2 freq
wax (1) - 33 freq
Double Levenshtein
tax (0) - 60 freq
tx (1) - 3 freq
tux (1) - 1 freq
tex (1) - 25 freq
taxi (1) - 84 freq
sax (2) - 242 freq
trax (2) - 10 freq
tap (2) - 757 freq
wax (2) - 33 freq
taw (2) - 9 freq
tai (2) - 1 freq
tan (2) - 48 freq
tae (2) - 64038 freq
tah (2) - 2 freq
tad (2) - 54 freq
texi (2) - 1 freq
fax (2) - 9 freq
dax (2) - 1 freq
bax (2) - 2 freq
tam (2) - 519 freq
max (2) - 31 freq
tac (2) - 18 freq
taz (2) - 21 freq
ax (2) - 76 freq
tab (2) - 6 freq
SoundEx code - T200
this - 11106 freq
took - 1034 freq
'this - 71 freq
touch - 253 freq
though - 1167 freq
teach - 84 freq
those - 286 freq
these - 1110 freq
tak - 2818 freq
taks - 455 freq
thick - 218 freq
taes - 104 freq
take - 676 freq
tick - 69 freq
ties - 29 freq
task - 57 freq
twice - 183 freq
thase - 13 freq
teuk - 221 freq
tosh - 7 freq
tousie - 11 freq
teuch - 31 freq
take- - 1 freq
toays - 1 freq
this-ah - 1 freq
tea's - 4 freq
ts - 41 freq
tack - 43 freq
this- - 1 freq
twos - 4 freq
'tis - 9 freq
tac - 18 freq
twice- - 3 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
tuik - 313 freq
tyke - 27 freq
tassie - 35 freq
thus - 40 freq
twiggy - 1 freq
tsk - 2 freq
tackie - 2 freq
this' - 3 freq
tesco - 58 freq
tosie - 4 freq
taak - 94 freq
'tak - 24 freq
'these - 4 freq
tag - 32 freq
twoz - 1 freq
tuck - 26 freq
toi's - 1 freq
taz - 21 freq
taz's - 3 freq
'taz - 1 freq
taxi - 84 freq
tech - 8 freq
tissue - 6 freq
tease - 10 freq
tak' - 79 freq
twas - 16 freq
techie - 4 freq
tawse - 13 freq
toys - 42 freq
tazzy - 1 freq
twaes - 2 freq
tka - 1 freq
tusk - 5 freq
tizz - 1 freq
tizzy - 2 freq
t'wis - 2 freq
tawsie - 1 freq
'twis - 30 freq
tough - 38 freq
twis - 13 freq
teas - 3 freq
ticky - 2 freq
theek - 6 freq
thes - 12 freq
theez - 2 freq
thees - 4 freq
twig - 18 freq
thig - 3 freq
touzie - 3 freq
tacks - 3 freq
'tough - 1 freq
touchy - 8 freq
tusks - 7 freq
toc - 2 freq
toosie - 2 freq
tags - 6 freq
tchau - 1 freq
tax - 60 freq
t'was - 2 freq
togs - 1 freq
theik - 8 freq
takk - 89 freq
teeicks - 1 freq
tuk - 227 freq
tug - 11 freq
tucks - 6 freq
thigh - 16 freq
t's - 9 freq
towsie - 3 freq
thack - 2 freq
tig - 18 freq
tozie - 3 freq
tigs - 2 freq
tike - 2 freq
twyse - 1 freq
tae's - 5 freq
theis - 19 freq
tak's - 3 freq
teesh - 5 freq
tik - 4 freq
taka - 2 freq
thj - 1 freq
'tt's - 2 freq
ticks - 13 freq
tocks - 3 freq
ths - 5 freq
tae-us - 1 freq
twigs - 12 freq
'twa's - 1 freq
tke - 2 freq
tacki' - 1 freq
takeaway - 10 freq
tuco - 1 freq
tash - 3 freq
tess - 1 freq
tic - 12 freq
'teach - 1 freq
'twes - 2 freq
twa's - 6 freq
thou's - 1 freq
t'is - 6 freq
tasks - 13 freq
teachee - 3 freq
thiz - 1 freq
tweak - 2 freq
thug - 6 freq
t--s - 1 freq
toass - 2 freq
thoch - 2 freq
toshie - 15 freq
tucky - 12 freq
tej - 1 freq
'tes - 1 freq
thïs - 233 freq
taich - 6 freq
ïts - 16 freq
tex - 25 freq
ït's - 16 freq
°tak - 1 freq
tyes - 1 freq
°these - 1 freq
thïs' - 1 freq
'thïs - 2 freq
taaks - 10 freq
thas - 2 freq
ta'k - 1 freq
taks' - 2 freq
thochy - 1 freq
tokyo - 2 freq
t'wus - 7 freq
tig's - 1 freq
taws - 1 freq
'take - 1 freq
tyaach - 1 freq
tees - 3 freq
t'sae - 1 freq
tagsy - 1 freq
tukk - 1 freq
tock - 17 freq
twise - 11 freq
taeks - 1 freq
'this' - 2 freq
'those' - 1 freq
'teach' - 2 freq
'twas - 11 freq
tice - 3 freq
'though - 2 freq
thoche - 1 freq
'ts - 3 freq
towes - 6 freq
tiche - 1 freq
this-' - 1 freq
tise - 3 freq
tagsie - 1 freq
these' - 1 freq
'those - 1 freq
toogs - 6 freq
tö-tak - 1 freq
twaese - 3 freq
té-tak - 1 freq
takkaway - 2 freq
tæs - 1 freq
tuiks - 1 freq
tiso - 1 freq
tuaks - 1 freq
teeick - 2 freq
touchie - 30 freq
tch - 3 freq
taj - 2 freq
tuke - 1 freq
tass - 1 freq
toque - 1 freq
touk - 7 freq
thouch - 12 freq
tyeuch - 1 freq
tok - 2 freq
toks - 1 freq
thows - 1 freq
tweesh - 1 freq
tds - 3 freq
tss - 2 freq
td's - 3 freq
taik - 4 freq
't's - 2 freq
tøk - 3 freq
tize - 4 freq
tyoch - 1 freq
taas - 1 freq
tyeuk - 21 freq
toga - 2 freq
teech - 2 freq
thugs - 4 freq
taigs - 2 freq
Ötzi - 1 freq
tiggs - 1 freq
tush - 1 freq
tows - 2 freq
‘tak - 3 freq
tikk - 1 freq
teck - 1 freq
tycho - 2 freq
‘twas - 1 freq
‘this - 22 freq
tea-hoose - 1 freq
’tis - 1 freq
tauk - 2 freq
thys - 1 freq
thik - 3 freq
“this - 38 freq
takks - 4 freq
takk-aa - 1 freq
thaws - 1 freq
'takk - 1 freq
''twis - 1 freq
tich - 2 freq
thies - 1 freq
“tak - 10 freq
tyse - 1 freq
teugs - 1 freq
twa-wyes - 1 freq
‘those - 6 freq
tyso - 1 freq
‘tax - 1 freq
ttish - 1 freq
‘these - 1 freq
tawk - 4 freq
take-away - 1 freq
tweaks - 1 freq
takawa - 2 freq
toke - 2 freq
“take - 7 freq
‘take - 2 freq
toiys - 1 freq
texi - 1 freq
thooasa - 1 freq
tesk - 1 freq
tuek - 1 freq
tos - 3 freq
taki - 1 freq
‘taxi - 1 freq
“though - 1 freq
tea-houss - 1 freq
tuggs - 1 freq
tugs - 1 freq
“tuck - 1 freq
“takk - 1 freq
’this - 2 freq
tashi - 8 freq
“these - 2 freq
“those - 1 freq
“taks - 1 freq
—though - 1 freq
teugh - 1 freq
“tis - 2 freq
thae's - 1 freq
tues - 6 freq
tushie - 4 freq
“tushie - 4 freq
tuc - 1 freq
ts'e - 3 freq
’twes - 1 freq
taech - 1 freq
tgi - 1 freq
tuggy - 1 freq
’take - 1 freq
texhii - 2 freq
tckckw - 1 freq
taco - 1 freq
tc - 3 freq
tgzg - 1 freq
tjo - 2 freq
tju - 1 freq
tx - 3 freq
“this - 3 freq
tg - 7 freq
tjw - 1 freq
tq - 2 freq
tache - 1 freq
tux - 1 freq
thuck - 1 freq
thake - 1 freq
ti's - 1 freq
thsh - 1 freq
tickie - 1 freq
tikka - 2 freq
tiz - 3 freq
tak’s - 1 freq
tezza - 1 freq
tackawa - 1 freq
tzx - 1 freq
tgsu - 1 freq
tkee - 1 freq
tk - 3 freq
too’s - 1 freq
‘twas - 1 freq
tcq - 1 freq
tuoj - 1 freq
tcg - 1 freq
thc - 3 freq
tcsoa - 1 freq
teuchie - 1 freq
twix - 1 freq
tachy - 1 freq
tauq - 1 freq
ttdku - 1 freq
tqj - 1 freq
toq - 1 freq
tdzcqkx - 1 freq
thos - 1 freq
tha's - 3 freq
tzu - 1 freq
thzu - 1 freq
tooj - 1 freq
twqkz - 1 freq
tkz - 1 freq
tes - 2 freq
tihz - 1 freq
taes' - 1 freq
ttkq - 1 freq
ttj - 1 freq
tac' - 1 freq
teache - 1 freq
thicko - 1 freq
tj - 3 freq
tokie - 1 freq
”this - 1 freq
tdx - 1 freq
thcki - 1 freq
tyqq - 1 freq
MetaPhone code - TKS
takes - 125 freq
dugs - 228 freq
taks - 455 freq
deuks - 33 freq
dug's - 35 freq
dykes - 44 freq
dogs - 44 freq
deeks - 2 freq
tyke's - 2 freq
dog's - 5 freq
taxi - 84 freq
ducks - 13 freq
dux - 3 freq
duck's - 2 freq
doxy - 1 freq
decks - 12 freq
tacks - 3 freq
dick's - 2 freq
docs - 3 freq
dooks - 9 freq
tykes - 7 freq
tags - 6 freq
dixie - 10 freq
tax - 60 freq
doggies - 1 freq
dukes - 9 freq
togs - 1 freq
teeicks - 1 freq
tucks - 6 freq
tigs - 2 freq
deck's - 2 freq
tak's - 3 freq
ticks - 13 freq
tocks - 3 freq
dacs - 1 freq
dicks - 8 freq
tokes - 1 freq
digs - 19 freq
docks - 27 freq
doags - 6 freq
doag's - 6 freq
dokes - 3 freq
tex - 25 freq
taaks - 10 freq
taks' - 2 freq
deuk's - 2 freq
tig's - 1 freq
doug's - 10 freq
tagsy - 1 freq
tikes - 1 freq
daeks - 4 freq
taeks - 1 freq
'duck's - 1 freq
deks - 1 freq
tagsie - 1 freq
'deuks - 2 freq
toogs - 6 freq
tuiks - 1 freq
tuaks - 1 freq
dogs' - 1 freq
duke's - 7 freq
toks - 1 freq
dowgs - 3 freq
dowg's - 2 freq
ducks' - 1 freq
deukies - 1 freq
tagus - 1 freq
taigs - 2 freq
takkis - 1 freq
tiggs - 1 freq
togas - 3 freq
douks - 1 freq
dugs' - 2 freq
takks - 4 freq
teacosy - 2 freq
doks - 1 freq
teugs - 1 freq
dookies - 1 freq
‘tax - 1 freq
dikes - 3 freq
texi - 1 freq
‘taxi - 1 freq
tuggs - 1 freq
tugs - 1 freq
“taks - 1 freq
‘takes - 1 freq
dakes - 1 freq
doacs - 1 freq
dx - 3 freq
tx - 3 freq
dgozo - 1 freq
yytks - 1 freq
dyckes - 1 freq
tux - 1 freq
tak’s - 1 freq
tgsu - 1 freq
deux - 1 freq
dic’s - 1 freq
tcsoa - 1 freq
daxw - 1 freq
dax - 1 freq
tkz - 1 freq
TAX
Time to execute Levenshtein function - 0.175384 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
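As a rough illustration of that definition, here is a minimal Python sketch of the distance using the usual dynamic-programming table, kept one row at a time. It is not the code this site runs; the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a, or keep it if equal
            ))
        previous = current
    return previous[-1]


for word in ["tax", "tak", "taxi", "lax"]:
    print(word, levenshtein("tax", word))
# tax 0
# tak 1
# taxi 1
# lax 1
```

These four values agree with the distances listed for tax, tak, taxi and lax in the Levenshtein results above.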
Time to execute Double Levenshtein function - 0.380497 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with their vowels removed, and adds the two distances together, giving double weight to consonants.
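The idea can be sketched in a few lines of Python. This is only an approximation of what the page describes: the names `double_levenshtein` and `strip_vowels` are made up for the example, and the vowel set (a, e, i, o, u) is an assumption, since the page does not say which letters count as vowels.

```python
VOWELS = set("aeiou")  # assumption: the page does not define its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one table row at a time."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving only the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["tex", "taxi", "sax", "tae"]:
    print(word, double_levenshtein("tax", word))
# tex 1
# taxi 1
# sax 2
# tae 2
```

Under these assumptions a vowel change such as tax to tex costs 1, while a consonant change such as tax to sax costs 2, which is in line with the scores shown in the Double Levenshtein results for those words.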
Time to execute SoundEx function - 0.027902 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
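For readers who want to see how such a code is derived, below is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse runs of the same digit, and pad to four characters. The function name `soundex` is chosen for the example, and ignoring every character outside a to z is an assumption of this sketch, not something the page states.

```python
SOUNDEX_GROUPS = [
    ("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
    ("l", "4"), ("mn", "5"), ("r", "6"),
]
SOUNDEX_CODES = {letter: digit
                 for letters, digit in SOUNDEX_GROUPS
                 for letter in letters}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no a-z letters."""
    # Assumption: apostrophes, hyphens and non-ASCII letters are skipped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of equal digits
        digit = SOUNDEX_CODES.get(ch, "")  # vowels and y have no digit
        if digit and digit != last:
            code += digit
        last = digit  # a vowel resets the run, so a repeated digit counts again
        if len(code) == 4:
            break
    return code.ljust(4, "0")


for word in ["tax", "this", "task", "tesco"]:
    print(word, soundex(word))
# tax T200
# this T200
# task T200
# tesco T200
```

All four words come out as T200, the code shown at the head of the SoundEx results above, which is why such different-looking words end up in the same list.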
Time to execute MetaPhone function - 0.036759 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000806 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
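A lookup of this kind can be sketched as a plain dictionary. Everything in the example is hypothetical: the entries in `LEXICON`, the upper-case lemma convention and the function names `lemma_for` and `variants_of` are invented for illustration, since the real lexicon is not shown on this page.

```python
from typing import List, Optional

# Hypothetical entries for illustration only; not taken from the real lexicon.
LEXICON = {
    "tax": "TAX",
    "tak": "TAK",
    "taks": "TAK",
    "takk": "TAK",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> List[str]:
    """Return every spelling the lexicon links to the given lemma, sorted."""
    return sorted(w for w, target in LEXICON.items() if target == lemma)


print(lemma_for("tax"))
print(lemma_for("sonsie"))
print(variants_of("TAK"))
```

A dictionary lookup takes constant time regardless of corpus size, which fits with this method having the shortest execution time of the five. The `None` result stands for the case where a word is not covered.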