A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to taaks in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
taaks (0) - 10 freq
taeks (1) - 1 freq
taks (1) - 458 freq
taakt (1) - 10 freq
haaks (1) - 2 freq
tasks (1) - 13 freq
tanks (1) - 32 freq
tacks (1) - 3 freq
takks (1) - 4 freq
taak (1) - 94 freq
taas (1) - 1 freq
talks (1) - 59 freq
tuaks (1) - 1 freq
waaks (1) - 5 freq
staaks (1) - 1 freq
walks (2) - 100 freq
wanks (2) - 1 freq
teams (2) - 84 freq
jaks (2) - 1 freq
tapis (2) - 1 freq
tans (2) - 5 freq
daars (2) - 1 freq
thars (2) - 2 freq
saiks (2) - 1 freq
trams (2) - 6 freq
taaks (0) - 10 freq
tuaks (1) - 1 freq
taeks (1) - 1 freq
taks (1) - 458 freq
staaks (2) - 1 freq
toks (2) - 1 freq
takes (2) - 137 freq
tuiks (2) - 1 freq
talks (2) - 59 freq
tiks (2) - 1 freq
waaks (2) - 5 freq
taas (2) - 1 freq
tasks (2) - 13 freq
haaks (2) - 2 freq
taakt (2) - 10 freq
tacks (2) - 3 freq
tanks (2) - 32 freq
taak (2) - 94 freq
takks (2) - 4 freq
tikes (3) - 1 freq
leaks (3) - 1 freq
toays (3) - 1 freq
tears (3) - 307 freq
tales (3) - 149 freq
acks (3) - 11 freq
SoundEx code - T200
this - 11303 freq
took - 1049 freq
'this - 73 freq
touch - 257 freq
though - 1213 freq
teach - 86 freq
those - 296 freq
these - 1129 freq
tak - 2834 freq
taks - 458 freq
thick - 227 freq
taes - 107 freq
take - 706 freq
tick - 69 freq
ties - 31 freq
task - 57 freq
twice - 188 freq
thase - 13 freq
teuk - 221 freq
tosh - 7 freq
tousie - 11 freq
teuch - 31 freq
take- - 1 freq
toays - 1 freq
this-ah - 1 freq
tea's - 4 freq
ts - 44 freq
tack - 43 freq
this- - 1 freq
twos - 4 freq
'tis - 9 freq
tac - 22 freq
twice- - 3 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
tuik - 313 freq
tyke - 27 freq
tassie - 37 freq
thus - 40 freq
twiggy - 1 freq
tsk - 2 freq
tackie - 2 freq
this' - 3 freq
tesco - 58 freq
tosie - 4 freq
taak - 94 freq
'tak - 24 freq
'these - 4 freq
tag - 32 freq
twoz - 1 freq
tuck - 26 freq
toi's - 1 freq
taz - 21 freq
taz's - 3 freq
'taz - 1 freq
taxi - 85 freq
tech - 8 freq
tissue - 6 freq
tease - 10 freq
tak' - 81 freq
twas - 16 freq
techie - 4 freq
tawse - 14 freq
toys - 45 freq
tazzy - 1 freq
twaes - 2 freq
tka - 1 freq
tusk - 5 freq
tizz - 1 freq
tizzy - 2 freq
t'wis - 2 freq
tawsie - 1 freq
'twis - 32 freq
tough - 43 freq
twis - 13 freq
teas - 3 freq
ticky - 2 freq
theek - 7 freq
thes - 12 freq
theez - 2 freq
thees - 4 freq
twig - 18 freq
thig - 3 freq
touzie - 3 freq
tacks - 3 freq
'tough - 1 freq
touchy - 9 freq
tusks - 7 freq
toc - 2 freq
toosie - 2 freq
tags - 6 freq
tchau - 1 freq
tax - 61 freq
t'was - 2 freq
togs - 1 freq
theik - 8 freq
takk - 89 freq
teeicks - 1 freq
tuk - 227 freq
tug - 11 freq
tucks - 6 freq
thigh - 17 freq
t's - 9 freq
towsie - 3 freq
thack - 2 freq
tig - 21 freq
tozie - 3 freq
tigs - 3 freq
tike - 2 freq
twyse - 1 freq
tae's - 5 freq
theis - 19 freq
tak's - 4 freq
teesh - 5 freq
tik - 4 freq
taka - 2 freq
thj - 1 freq
'tt's - 1 freq
ticks - 13 freq
tocks - 2 freq
ths - 5 freq
tae-us - 1 freq
tiks - 1 freq
toch - 1 freq
thusa - 1 freq
tashy - 2 freq
tash - 4 freq
takkke - 1 freq
tock - 18 freq
thak - 1 freq
teckie - 1 freq
twigs - 12 freq
'twa's - 1 freq
tke - 2 freq
tacki' - 1 freq
takeaway - 10 freq
tuco - 1 freq
tess - 1 freq
tic - 12 freq
'teach - 1 freq
'twes - 2 freq
twa's - 6 freq
thou's - 1 freq
t'is - 6 freq
tasks - 13 freq
teachee - 3 freq
thiz - 1 freq
tweak - 2 freq
thug - 6 freq
t--s - 1 freq
toass - 2 freq
thoch - 2 freq
toshie - 15 freq
tucky - 12 freq
tej - 1 freq
'tes - 1 freq
thïs - 233 freq
taich - 6 freq
ïts - 16 freq
tex - 25 freq
ït's - 16 freq
°tak - 1 freq
tyes - 1 freq
°these - 1 freq
thïs' - 1 freq
'thïs - 2 freq
taaks - 10 freq
thas - 2 freq
ta'k - 1 freq
taks' - 2 freq
thochy - 1 freq
tokyo - 2 freq
t'wus - 7 freq
tig's - 1 freq
taws - 1 freq
'take - 1 freq
tyaach - 1 freq
tees - 3 freq
t'sae - 1 freq
tagsy - 1 freq
tukk - 1 freq
twise - 11 freq
taeks - 1 freq
'this' - 2 freq
'those' - 1 freq
'teach' - 2 freq
'twas - 11 freq
tice - 3 freq
'though - 2 freq
thoche - 1 freq
'ts - 3 freq
towes - 6 freq
tiche - 1 freq
this-' - 1 freq
tise - 3 freq
tagsie - 1 freq
these' - 1 freq
'those - 1 freq
toogs - 6 freq
tö-tak - 1 freq
twaese - 3 freq
té-tak - 1 freq
takkaway - 2 freq
tæs - 1 freq
tuiks - 1 freq
tiso - 1 freq
tuaks - 1 freq
teeick - 2 freq
touchie - 30 freq
tch - 3 freq
taj - 2 freq
tuke - 1 freq
tass - 1 freq
toque - 1 freq
touk - 7 freq
thouch - 12 freq
tyeuch - 1 freq
tok - 2 freq
toks - 1 freq
thows - 1 freq
tweesh - 1 freq
tds - 3 freq
tss - 2 freq
td's - 3 freq
taik - 4 freq
't's - 2 freq
tøk - 3 freq
tize - 4 freq
tyoch - 1 freq
taas - 1 freq
tyeuk - 21 freq
toga - 2 freq
teech - 2 freq
thugs - 4 freq
taigs - 2 freq
Ötzi - 1 freq
tiggs - 1 freq
tush - 1 freq
tows - 2 freq
€˜tak - 3 freq
tikk - 1 freq
teck - 1 freq
tycho - 2 freq
€˜twas - 1 freq
€˜this - 22 freq
tea-hoose - 1 freq
€™tis - 1 freq
tauk - 2 freq
thys - 1 freq
thik - 3 freq
€œthis - 38 freq
takks - 4 freq
takk-aa - 1 freq
thaws - 1 freq
'takk - 1 freq
''twis - 1 freq
tich - 2 freq
thies - 1 freq
€œtak - 10 freq
tyse - 1 freq
teugs - 1 freq
twa-wyes - 1 freq
€˜those - 6 freq
tyso - 1 freq
€˜tax - 1 freq
ttish - 1 freq
€˜these - 1 freq
tawk - 4 freq
take-away - 1 freq
tweaks - 1 freq
takawa - 2 freq
toke - 2 freq
€œtake - 7 freq
€˜take - 2 freq
toiys - 1 freq
texi - 1 freq
thooasa - 1 freq
tesk - 1 freq
tuek - 1 freq
tos - 3 freq
taki - 1 freq
€˜taxi - 1 freq
€œthough - 1 freq
tea-houss - 1 freq
tuggs - 1 freq
tugs - 1 freq
€œtuck - 1 freq
€œtakk - 1 freq
€™this - 2 freq
tashi - 8 freq
€œthese - 2 freq
€œthose - 1 freq
€œtaks - 1 freq
€”though - 1 freq
teugh - 1 freq
€œtis - 2 freq
thae's - 1 freq
tues - 6 freq
tushie - 4 freq
€œtushie - 4 freq
tuc - 1 freq
ts'e - 3 freq
€™twes - 1 freq
taech - 1 freq
tgi - 1 freq
tuggy - 1 freq
€™take - 1 freq
texhii - 2 freq
tckckw - 1 freq
taco - 1 freq
tc - 3 freq
tgzg - 1 freq
tjo - 2 freq
tju - 1 freq
tx - 3 freq
“this - 3 freq
tg - 7 freq
tjw - 1 freq
tq - 2 freq
tache - 1 freq
tux - 1 freq
thuck - 1 freq
thake - 1 freq
ti's - 1 freq
thsh - 1 freq
tickie - 1 freq
tikka - 2 freq
tiz - 3 freq
takÂ’s - 1 freq
tezza - 1 freq
tackawa - 1 freq
tzx - 1 freq
tgsu - 1 freq
tkee - 1 freq
tk - 3 freq
tooÂ’s - 1 freq
‘twas - 1 freq
tcq - 1 freq
tuoj - 1 freq
tcg - 1 freq
thc - 3 freq
tcsoa - 1 freq
teuchie - 1 freq
twix - 1 freq
tachy - 1 freq
tauq - 1 freq
ttdku - 1 freq
tqj - 1 freq
toq - 1 freq
tdzcqkx - 1 freq
thos - 1 freq
tha's - 3 freq
tzu - 1 freq
thzu - 1 freq
tooj - 1 freq
twqkz - 1 freq
tkz - 1 freq
tes - 2 freq
tihz - 1 freq
taes' - 1 freq
ttkq - 1 freq
ttj - 1 freq
tac' - 1 freq
teache - 1 freq
thicko - 1 freq
tj - 3 freq
tokie - 1 freq
”this - 1 freq
tdx - 1 freq
thcki - 1 freq
tyqq - 1 freq
MetaPhone code - TKS
takes - 137 freq
dugs - 231 freq
taks - 458 freq
deuks - 33 freq
dug's - 37 freq
dykes - 44 freq
dogs - 45 freq
deeks - 2 freq
tyke's - 2 freq
dog's - 5 freq
taxi - 85 freq
ducks - 13 freq
dux - 3 freq
duck's - 2 freq
doxy - 1 freq
decks - 12 freq
tacks - 3 freq
dick's - 2 freq
docs - 3 freq
dooks - 9 freq
tykes - 7 freq
tags - 6 freq
dixie - 10 freq
tax - 61 freq
doggies - 1 freq
dukes - 9 freq
togs - 1 freq
teeicks - 1 freq
tucks - 6 freq
tigs - 3 freq
deck's - 2 freq
tak's - 4 freq
ticks - 13 freq
tocks - 2 freq
tiks - 1 freq
dics - 1 freq
dacs - 1 freq
dicks - 8 freq
tokes - 1 freq
digs - 19 freq
docks - 27 freq
doags - 6 freq
doag's - 6 freq
dokes - 3 freq
tex - 25 freq
taaks - 10 freq
taks' - 2 freq
deuk's - 2 freq
tig's - 1 freq
doug's - 10 freq
tagsy - 1 freq
tikes - 1 freq
daeks - 4 freq
taeks - 1 freq
'duck's - 1 freq
deks - 1 freq
tagsie - 1 freq
'deuks - 2 freq
toogs - 6 freq
tuiks - 1 freq
tuaks - 1 freq
dogs' - 1 freq
duke's - 7 freq
toks - 1 freq
dowgs - 3 freq
dowg's - 2 freq
ducks' - 1 freq
deukies - 1 freq
tagus - 1 freq
taigs - 2 freq
takkis - 1 freq
tiggs - 1 freq
togas - 3 freq
douks - 1 freq
dugs' - 2 freq
takks - 4 freq
teacosy - 2 freq
doks - 1 freq
teugs - 1 freq
dookies - 1 freq
€˜tax - 1 freq
dikes - 3 freq
texi - 1 freq
€˜taxi - 1 freq
tuggs - 1 freq
tugs - 1 freq
€œtaks - 1 freq
€˜takes - 1 freq
dakes - 1 freq
doacs - 1 freq
dx - 3 freq
tx - 3 freq
dgozo - 1 freq
yytks - 1 freq
dyckes - 1 freq
tux - 1 freq
takÂ’s - 1 freq
tgsu - 1 freq
deux - 1 freq
dicÂ’s - 1 freq
tcsoa - 1 freq
daxw - 1 freq
dax - 1 freq
tkz - 1 freq
TAAKS
Time to execute Levenshtein function - 0.289099 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.692251 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029273 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.078754 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001161 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.