A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to taka in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
taka (0) - 2 freq
taki (1) - 1 freq
takan (1) - 10 freq
takd (1) - 1 freq
takna (1) - 2 freq
tak (1) - 2834 freq
tara (1) - 16 freq
taks (1) - 458 freq
take (1) - 706 freq
tka (1) - 1 freq
aka (1) - 12 freq
tak' (1) - 81 freq
haka (1) - 1 freq
taa (1) - 3 freq
takk (1) - 89 freq
laa (2) - 114 freq
makan (2) - 21 freq
rake (2) - 31 freq
waks (2) - 8 freq
talk (2) - 496 freq
tar (2) - 26 freq
tale (2) - 300 freq
aks (2) - 18 freq
tare (2) - 4 freq
taeks (2) - 1 freq
taka (0) - 2 freq
tka (1) - 1 freq
tak (1) - 2834 freq
take (1) - 706 freq
taki (1) - 1 freq
tok (2) - 2 freq
tauk (2) - 2 freq
tke (2) - 2 freq
tike (2) - 2 freq
toke (2) - 2 freq
tuk (2) - 227 freq
taak (2) - 94 freq
tyke (2) - 27 freq
tik (2) - 4 freq
tuke (2) - 1 freq
taa (2) - 3 freq
tara (2) - 16 freq
takna (2) - 2 freq
takd (2) - 1 freq
takan (2) - 10 freq
taks (2) - 458 freq
aka (2) - 12 freq
takk (2) - 89 freq
taik (2) - 4 freq
haka (2) - 1 freq
SoundEx code - T200
this - 11303 freq
took - 1049 freq
'this - 73 freq
touch - 257 freq
though - 1213 freq
teach - 86 freq
those - 296 freq
these - 1129 freq
tak - 2834 freq
taks - 458 freq
thick - 227 freq
taes - 107 freq
take - 706 freq
tick - 69 freq
ties - 31 freq
task - 57 freq
twice - 188 freq
thase - 13 freq
teuk - 221 freq
tosh - 7 freq
tousie - 11 freq
teuch - 31 freq
take- - 1 freq
toays - 1 freq
this-ah - 1 freq
tea's - 4 freq
ts - 44 freq
tack - 43 freq
this- - 1 freq
twos - 4 freq
'tis - 9 freq
tac - 22 freq
twice- - 3 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
tuik - 313 freq
tyke - 27 freq
tassie - 37 freq
thus - 40 freq
twiggy - 1 freq
tsk - 2 freq
tackie - 2 freq
this' - 3 freq
tesco - 58 freq
tosie - 4 freq
taak - 94 freq
'tak - 24 freq
'these - 4 freq
tag - 32 freq
twoz - 1 freq
tuck - 26 freq
toi's - 1 freq
taz - 21 freq
taz's - 3 freq
'taz - 1 freq
taxi - 85 freq
tech - 8 freq
tissue - 6 freq
tease - 10 freq
tak' - 81 freq
twas - 16 freq
techie - 4 freq
tawse - 14 freq
toys - 45 freq
tazzy - 1 freq
twaes - 2 freq
tka - 1 freq
tusk - 5 freq
tizz - 1 freq
tizzy - 2 freq
t'wis - 2 freq
tawsie - 1 freq
'twis - 32 freq
tough - 43 freq
twis - 13 freq
teas - 3 freq
ticky - 2 freq
theek - 7 freq
thes - 12 freq
theez - 2 freq
thees - 4 freq
twig - 18 freq
thig - 3 freq
touzie - 3 freq
tacks - 3 freq
'tough - 1 freq
touchy - 9 freq
tusks - 7 freq
toc - 2 freq
toosie - 2 freq
tags - 6 freq
tchau - 1 freq
tax - 61 freq
t'was - 2 freq
togs - 1 freq
theik - 8 freq
takk - 89 freq
teeicks - 1 freq
tuk - 227 freq
tug - 11 freq
tucks - 6 freq
thigh - 17 freq
t's - 9 freq
towsie - 3 freq
thack - 2 freq
tig - 21 freq
tozie - 3 freq
tigs - 3 freq
tike - 2 freq
twyse - 1 freq
tae's - 5 freq
theis - 19 freq
tak's - 4 freq
teesh - 5 freq
tik - 4 freq
taka - 2 freq
thj - 1 freq
'tt's - 1 freq
ticks - 13 freq
tocks - 2 freq
ths - 5 freq
tae-us - 1 freq
tiks - 1 freq
toch - 1 freq
thusa - 1 freq
tashy - 2 freq
tash - 4 freq
takkke - 1 freq
tock - 18 freq
thak - 1 freq
teckie - 1 freq
twigs - 12 freq
'twa's - 1 freq
tke - 2 freq
tacki' - 1 freq
takeaway - 10 freq
tuco - 1 freq
tess - 1 freq
tic - 12 freq
'teach - 1 freq
'twes - 2 freq
twa's - 6 freq
thou's - 1 freq
t'is - 6 freq
tasks - 13 freq
teachee - 3 freq
thiz - 1 freq
tweak - 2 freq
thug - 6 freq
t--s - 1 freq
toass - 2 freq
thoch - 2 freq
toshie - 15 freq
tucky - 12 freq
tej - 1 freq
'tes - 1 freq
thïs - 233 freq
taich - 6 freq
ïts - 16 freq
tex - 25 freq
ït's - 16 freq
°tak - 1 freq
tyes - 1 freq
°these - 1 freq
thïs' - 1 freq
'thïs - 2 freq
taaks - 10 freq
thas - 2 freq
ta'k - 1 freq
taks' - 2 freq
thochy - 1 freq
tokyo - 2 freq
t'wus - 7 freq
tig's - 1 freq
taws - 1 freq
'take - 1 freq
tyaach - 1 freq
tees - 3 freq
t'sae - 1 freq
tagsy - 1 freq
tukk - 1 freq
twise - 11 freq
taeks - 1 freq
'this' - 2 freq
'those' - 1 freq
'teach' - 2 freq
'twas - 11 freq
tice - 3 freq
'though - 2 freq
thoche - 1 freq
'ts - 3 freq
towes - 6 freq
tiche - 1 freq
this-' - 1 freq
tise - 3 freq
tagsie - 1 freq
these' - 1 freq
'those - 1 freq
toogs - 6 freq
tö-tak - 1 freq
twaese - 3 freq
té-tak - 1 freq
takkaway - 2 freq
tæs - 1 freq
tuiks - 1 freq
tiso - 1 freq
tuaks - 1 freq
teeick - 2 freq
touchie - 30 freq
tch - 3 freq
taj - 2 freq
tuke - 1 freq
tass - 1 freq
toque - 1 freq
touk - 7 freq
thouch - 12 freq
tyeuch - 1 freq
tok - 2 freq
toks - 1 freq
thows - 1 freq
tweesh - 1 freq
tds - 3 freq
tss - 2 freq
td's - 3 freq
taik - 4 freq
't's - 2 freq
tøk - 3 freq
tize - 4 freq
tyoch - 1 freq
taas - 1 freq
tyeuk - 21 freq
toga - 2 freq
teech - 2 freq
thugs - 4 freq
taigs - 2 freq
Ötzi - 1 freq
tiggs - 1 freq
tush - 1 freq
tows - 2 freq
€˜tak - 3 freq
tikk - 1 freq
teck - 1 freq
tycho - 2 freq
€˜twas - 1 freq
€˜this - 22 freq
tea-hoose - 1 freq
€™tis - 1 freq
tauk - 2 freq
thys - 1 freq
thik - 3 freq
€œthis - 38 freq
takks - 4 freq
takk-aa - 1 freq
thaws - 1 freq
'takk - 1 freq
''twis - 1 freq
tich - 2 freq
thies - 1 freq
€œtak - 10 freq
tyse - 1 freq
teugs - 1 freq
twa-wyes - 1 freq
€˜those - 6 freq
tyso - 1 freq
€˜tax - 1 freq
ttish - 1 freq
€˜these - 1 freq
tawk - 4 freq
take-away - 1 freq
tweaks - 1 freq
takawa - 2 freq
toke - 2 freq
€œtake - 7 freq
€˜take - 2 freq
toiys - 1 freq
texi - 1 freq
thooasa - 1 freq
tesk - 1 freq
tuek - 1 freq
tos - 3 freq
taki - 1 freq
€˜taxi - 1 freq
€œthough - 1 freq
tea-houss - 1 freq
tuggs - 1 freq
tugs - 1 freq
€œtuck - 1 freq
€œtakk - 1 freq
€™this - 2 freq
tashi - 8 freq
€œthese - 2 freq
€œthose - 1 freq
€œtaks - 1 freq
€”though - 1 freq
teugh - 1 freq
€œtis - 2 freq
thae's - 1 freq
tues - 6 freq
tushie - 4 freq
€œtushie - 4 freq
tuc - 1 freq
ts'e - 3 freq
€™twes - 1 freq
taech - 1 freq
tgi - 1 freq
tuggy - 1 freq
€™take - 1 freq
texhii - 2 freq
tckckw - 1 freq
taco - 1 freq
tc - 3 freq
tgzg - 1 freq
tjo - 2 freq
tju - 1 freq
tx - 3 freq
“this - 3 freq
tg - 7 freq
tjw - 1 freq
tq - 2 freq
tache - 1 freq
tux - 1 freq
thuck - 1 freq
thake - 1 freq
ti's - 1 freq
thsh - 1 freq
tickie - 1 freq
tikka - 2 freq
tiz - 3 freq
takÂ’s - 1 freq
tezza - 1 freq
tackawa - 1 freq
tzx - 1 freq
tgsu - 1 freq
tkee - 1 freq
tk - 3 freq
tooÂ’s - 1 freq
‘twas - 1 freq
tcq - 1 freq
tuoj - 1 freq
tcg - 1 freq
thc - 3 freq
tcsoa - 1 freq
teuchie - 1 freq
twix - 1 freq
tachy - 1 freq
tauq - 1 freq
ttdku - 1 freq
tqj - 1 freq
toq - 1 freq
tdzcqkx - 1 freq
thos - 1 freq
tha's - 3 freq
tzu - 1 freq
thzu - 1 freq
tooj - 1 freq
twqkz - 1 freq
tkz - 1 freq
tes - 2 freq
tihz - 1 freq
taes' - 1 freq
ttkq - 1 freq
ttj - 1 freq
tac' - 1 freq
teache - 1 freq
thicko - 1 freq
tj - 3 freq
tokie - 1 freq
”this - 1 freq
tdx - 1 freq
thcki - 1 freq
tyqq - 1 freq
MetaPhone code - TK
took - 1049 freq
dyke - 143 freq
tak - 2834 freq
take - 706 freq
dig - 85 freq
tick - 69 freq
dog - 157 freq
teuk - 221 freq
dug - 576 freq
take- - 1 freq
duck - 60 freq
tack - 43 freq
tac - 22 freq
decay - 11 freq
deek - 44 freq
dag - 2 freq
'doc - 1 freq
'doc' - 1 freq
tuik - 313 freq
tyke - 27 freq
tackie - 2 freq
deck - 70 freq
dick - 78 freq
taak - 94 freq
'tak - 24 freq
deiq - 1 freq
tag - 32 freq
dick' - 3 freq
tuck - 26 freq
dicky - 2 freq
tak' - 81 freq
deuk - 47 freq
douk - 8 freq
tka - 1 freq
ticky - 2 freq
dugg - 3 freq
dickie - 4 freq
toc - 2 freq
'duck'' - 1 freq
duke - 38 freq
dook - 32 freq
takk - 89 freq
tuk - 227 freq
dickey - 4 freq
tug - 11 freq
tig - 21 freq
tike - 2 freq
tik - 4 freq
taka - 2 freq
takkke - 1 freq
tock - 18 freq
dock - 32 freq
teckie - 1 freq
dake - 4 freq
doggie - 3 freq
tke - 2 freq
tacki' - 1 freq
'dog - 2 freq
doggy - 5 freq
tuco - 1 freq
doc - 14 freq
doac - 19 freq
tic - 12 freq
deukie - 19 freq
doag - 53 freq
tucky - 12 freq
doke - 3 freq
dac - 1 freq
°tak - 1 freq
dïg - 2 freq
ta'k - 1 freq
duck' - 1 freq
'duck' - 1 freq
dug' - 2 freq
doog - 1 freq
'take - 1 freq
doug - 81 freq
'doug - 2 freq
tukk - 1 freq
daek - 18 freq
dq - 7 freq
dike - 1 freq
teeick - 2 freq
tuke - 1 freq
toque - 1 freq
touk - 7 freq
tok - 2 freq
dowg - 3 freq
dueck - 1 freq
taik - 4 freq
tøk - 3 freq
'dug' - 1 freq
dog' - 1 freq
dokey - 1 freq
deuky - 1 freq
toga - 2 freq
deik - 1 freq
duick - 1 freq
€˜tak - 3 freq
tikk - 1 freq
teck - 1 freq
€˜dug - 4 freq
€˜doag - 2 freq
tauk - 2 freq
dc - 11 freq
duc - 2 freq
takk-aa - 1 freq
'takk - 1 freq
€œtak - 10 freq
dok - 1 freq
doaggie - 1 freq
doc- - 2 freq
dec - 9 freq
€œdoag - 1 freq
ducky - 7 freq
€œducky - 1 freq
duggy - 1 freq
tawk - 4 freq
diego - 3 freq
deco - 1 freq
toke - 2 freq
€œtake - 7 freq
€˜take - 2 freq
€˜doug - 3 freq
tuek - 1 freq
taki - 1 freq
€œtuck - 1 freq
€œtakk - 1 freq
dic - 1 freq
dk - 4 freq
tuc - 1 freq
tuggy - 1 freq
€™take - 1 freq
taco - 1 freq
'dag' - 1 freq
tc - 3 freq
yhdg - 1 freq
dqi - 1 freq
decoy - 1 freq
ytk - 1 freq
tg - 7 freq
dawg - 2 freq
dqq - 1 freq
tq - 2 freq
duk - 2 freq
dyc - 1 freq
ddg - 1 freq
tickie - 1 freq
tikka - 2 freq
tkee - 1 freq
tk - 3 freq
deeko - 1 freq
dg - 19 freq
dgw - 1 freq
deg - 2 freq
tauq - 1 freq
dcu - 1 freq
dgg - 1 freq
toq - 1 freq
dqh - 1 freq
'dook' - 1 freq
dak - 1 freq
tac' - 1 freq
deeky - 1 freq
daawg - 1 freq
tokie - 1 freq
wtg - 1 freq
hdq - 1 freq
tyqq - 1 freq
TAKA
Time to execute Levenshtein function - 0.195411 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.372912 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028897 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039894 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001006 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.