A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ��though in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
though (6) - 1 freq
though (6) - 1213 freq
athough (6) - 2 freq
although (6) - 1 freq
although (6) - 158 freq
'though (6) - 2 freq
though (6) - 1 freq
ulthough (6) - 1 freq
'thought (7) - 2 freq
through (7) - 2 freq
brakthrough (7) - 1 freq
hough (7) - 4 freq
oot-through (7) - 7 freq
thougt (7) - 1 freq
ootthrough (7) - 1 freq
trough (7) - 6 freq
breakthrough (7) - 3 freq
through (7) - 1 freq
thouch (7) - 12 freq
'tough (7) - 1 freq
tough (7) - 43 freq
through (7) - 1547 freq
borthouch (7) - 1 freq
chough (7) - 1 freq
thought (7) - 508 freq
'though (12) - 2 freq
ulthough (12) - 1 freq
though (12) - 1 freq
although (12) - 158 freq
though (12) - 1 freq
though (12) - 1213 freq
although (12) - 1 freq
athough (12) - 2 freq
chough (14) - 1 freq
borthouch (14) - 1 freq
tough (14) - 43 freq
thought (14) - 508 freq
through (14) - 1547 freq
thigh (14) - 17 freq
whitehaugh (14) - 1 freq
'tough (14) - 1 freq
larberthigh (14) - 1 freq
keek-through (14) - 1 freq
borthouch (14) - 2 freq
see-through (14) - 3 freq
brakthrough (14) - 1 freq
hough (14) - 4 freq
thouch (14) - 12 freq
'thought (14) - 2 freq
through (14) - 2 freq
SoundEx code - T200
this - 11303 freq
took - 1049 freq
'this - 73 freq
touch - 257 freq
though - 1213 freq
teach - 86 freq
those - 296 freq
these - 1129 freq
tak - 2834 freq
taks - 458 freq
thick - 227 freq
taes - 107 freq
take - 706 freq
tick - 69 freq
ties - 31 freq
task - 57 freq
twice - 188 freq
thase - 13 freq
teuk - 221 freq
tosh - 7 freq
tousie - 11 freq
teuch - 31 freq
take- - 1 freq
toays - 1 freq
this-ah - 1 freq
tea's - 4 freq
ts - 44 freq
tack - 43 freq
this- - 1 freq
twos - 4 freq
'tis - 9 freq
tac - 22 freq
twice- - 3 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
tuik - 313 freq
tyke - 27 freq
tassie - 37 freq
thus - 40 freq
twiggy - 1 freq
tsk - 2 freq
tackie - 2 freq
this' - 3 freq
tesco - 58 freq
tosie - 4 freq
taak - 94 freq
'tak - 24 freq
'these - 4 freq
tag - 32 freq
twoz - 1 freq
tuck - 26 freq
toi's - 1 freq
taz - 21 freq
taz's - 3 freq
'taz - 1 freq
taxi - 85 freq
tech - 8 freq
tissue - 6 freq
tease - 10 freq
tak' - 81 freq
twas - 16 freq
techie - 4 freq
tawse - 14 freq
toys - 45 freq
tazzy - 1 freq
twaes - 2 freq
tka - 1 freq
tusk - 5 freq
tizz - 1 freq
tizzy - 2 freq
t'wis - 2 freq
tawsie - 1 freq
'twis - 32 freq
tough - 43 freq
twis - 13 freq
teas - 3 freq
ticky - 2 freq
theek - 7 freq
thes - 12 freq
theez - 2 freq
thees - 4 freq
twig - 18 freq
thig - 3 freq
touzie - 3 freq
tacks - 3 freq
'tough - 1 freq
touchy - 9 freq
tusks - 7 freq
toc - 2 freq
toosie - 2 freq
tags - 6 freq
tchau - 1 freq
tax - 61 freq
t'was - 2 freq
togs - 1 freq
theik - 8 freq
takk - 89 freq
teeicks - 1 freq
tuk - 227 freq
tug - 11 freq
tucks - 6 freq
thigh - 17 freq
t's - 9 freq
towsie - 3 freq
thack - 2 freq
tig - 21 freq
tozie - 3 freq
tigs - 3 freq
tike - 2 freq
twyse - 1 freq
tae's - 5 freq
theis - 19 freq
tak's - 4 freq
teesh - 5 freq
tik - 4 freq
taka - 2 freq
thj - 1 freq
'tt's - 1 freq
ticks - 13 freq
tocks - 2 freq
ths - 5 freq
tae-us - 1 freq
tiks - 1 freq
toch - 1 freq
thusa - 1 freq
tashy - 2 freq
tash - 4 freq
takkke - 1 freq
tock - 18 freq
thak - 1 freq
teckie - 1 freq
twigs - 12 freq
'twa's - 1 freq
tke - 2 freq
tacki' - 1 freq
takeaway - 10 freq
tuco - 1 freq
tess - 1 freq
tic - 12 freq
'teach - 1 freq
'twes - 2 freq
twa's - 6 freq
thou's - 1 freq
t'is - 6 freq
tasks - 13 freq
teachee - 3 freq
thiz - 1 freq
tweak - 2 freq
thug - 6 freq
t--s - 1 freq
toass - 2 freq
thoch - 2 freq
toshie - 15 freq
tucky - 12 freq
tej - 1 freq
'tes - 1 freq
thïs - 233 freq
taich - 6 freq
ïts - 16 freq
tex - 25 freq
ït's - 16 freq
°tak - 1 freq
tyes - 1 freq
°these - 1 freq
thïs' - 1 freq
'thïs - 2 freq
taaks - 10 freq
thas - 2 freq
ta'k - 1 freq
taks' - 2 freq
thochy - 1 freq
tokyo - 2 freq
t'wus - 7 freq
tig's - 1 freq
taws - 1 freq
'take - 1 freq
tyaach - 1 freq
tees - 3 freq
t'sae - 1 freq
tagsy - 1 freq
tukk - 1 freq
twise - 11 freq
taeks - 1 freq
'this' - 2 freq
'those' - 1 freq
'teach' - 2 freq
'twas - 11 freq
tice - 3 freq
'though - 2 freq
thoche - 1 freq
'ts - 3 freq
towes - 6 freq
tiche - 1 freq
this-' - 1 freq
tise - 3 freq
tagsie - 1 freq
these' - 1 freq
'those - 1 freq
toogs - 6 freq
tö-tak - 1 freq
twaese - 3 freq
té-tak - 1 freq
takkaway - 2 freq
tæs - 1 freq
tuiks - 1 freq
tiso - 1 freq
tuaks - 1 freq
teeick - 2 freq
touchie - 30 freq
tch - 3 freq
taj - 2 freq
tuke - 1 freq
tass - 1 freq
toque - 1 freq
touk - 7 freq
thouch - 12 freq
tyeuch - 1 freq
tok - 2 freq
toks - 1 freq
thows - 1 freq
tweesh - 1 freq
tds - 3 freq
tss - 2 freq
td's - 3 freq
taik - 4 freq
't's - 2 freq
tøk - 3 freq
tize - 4 freq
tyoch - 1 freq
taas - 1 freq
tyeuk - 21 freq
toga - 2 freq
teech - 2 freq
thugs - 4 freq
taigs - 2 freq
Ötzi - 1 freq
tiggs - 1 freq
tush - 1 freq
tows - 2 freq
tak - 3 freq
tikk - 1 freq
teck - 1 freq
tycho - 2 freq
twas - 1 freq
this - 22 freq
tea-hoose - 1 freq
tis - 1 freq
tauk - 2 freq
thys - 1 freq
thik - 3 freq
this - 38 freq
takks - 4 freq
takk-aa - 1 freq
thaws - 1 freq
'takk - 1 freq
''twis - 1 freq
tich - 2 freq
thies - 1 freq
tak - 10 freq
tyse - 1 freq
teugs - 1 freq
twa-wyes - 1 freq
those - 6 freq
tyso - 1 freq
tax - 1 freq
ttish - 1 freq
these - 1 freq
tawk - 4 freq
take-away - 1 freq
tweaks - 1 freq
takawa - 2 freq
toke - 2 freq
take - 7 freq
take - 2 freq
toiys - 1 freq
texi - 1 freq
thooasa - 1 freq
tesk - 1 freq
tuek - 1 freq
tos - 3 freq
taki - 1 freq
taxi - 1 freq
though - 1 freq
tea-houss - 1 freq
tuggs - 1 freq
tugs - 1 freq
tuck - 1 freq
takk - 1 freq
this - 2 freq
tashi - 8 freq
these - 2 freq
those - 1 freq
taks - 1 freq
though - 1 freq
teugh - 1 freq
tis - 2 freq
thae's - 1 freq
tues - 6 freq
tushie - 4 freq
tushie - 4 freq
tuc - 1 freq
ts'e - 3 freq
twes - 1 freq
taech - 1 freq
tgi - 1 freq
tuggy - 1 freq
take - 1 freq
texhii - 2 freq
tckckw - 1 freq
taco - 1 freq
tc - 3 freq
tgzg - 1 freq
tjo - 2 freq
tju - 1 freq
tx - 3 freq
“this - 3 freq
tg - 7 freq
tjw - 1 freq
tq - 2 freq
tache - 1 freq
tux - 1 freq
thuck - 1 freq
thake - 1 freq
ti's - 1 freq
thsh - 1 freq
tickie - 1 freq
tikka - 2 freq
tiz - 3 freq
tak’s - 1 freq
tezza - 1 freq
tackawa - 1 freq
tzx - 1 freq
tgsu - 1 freq
tkee - 1 freq
tk - 3 freq
too’s - 1 freq
‘twas - 1 freq
tcq - 1 freq
tuoj - 1 freq
tcg - 1 freq
thc - 3 freq
tcsoa - 1 freq
teuchie - 1 freq
twix - 1 freq
tachy - 1 freq
tauq - 1 freq
ttdku - 1 freq
tqj - 1 freq
toq - 1 freq
tdzcqkx - 1 freq
thos - 1 freq
tha's - 3 freq
tzu - 1 freq
thzu - 1 freq
tooj - 1 freq
twqkz - 1 freq
tkz - 1 freq
tes - 2 freq
tihz - 1 freq
taes' - 1 freq
ttkq - 1 freq
ttj - 1 freq
tac' - 1 freq
teache - 1 freq
thicko - 1 freq
tj - 3 freq
tokie - 1 freq
”this - 1 freq
tdx - 1 freq
thcki - 1 freq
tyqq - 1 freq
MetaPhone code - 0
the - 157218 freq
they - 11452 freq
though - 1213 freq
thae - 1233 freq
tho - 1083 freq
'the - 355 freq
tha - 6295 freq
'they - 49 freq
'thae - 1 freq
thou - 95 freq
thay - 706 freq
th - 2479 freq
they' - 13 freq
thai - 445 freq
the- - 2 freq
'tho - 1 freq
thy - 97 freq
thow - 2 freq
they-eh - 1 freq
the' - 572 freq
tho' - 48 freq
th' - 107 freq
'the' - 6 freq
thi - 2576 freq
thay' - 1 freq
thee - 234 freq
thu - 23 freq
thee' - 1 freq
thei - 5 freq
thé - 1 freq
thaw - 11 freq
tha' - 11 freq
thoo - 277 freq
theiy - 6 freq
'they' - 2 freq
hythe - 2 freq
they'u - 1 freq
'tha - 16 freq
thé - 1 freq
'th - 1 freq
'th- - 1 freq
°tha - 1 freq
'th' - 1 freq
'thou' - 2 freq
'thee' - 2 freq
'thy' - 2 freq
thie - 8 freq
'though - 2 freq
thoo' - 1 freq
'thy - 3 freq
thoa - 4 freq
the - 177 freq
theh - 1 freq
the - 108 freq
tho - 2 freq
they - 24 freq
tho - 1 freq
th - 2 freq
thay - 2 freq
wyth - 1 freq
thew - 1 freq
the - 3 freq
the - 6 freq
the - 8 freq
they - 2 freq
thai - 6 freq
thae - 2 freq
they - 47 freq
thae - 6 freq
thay - 2 freq
thoo - 2 freq
thee - 1 freq
the - 4 freq
thoo - 30 freq
tha - 1 freq
the - 1 freq
thai - 2 freq
tha - 3 freq
though - 1 freq
theii - 1 freq
thy - 2 freq
though - 1 freq
they - 2 freq
thaa - 1 freq
‘the - 2 freq
theaa - 1 freq
thea - 1 freq
the… - 1 freq
the“i” - 1 freq
“the - 4 freq
“they - 1 freq
wth - 1 freq
hthy - 1 freq
theo - 1 freq
��THOUGH
Time to execute Levenshtein function - 0.183277 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361791 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030460 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037795 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.