A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to hïts in Corpus

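Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. For the two Levenshtein methods, the number in parentheses is the distance from hïts.
Levenshtein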
hïts (0) - 1 freq
fïts (1) - 1 freq
ïts (1) - 16 freq
hïs (1) - 331 freq
sïts (1) - 1 freq
bïts (1) - 7 freq
hats (2) - 46 freq
sït (2) - 10 freq
houts (2) - 1 freq
lït (2) - 2 freq
sïns (2) - 12 freq
kïss (2) - 3 freq
böts (2) - 2 freq
haets (2) - 2 freq
bøts (2) - 1 freq
hits (2) - 150 freq
fït (2) - 17 freq
hots (2) - 1 freq
kïsts (2) - 1 freq
halts (2) - 1 freq
hïd (2) - 6 freq
hïlls (2) - 4 freq
huts (2) - 7 freq
røts (2) - 2 freq
bïds (2) - 1 freq
Double Levenshtein
hïts (0) - 1 freq
sïts (2) - 1 freq
hïs (2) - 331 freq
bïts (2) - 7 freq
ïts (2) - 16 freq
fïts (2) - 1 freq
fïgs (4) - 3 freq
ïs (4) - 260 freq
hïm (4) - 575 freq
heats (4) - 3 freq
hauts (4) - 1 freq
pït (4) - 1 freq
hints (4) - 9 freq
pïgs (4) - 7 freq
hants (4) - 1 freq
röts (4) - 2 freq
hts (4) - 5 freq
herts (4) - 119 freq
hosts (4) - 13 freq
hoots (4) - 11 freq
poiïts (4) - 1 freq
cïties (4) - 2 freq
ït's (4) - 16 freq
hilts (4) - 1 freq
shïfts (4) - 1 freq
SoundEx code - H320
heids - 465 freq
hedge - 41 freq
hate-c - 2 freq
heid's - 41 freq
hedgie - 3 freq
heidache - 7 freq
hauds - 123 freq
hotch - 5 freq
hates - 36 freq
hitch - 3 freq
hits - 150 freq
heads - 39 freq
hotdogs - 3 freq
hides - 14 freq
'hoots - 2 freq
hoods - 6 freq
heeds - 29 freq
heid-gie - 1 freq
heidy's - 1 freq
huds - 6 freq
hauts - 1 freq
haddock - 14 freq
hadg - 1 freq
hideous - 7 freq
houts - 1 freq
hoodies - 5 freq
hid's - 202 freq
hids - 86 freq
hats - 46 freq
hatch - 12 freq
hat's - 3 freq
heed's - 6 freq
huts - 7 freq
haddies - 2 freq
heats - 3 freq
hoots - 11 freq
hts - 5 freq
howdie's - 1 freq
hideyoshi - 2 freq
hit's - 280 freq
heidie's - 5 freq
heidies - 4 freq
hood's - 1 freq
het-hoose - 1 freq
het's - 5 freq
haeds - 6 freq
haed's - 1 freq
hoatch - 1 freq
haitts - 1 freq
hïts - 1 freq
heathaze - 1 freq
heedge - 1 freq
hodge - 3 freq
'hid's - 2 freq
haets - 2 freq
hades - 4 freq
haads - 17 freq
hadds - 22 freq
'hit's - 5 freq
heat's - 1 freq
het-houss - 1 freq
head's - 2 freq
hethoose - 1 freq
hads - 2 freq
hi-tech - 2 freq
heds - 1 freq
hoids - 2 freq
hieds - 2 freq
'haddow's' - 1 freq
hie-tech - 1 freq
haddocks - 1 freq
hutch - 7 freq
hudds - 1 freq
hudge - 1 freq
heid-heich - 1 freq
heides - 1 freq
huddies - 1 freq
heywood's - 1 freq
hyde's - 1 freq
heedache - 2 freq
haddicks - 2 freq
hudduck - 2 freq
hitec - 1 freq
hotdog - 1 freq
huddock - 6 freq
heid’s - 1 freq
headache - 4 freq
'hates - 1 freq
hewitt's - 1 freq
hutchi - 1 freq
hutchie - 2 freq
hots - 1 freq
hid’s - 2 freq
hdq - 1 freq
heydays - 1 freq
MetaPhone code - TS
doos - 101 freq
does - 385 freq
days - 1574 freq
taes - 107 freq
ties - 31 freq
douce - 128 freq
dis - 1552 freq
daes - 213 freq
tousie - 11 freq
daisy - 62 freq
toays - 1 freq
diz - 55 freq
tea's - 4 freq
ts - 44 freq
doze - 10 freq
'tis - 9 freq
toes - 48 freq
tis - 38 freq
tts - 2 freq
tt's - 3 freq
toss - 27 freq
daise - 2 freq
day's - 105 freq
dizzy - 19 freq
tassie - 37 freq
dose - 45 freq
tosie - 4 freq
das - 23 freq
toi's - 1 freq
taz - 21 freq
'taz - 1 freq
tissue - 6 freq
days' - 12 freq
tease - 10 freq
tawse - 14 freq
dy's - 2 freq
dozy - 10 freq
toys - 45 freq
tazzy - 1 freq
tizz - 1 freq
tizzy - 2 freq
tawsie - 1 freq
teas - 3 freq
dozie - 5 freq
touzie - 3 freq
daws - 5 freq
da's - 67 freq
douse - 3 freq
toosie - 2 freq
duis - 23 freq
dos - 4 freq
dees - 41 freq
'daes - 1 freq
dies - 7 freq
dues - 10 freq
deys - 9 freq
t's - 9 freq
daw's - 1 freq
towsie - 3 freq
doss - 5 freq
tozie - 3 freq
twyse - 1 freq
tae's - 5 freq
'tt's - 1 freq
'daisy - 2 freq
tae-us - 1 freq
deus - 18 freq
'does - 5 freq
douze - 1 freq
da''s - 1 freq
hts - 5 freq
dehs - 1 freq
dice - 13 freq
daze - 7 freq
tess - 1 freq
t'is - 6 freq
dyce - 5 freq
dae's - 3 freq
doozie - 1 freq
dows - 1 freq
t--s - 1 freq
toass - 2 freq
dess - 11 freq
'das - 1 freq
'tes - 1 freq
ïts - 16 freq
hïts - 1 freq
ït's - 16 freq
wüt's - 1 freq
daiys - 1 freq
di's - 1 freq
taws - 1 freq
doos' - 1 freq
tees - 3 freq
wytes - 3 freq
'da's - 2 freq
t'sae - 1 freq
doose - 6 freq
'du's - 4 freq
du's - 113 freq
'dice' - 1 freq
dis' - 3 freq
dus - 24 freq
'dis - 10 freq
des - 23 freq
dozey - 1 freq
tice - 3 freq
ds - 9 freq
dese - 6 freq
'ts - 3 freq
dous - 6 freq
tise - 3 freq
do's - 2 freq
dee's - 2 freq
doo's - 4 freq
dæs - 2 freq
tæs - 1 freq
tiso - 1 freq
dowse - 2 freq
dace - 4 freq
tass - 1 freq
dös - 1 freq
tss - 2 freq
deece - 1 freq
't's - 2 freq
tize - 4 freq
taas - 1 freq
dicey - 1 freq
deizie - 4 freq
daies - 2 freq
deezie - 1 freq
Ötzi - 1 freq
dss - 2 freq
daa's - 2 freq
tows - 2 freq
duys - 1 freq
duys - 1 freq
dis - 3 freq
deas - 1 freq
dasy - 1 freq
tis - 1 freq
dees - 1 freq
dois - 3 freq
dow's - 2 freq
tyse - 1 freq
tyso - 1 freq
dys- - 1 freq
disa- - 1 freq
daiss - 2 freq
dizzie - 2 freq
toiys - 1 freq
dosy - 1 freq
'daes' - 1 freq
hyde's - 1 freq
tos - 3 freq
die's - 1 freq
daz - 1 freq
dis - 17 freq
doozy - 1 freq
tis - 2 freq
tues - 6 freq
does - 5 freq
ts'e - 3 freq
doo’s - 3 freq
day’s - 2 freq
dsa - 1 freq
dzyy - 1 freq
dazzy - 1 freq
dis- - 1 freq
ti's - 1 freq
tiz - 3 freq
tezza - 1 freq
dz - 3 freq
too’s - 1 freq
deuce - 3 freq
da’s - 2 freq
desi - 1 freq
deis - 1 freq
doozo - 1 freq
doucie - 1 freq
ytzo - 1 freq
dsi - 1 freq
du’s - 1 freq
dissae - 1 freq
tzu - 1 freq
tes - 2 freq
tihz - 1 freq
dze - 1 freq
taes' - 1 freq
days’ - 1 freq
Manually curated
HÏTS
Time to execute Levenshtein function - 0.353662 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
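As a rough illustration of the definition above, the following Python sketch computes the distance with the standard dynamic-programming recurrence. It is not the site's own code, and the function name `levenshtein` is chosen here for illustration. It compares whatever units the input is made of: characters for a `str`, bytes for `bytes`. The listing above gives hits a distance of 2 from hïts, which is what a byte-wise comparison of UTF-8 text would produce (an assumption about how the site counts, not something the page states).

```python
def levenshtein(a, b):
    """Edit distance between two sequences (str or bytes).

    Illustrative sketch only; not the corpus site's implementation.
    """
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty sequence costs i deletions
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # delete item_a
                current[j - 1] + 1,      # insert item_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("hïts", "fïts"))  # 1
print(levenshtein("hïts", "hits"))  # 1 when counted by character
print(levenshtein("hïts".encode("utf-8"), "hits".encode("utf-8")))  # 2 when counted by byte
```

Counting by character or by byte only matters for words containing letters outside ASCII, such as ï or ø.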
Time to execute Double Levenshtein function - 0.445192 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
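The description above can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the names `double_levenshtein`, `strip_vowels` and `VOWELS` are invented for illustration, and the vowel set (plain a, e, i, o, u only, so accented letters such as ï are kept) is a guess, since the page does not say which letters the site removes.

```python
VOWELS = set("aeiou")  # assumption: only unaccented ASCII vowels are removed


def levenshtein(a, b):
    """Standard edit distance, counted per character."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,         # deletion
                               current[j - 1] + 1,      # insertion
                               previous[j - 1] + cost)) # substitution
        previous = current
    return previous[-1]


def strip_vowels(word):
    """Remove the letters in VOWELS, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a, b):
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("hits", "hats"))  # 1: a vowel change is counted once
print(double_levenshtein("hits", "hids"))  # 2: a consonant change is counted twice
```

The two calls show the intended effect: swapping a vowel only affects the first pass, while swapping a consonant shows up in both passes.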
Time to execute SoundEx function - 0.028306 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
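For illustration, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, encode the following consonants as digits, collapse repeated codes, and pad to four characters. The function name `soundex` and the choice to ignore any character outside a-z are assumptions made for this example; the site's own function may differ in detail.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no digit.
CODES = {}
for digit, group in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                     "4": "l", "5": "mn", "6": "r"}.items():
    for letter in group:
        CODES[letter] = digit


def soundex(word):
    """Four-character Soundex code (illustrative sketch)."""
    # Assumption: anything that is not a plain a-z letter is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated code counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")


print(soundex("hits"))     # H320
print(soundex("heids"))    # H320
print(soundex("haddock"))  # H320
```

All three words come from the SoundEx listing above and share the code H320, which is why they are grouped together.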
Time to execute MetaPhone function - 0.038019 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000925 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
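A lookup of this kind can be sketched as a plain dictionary from word forms to lemmas. Everything in the example is hypothetical: the name `LEXICON`, the function `lemma_for`, and the three sample entries are invented for illustration and are not taken from the site's actual table.

```python
from typing import Optional

# Hypothetical hand-made entries: spelling variants and typos mapped to a lemma.
LEXICON = {
    "heids": "heid",
    "heid's": "heid",
    "hieds": "heid",  # treated here as a typo, purely for illustration
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Heids"))  # heid
print(lemma_for("hïts"))   # None, because this sample table has no entry for it
```

Returning `None` for unknown words mirrors the caveat that not all words are covered, and it is the cheapest of the methods because it is a single table lookup.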