A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to cïties in Corpus

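Methods, in the order the results are listed: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. In the two Levenshtein lists the number in parentheses is the distance from cïties.
Levenshtein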
cïties (0) - 2 freq
cïtie (1) - 6 freq
cíties (1) - 1 freq
ceeties (2) - 7 freq
costies (2) - 8 freq
cutties (2) - 3 freq
cooties (2) - 1 freq
cities (2) - 42 freq
cyties (2) - 1 freq
lïlies (2) - 2 freq
cuties (2) - 2 freq
certies (2) - 17 freq
ceities (2) - 8 freq
catties (2) - 3 freq
besties (3) - 1 freq
feeties (3) - 5 freq
gutties (3) - 11 freq
bitties (3) - 29 freq
foaties (3) - 1 freq
carnies (3) - 2 freq
chories (3) - 1 freq
clothes (3) - 27 freq
cosies (3) - 1 freq
cassies (3) - 6 freq
cattie's (3) - 1 freq
Double Levenshtein
cïties (0) - 2 freq
cíties (2) - 1 freq
cïtie (2) - 6 freq
bïts (4) - 7 freq
catties (4) - 3 freq
fïts (4) - 1 freq
hïts (4) - 1 freq
ïts (4) - 16 freq
cöts (4) - 3 freq
certies (4) - 17 freq
sïts (4) - 1 freq
ceities (4) - 8 freq
cooties (4) - 1 freq
cutties (4) - 3 freq
costies (4) - 8 freq
ceeties (4) - 7 freq
cities (4) - 42 freq
cyties (4) - 1 freq
lïlies (4) - 2 freq
cuties (4) - 2 freq
cavities (5) - 1 freq
chutes (5) - 1 freq
ït's (5) - 16 freq
cites (5) - 12 freq
cairties (5) - 1 freq
SoundEx code - C320
cottage - 49 freq
catch - 346 freq
cities - 42 freq
city's - 10 freq
cats - 122 freq
cat's - 32 freq
ceeties - 7 freq
cuddies - 49 freq
cuddie's - 10 freq
codes - 4 freq
cuts - 44 freq
coats - 27 freq
chats - 3 freq
cotch - 8 freq
cds - 19 freq
cadiz - 1 freq
cute's - 1 freq
cd's - 2 freq
cothous - 9 freq
cot-hous - 2 freq
cahoots - 2 freq
chat's - 1 freq
cuits - 2 freq
cuddies' - 1 freq
cut's - 2 freq
cit's - 1 freq
cathoose - 2 freq
cautch - 1 freq
'cuddies - 1 freq
citz - 2 freq
cautious - 8 freq
cheats - 2 freq
coat's - 3 freq
cats' - 2 freq
catties - 3 freq
c-c-d's - 1 freq
'cheats' - 2 freq
couttie's - 3 freq
chotce - 1 freq
cïties - 2 freq
cottage' - 1 freq
cathy's - 20 freq
'catch - 2 freq
cuithes - 4 freq
cots - 2 freq
cadgy - 1 freq
châteaus - 1 freq
chates - 1 freq
cadgie - 2 freq
cahootchie - 2 freq
catchy - 3 freq
'cathy's - 1 freq
cotts - 3 freq
cöts - 3 freq
caddies - 2 freq
chutes - 1 freq
cutties - 3 freq
cyties - 1 freq
cities' - 1 freq
cits - 1 freq
'catchie' - 1 freq
cites - 12 freq
caats - 4 freq
codgie - 2 freq
cowtious - 1 freq
cahoutchy - 1 freq
cootch - 3 freq
ceities - 8 freq
codds - 2 freq
cadge - 2 freq
coits - 1 freq
coutch - 1 freq
cutesy - 1 freq
chits - 1 freq
cíties - 1 freq
coattage - 1 freq
cooties - 1 freq
chaotic - 3 freq
coots - 1 freq
cods - 2 freq
cuddies - 1 freq
coads - 1 freq
catchie - 2 freq
cottige - 1 freq
ceuithes - 1 freq
cattie's - 1 freq
czdq - 1 freq
caddis - 1 freq
cts - 1 freq
cuddy's - 1 freq
cattyish - 20 freq
cuddys - 1 freq
chdk - 1 freq
cyatcy - 1 freq
czdxi - 1 freq
cuties - 2 freq
coutts - 1 freq
caithess - 1 freq
ctdg - 1 freq
cedk - 1 freq
catwawk - 1 freq
MetaPhone code - KTS
goad's - 11 freq
cats - 122 freq
cat's - 32 freq
kites - 6 freq
guts - 73 freq
gods - 73 freq
guids - 24 freq
cuddies - 49 freq
cuddie's - 10 freq
gutsy - 7 freq
gaits - 30 freq
god's - 125 freq
codes - 4 freq
cuts - 44 freq
kate's - 26 freq
gates - 102 freq
guides - 15 freq
quids' - 1 freq
kiddies - 2 freq
gutties - 11 freq
coats - 27 freq
kids - 87 freq
goats - 17 freq
goddess - 53 freq
quits - 3 freq
goods - 21 freq
gads - 8 freq
cds - 19 freq
quotes - 15 freq
cadiz - 1 freq
kitty's - 1 freq
cute's - 1 freq
good's - 3 freq
cd's - 2 freq
gtice - 1 freq
goads - 30 freq
'goads - 1 freq
goads' - 1 freq
goodies - 6 freq
cuits - 2 freq
cuddies' - 1 freq
goddis - 3 freq
gait's - 4 freq
cut's - 2 freq
gut's - 4 freq
gats - 1 freq
guid's - 4 freq
gowdie's - 1 freq
'cuddies - 1 freq
coat's - 3 freq
gode's - 1 freq
'gutsie - 1 freq
queets - 6 freq
cats' - 2 freq
catties - 3 freq
ket's - 1 freq
kidz - 1 freq
couttie's - 3 freq
kits - 3 freq
cïties - 2 freq
gaudi's - 1 freq
kytes - 2 freq
kat's - 15 freq
'kat's - 1 freq
cots - 2 freq
göd's - 2 freq
goat's - 2 freq
göds - 1 freq
gaets - 4 freq
godes - 2 freq
'god's - 2 freq
cotts - 3 freq
gude's - 1 freq
godds - 1 freq
cöts - 3 freq
caddies - 2 freq
cutties - 3 freq
gaetes - 1 freq
quotas - 4 freq
kuts - 1 freq
gate's - 1 freq
caats - 4 freq
'katze - 1 freq
gowdies - 2 freq
guidis - 1 freq
quytes - 1 freq
gutsie - 2 freq
'gad's - 1 freq
quotes - 1 freq
guds - 1 freq
codds - 2 freq
gowds - 1 freq
coits - 1 freq
gudis - 3 freq
kidds - 2 freq
gaits - 1 freq
cutesy - 1 freq
cíties - 1 freq
wkds - 2 freq
cooties - 1 freq
kudos - 2 freq
gods' - 1 freq
coots - 1 freq
godddess - 1 freq
goddesss - 1 freq
cods - 2 freq
queats - 2 freq
cuddies - 1 freq
gods - 2 freq
quats - 1 freq
coads - 1 freq
goddis - 1 freq
gads - 1 freq
cattie's - 1 freq
gawds - 3 freq
gtz - 1 freq
caddis - 1 freq
qwwdci - 1 freq
god’s - 3 freq
kdaz - 1 freq
cts - 1 freq
cuddy's - 1 freq
kdz - 1 freq
gadz - 1 freq
gouts - 1 freq
gout's - 1 freq
cuddys - 1 freq
kid's - 1 freq
wgd's - 1 freq
ketts - 1 freq
cuties - 2 freq
coutts - 1 freq
ktze - 1 freq
gates’ - 1 freq
Manually curated - CÏTIES
Time to execute Levenshtein function - 0.197616 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
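As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page. The function name `levenshtein` is just a label for the example. One assumption is worth flagging: the listing gives cities a distance of 2 from cïties, which is what you get when the words are compared as UTF-8 bytes (ï takes two bytes) rather than as characters. Comparing bytes is assumed here only because it reproduces the numbers shown.

```python
from typing import Sequence


def levenshtein(a: Sequence, b: Sequence) -> int:
    """Minimum number of single-element edits (insert, delete, substitute) turning a into b."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty prefix of b takes i deletions
        for j, y in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,             # delete x
                current[j - 1] + 1,          # insert y
                previous[j - 1] + (x != y),  # substitute, free when x == y
            ))
        previous = current
    return previous[-1]


query = "c\u00efties"  # cïties, with a precomposed ï

# Compared character by character, ï -> i is a single substitution.
print(levenshtein(query, "cities"))  # 1

# Compared byte by byte (an assumption), ï is two bytes, so the distance is 2.
print(levenshtein(query.encode("utf-8"), "cities".encode("utf-8")))  # 2
```

Under the byte-level assumption, cíties comes out at distance 1, because ï and í differ in a single byte.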
Time to execute Double Levenshtein function - 0.353543 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
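The two-pass idea can be sketched as follows. This is a guess at the shape of the calculation, not the site's code. The names `strip_vowels` and `double_levenshtein` are invented for the example. Two details are assumptions: only the plain vowels a, e, i, o, u are removed, and the words are compared as UTF-8 bytes. Both were chosen because they reproduce the values listed for cïties.

```python
def levenshtein(a: bytes, b: bytes) -> int:
    """Byte-level Levenshtein distance (byte-level comparison is an assumption)."""
    previous = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        current = [i]
        for j, y in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,             # delete x
                current[j - 1] + 1,          # insert y
                previous[j - 1] + (x != y),  # substitute, free when x == y
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop plain ASCII vowels only; accented letters such as ï are kept (an assumption)."""
    return "".join(ch for ch in word if ch not in "aeiouAEIOU")


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the whole words plus distance on their consonant skeletons."""
    whole = levenshtein(a.encode("utf-8"), b.encode("utf-8"))
    skeleton = levenshtein(strip_vowels(a).encode("utf-8"),
                           strip_vowels(b).encode("utf-8"))
    return whole + skeleton


query = "c\u00efties"  # cïties
print(double_levenshtein(query, "b\u00efts"))  # bïts -> 3 + 1 = 4
print(double_levenshtein(query, "cavities"))   # 3 + 2 = 5
```

Because the consonant skeleton is counted in both passes, a changed consonant costs twice as much as a changed vowel.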
Time to execute SoundEx function - 0.031056 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
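For readers who want to see how a code such as C320 arises, here is a minimal Python sketch of the classic American Soundex rules. It is not necessarily the implementation used on this page. The function name `soundex` is a label for the example. Skipping any character outside a to z (so that ï is ignored) is an assumption. It yields C320 for cïties, matching the code shown above.

```python
def soundex(word: str) -> str:
    """Classic four-character Soundex code: first letter plus three digits."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: anything that is not a plain a-z letter is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so a repeated code counts again
    return (result + "000")[:4]  # pad with zeros, keep four characters


for w in ("c\u00efties", "cities", "cottage", "catch"):
    print(w, soundex(w))  # each prints C320
```

Only the first three coded consonants after the initial letter matter, which is why words as different as cottage and catch share a code with cities.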
Time to execute MetaPhone function - 0.039174 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000745 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
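A lookup of this kind can be pictured as a plain dictionary. The sketch below is purely illustrative. The name `LEXICON`, the function `lookup_lemma`, and every entry in the table are hypothetical and are not taken from the corpus's real lexicon.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real table is built by hand
# and its contents are not shown on this page.
LEXICON = {
    "cities": "city",
    "cyties": "city",
    "ceeties": "city",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Cyties"))   # city
print(lookup_lemma("sonsie"))   # None, because this toy table does not cover it
```

A direct table lookup involves no string comparison at all, which may be why this is the fastest of the five timings reported.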