A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to cities in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in parentheses)
cities (0) - 43 freq
cites (1) - 12 freq
citie (1) - 10 freq
ceities (1) - 8 freq
cuties (1) - 2 freq
cyties (1) - 1 freq
cities' (1) - 1 freq
linties (2) - 3 freq
cuitie (2) - 1 freq
litres (2) - 3 freq
kites (2) - 6 freq
cittie (2) - 2 freq
citizen (2) - 10 freq
tithes (2) - 2 freq
sixties (2) - 15 freq
ceeties (2) - 7 freq
cutie (2) - 4 freq
lilties (2) - 2 freq
ritirs (2) - 1 freq
sites (2) - 22 freq
copies (2) - 41 freq
titles (2) - 13 freq
cairies (2) - 4 freq
catties (2) - 3 freq
cavities (2) - 1 freq
Double Levenshtein (distance in parentheses)
cities (0) - 43 freq
cyties (1) - 1 freq
cuties (1) - 2 freq
ceities (1) - 8 freq
cites (1) - 12 freq
cooties (2) - 1 freq
ceeties (2) - 7 freq
cits (2) - 1 freq
citie (2) - 10 freq
cities' (2) - 1 freq
cosies (3) - 1 freq
mites (3) - 1 freq
cats (3) - 124 freq
cries (3) - 101 freq
noties (3) - 2 freq
duties (3) - 16 freq
cuithes (3) - 4 freq
certies (3) - 17 freq
rites (3) - 8 freq
cutties (3) - 3 freq
caries (3) - 1 freq
cuts (3) - 45 freq
costies (3) - 8 freq
cite (3) - 2 freq
itis (3) - 1 freq
SoundEx code - C320
cottage - 49 freq
catch - 353 freq
cities - 43 freq
city's - 10 freq
cats - 124 freq
cat's - 32 freq
ceeties - 7 freq
cuddies - 49 freq
cuddie's - 10 freq
codes - 4 freq
cuts - 45 freq
coats - 28 freq
chats - 3 freq
cotch - 8 freq
cds - 20 freq
cadiz - 2 freq
cute's - 1 freq
cd's - 2 freq
cothous - 9 freq
cot-hous - 2 freq
cahoots - 2 freq
chat's - 1 freq
cuits - 2 freq
cuddies' - 2 freq
cut's - 2 freq
cit's - 1 freq
cathoose - 1 freq
cautch - 1 freq
coat's - 4 freq
cathy's - 27 freq
'cathy's - 2 freq
'cuddies - 1 freq
citz - 2 freq
cautious - 8 freq
cheats - 2 freq
cats' - 2 freq
catties - 3 freq
c-c-d's - 1 freq
'cheats' - 2 freq
couttie's - 3 freq
chotce - 1 freq
cïties - 2 freq
cottage' - 1 freq
'catch - 2 freq
cuithes - 4 freq
cots - 2 freq
cadgy - 1 freq
châteaus - 1 freq
chates - 1 freq
cadgie - 2 freq
cahootchie - 2 freq
catchy - 3 freq
cotts - 3 freq
cöts - 3 freq
caddies - 2 freq
chutes - 1 freq
cutties - 3 freq
cyties - 1 freq
cities' - 1 freq
cits - 1 freq
'catchie' - 1 freq
cites - 12 freq
caats - 4 freq
codgie - 2 freq
cowtious - 1 freq
cahoutchy - 1 freq
cootch - 3 freq
ceities - 8 freq
codds - 2 freq
cadge - 2 freq
coits - 1 freq
coutch - 1 freq
cutesy - 1 freq
chits - 1 freq
cíties - 1 freq
coattage - 1 freq
cooties - 1 freq
chaotic - 3 freq
coots - 1 freq
cods - 2 freq
‘cuddies - 1 freq
coads - 1 freq
catchie - 2 freq
cottige - 1 freq
ceuithes - 1 freq
cattie's - 1 freq
czdq - 1 freq
caddis - 1 freq
cts - 1 freq
cuddy's - 1 freq
cattyish - 20 freq
cuddys - 1 freq
chdk - 1 freq
cyatcy - 1 freq
czdxi - 1 freq
cuties - 2 freq
coutts - 1 freq
caithess - 1 freq
ctdg - 1 freq
cedk - 1 freq
catwawk - 1 freq
MetaPhone code - STS
cities - 43 freq
sides - 155 freq
sits - 172 freq
set's - 2 freq
sates - 9 freq
city's - 10 freq
steys - 21 freq
ceeties - 7 freq
seat's - 3 freq
seats - 79 freq
sets - 130 freq
saets - 29 freq
seeds - 47 freq
stacey - 2 freq
stays - 28 freq
suits - 52 freq
seats' - 1 freq
suite's - 1 freq
saits - 4 freq
sait's - 1 freq
suit's - 2 freq
'sides - 1 freq
sods - 10 freq
sadie's - 3 freq
zits - 1 freq
hysts - 1 freq
syde's - 5 freq
cit's - 1 freq
seet's - 2 freq
saudis - 2 freq
side's - 3 freq
seduce - 1 freq
citz - 2 freq
sites - 22 freq
steis - 1 freq
sats' - 1 freq
stews - 5 freq
sïts - 1 freq
suds - 3 freq
saddos - 1 freq
sides' - 1 freq
saut's - 1 freq
sod's - 1 freq
saet's - 1 freq
settes - 1 freq
steasy - 1 freq
staeys - 1 freq
cyties - 1 freq
cities' - 1 freq
cits - 1 freq
suites - 2 freq
staws - 3 freq
cites - 12 freq
say-at's - 1 freq
setts - 2 freq
staas - 4 freq
ceities - 8 freq
saidis - 2 freq
seids - 1 freq
sids - 1 freq
stows - 1 freq
seeds' - 1 freq
saats - 1 freq
sties - 1 freq
staiys - 1 freq
sodas - 1 freq
sowt's - 1 freq
sts - 1 freq
sats - 1 freq
stce - 1 freq
CITIES
city - 288 freq
ceety - 28 freq
ceity - 22 freq
cietie - freq
citie - 10 freq
cities - 43 freq
citizen - 10 freq
citizens - 28 freq
Time to execute Levenshtein function - 0.219140 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
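As a minimal sketch of that definition (not necessarily the code this page runs), the distance can be computed with the standard row-by-row dynamic programme. The function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn word a into word b."""
    # previous[j] is the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty prefix of b
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a with char_b (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("cities", "cites"))    # 1
print(levenshtein("cities", "kites"))    # 2
print(levenshtein("cities", "sixties"))  # 2
```

The three printed values agree with the distances shown in parentheses in the Levenshtein results above.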
Time to execute Double Levenshtein function - 0.340714 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
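A rough sketch of the idea follows. The vowel set (a, e, i, o, u and y), the lowercasing and the function names are assumptions made for illustration. Treating y as a vowel is a guess that happens to reproduce the distance of 1 listed for cyties in the results above.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance between two words."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    a, b = a.lower(), b.lower()
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


print(double_levenshtein("cities", "cites"))  # 1
print(double_levenshtein("cities", "citie"))  # 2
print(double_levenshtein("cities", "cries"))  # 3
```

A vowel-only change such as cities to cites costs nothing in the second pass, while a lost or changed consonant, as in citie or cries, is counted twice.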
Time to execute SoundEx function - 0.027764 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
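The encoding can be sketched with the usual American Soundex rules: keep the first letter, map the following consonants to digits, skip vowels, collapse adjacent equal digits, and pad or cut to four characters. This is an illustrative implementation with assumed names (`CODES`, `soundex`). The page's own function may differ in edge cases such as accented letters, which this sketch simply ignores.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal digits
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so a repeated digit can follow
    return (code + "000")[:4]


print(soundex("cities"))   # C320
print(soundex("cottage"))  # C320
print(soundex("catch"))    # C320
```

All three words share the code C320, which is why cottage and catch appear in the SoundEx results for cities.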
Time to execute MetaPhone function - 0.037014 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding that does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001028 milliseconds
Manual curation uses a lookup table (lexicon), created by hand, that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
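A lookup of this kind might be sketched as a plain dictionary. The table below is hypothetical: it holds a few of the spellings listed under CITIES above, and the headword "city" and the names `LEXICON`, `lemma` and `variants` are assumptions for illustration, not the site's actual data structure.

```python
from typing import List, Optional

# Hypothetical hand-made entries: spelling -> assumed headword (lemma).
LEXICON = {
    "city": "city",
    "ceety": "city",
    "ceity": "city",
    "citie": "city",
    "cities": "city",
}


def lemma(word: str) -> Optional[str]:
    """Return the headword for a spelling, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def variants(word: str) -> List[str]:
    """Return every listed spelling that shares the word's headword."""
    head = lemma(word)
    if head is None:
        return []
    return sorted(w for w, h in LEXICON.items() if h == head)


print(lemma("ceety"))      # city
print(lemma("sonsie"))     # None
print(variants("cities"))  # ['ceety', 'ceity', 'citie', 'cities', 'city']
```

A single dictionary access explains why this method is the fastest of the five, and the `None` result shows the trade-off: words missing from the hand-made table return nothing.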