A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to cis in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
cis (0) - 26 freq
eis (1) - 478 freq
yis (1) - 275 freq
ciss (1) - 1 freq
lis (1) - 1 freq
c's (1) - 1 freq
uis (1) - 17 freq
dis (1) - 1552 freq
jis (1) - 38 freq
tis (1) - 38 freq
is (1) - 18321 freq
cif (1) - 4 freq
zis (1) - 2 freq
cas (1) - 4 freq
mis (1) - 3 freq
qis (1) - 1 freq
cos (1) - 456 freq
cist (1) - 1 freq
acis (1) - 3 freq
bis (1) - 19 freq
vis (1) - 1 freq
ris (1) - 41 freq
cih (1) - 2 freq
ais (1) - 18 freq
gis (1) - 1 freq
Double Levenshtein
cis (0) - 26 freq
cas (1) - 4 freq
cos (1) - 456 freq
cus (1) - 130 freq
cais (1) - 1 freq
cs (1) - 5 freq
acis (1) - 3 freq
cia (2) - 3 freq
chis (2) - 2 freq
ciu (2) - 1 freq
his (2) - 17253 freq
cosy (2) - 57 freq
wis (2) - 28134 freq
kis (2) - 142 freq
iis (2) - 1 freq
cim (2) - 5 freq
'is (2) - 151 freq
cims (2) - 1 freq
wcis (2) - 1 freq
ices (2) - 2 freq
cigs (2) - 1 freq
csa (2) - 1 freq
ciaes (2) - 1 freq
caise (2) - 6 freq
cusa (2) - 1 freq
SoundEx code - C200
check - 203 freq
cosy - 57 freq
chucks - 5 freq
cheek - 141 freq
cheeks - 78 freq
case - 447 freq
cake - 166 freq
cook - 201 freq
cocky - 12 freq
cog - 3 freq
cause - 1186 freq
choosy-hoi - 1 freq
choke - 23 freq
cheeky - 92 freq
chuck - 29 freq
choice - 157 freq
cage - 59 freq
cease - 10 freq
chuise - 15 freq
'cause - 18 freq
cosh - 12 freq
cuzco - 1 freq
coos - 74 freq
chug - 2 freq
chaws - 1 freq
cash - 85 freq
couch - 67 freq
cos - 456 freq
chose - 62 freq
chessie - 1 freq
cock - 82 freq
cogs - 2 freq
causey - 25 freq
cheese - 140 freq
cheque - 20 freq
caas - 44 freq
chaise - 3 freq
chase - 51 freq
chis - 2 freq
chicks - 8 freq
cuz - 8 freq
cough - 61 freq
coz - 44 freq
'cos - 7 freq
cows - 5 freq
coach - 51 freq
choose - 80 freq
chowk - 6 freq
cack - 4 freq
chowks - 9 freq
cushie - 15 freq
cou's - 1 freq
chuckie - 10 freq
cushy - 3 freq
chik - 11 freq
cissie - 7 freq
caws - 36 freq
caus - 8 freq
caz - 2 freq
cassie - 10 freq
cuiks - 2 freq
coggie - 5 freq
czech - 7 freq
ciseau - 1 freq
cowk - 4 freq
chyce - 52 freq
chez - 2 freq
cocks - 11 freq
chack - 8 freq
ca's - 5 freq
cuik - 4 freq
cheesy - 15 freq
chookie - 66 freq
chakks - 1 freq
chico - 5 freq
czesc - 1 freq
chikk - 1 freq
cooks - 6 freq
chese - 1 freq
cogie - 6 freq
chaos - 24 freq
chooky - 6 freq
chuse - 9 freq
chess - 4 freq
cheesey - 3 freq
cauk - 5 freq
cock's - 3 freq
chooks - 18 freq
chice - 3 freq
couse - 23 freq
cus - 130 freq
cis - 26 freq
caise - 6 freq
coasee - 1 freq
ciss - 1 freq
chooch - 1 freq
chuk - 1 freq
choyse - 1 freq
coke - 34 freq
casey - 7 freq
choice' - 2 freq
chook - 4 freq
casks - 2 freq
cosie - 23 freq
causie - 9 freq
chak - 5 freq
cookie - 15 freq
coco - 5 freq
'cash - 1 freq
cow's - 2 freq
ciggie - 4 freq
cagey - 4 freq
cozy - 4 freq
checky - 3 freq
'check - 6 freq
'cook - 1 freq
casie - 3 freq
chiky - 3 freq
chiks - 17 freq
cheege - 1 freq
choosie - 1 freq
'cocky' - 2 freq
cheus - 6 freq
caase - 29 freq
cox - 5 freq
chyach - 1 freq
cozzie - 3 freq
chazza - 6 freq
coo's - 16 freq
cokey - 1 freq
chick - 16 freq
cauge - 1 freq
coca - 1 freq
chuza - 2 freq
choc - 4 freq
cacao - 14 freq
choo's - 1 freq
caause - 2 freq
chews - 1 freq
cigs - 1 freq
caa's - 1 freq
cass - 14 freq
cooch - 7 freq
'chookie - 2 freq
chucky - 1 freq
coose - 8 freq
causay - 2 freq
checks - 12 freq
cask - 2 freq
'cheeky - 1 freq
cowie's - 2 freq
cowie''s - 1 freq
cox's - 1 freq
cak - 1 freq
chukkie - 1 freq
coks - 1 freq
cheke - 5 freq
cheuss - 2 freq
choss - 2 freq
chuggie - 2 freq
coax - 4 freq
cas - 4 freq
chough - 1 freq
cocoa - 3 freq
chic - 10 freq
cha's - 1 freq
chocs - 1 freq
chaas - 2 freq
cues - 3 freq
cheik - 1 freq
cecco - 18 freq
ciaes - 1 freq
cais - 1 freq
caskie - 4 freq
cowks - 1 freq
chock - 1 freq
caess - 3 freq
cows' - 1 freq
coax' - 1 freq
caa-caa - 1 freq
chouks - 5 freq
cheise - 1 freq
chouk - 1 freq
coq - 1 freq
cheesie - 1 freq
cookie' - 2 freq
chyaach - 1 freq
choosy - 1 freq
‘cozie - 1 freq
‘cacks - 1 freq
chasse - 2 freq
choiss - 2 freq
ckeck - 1 freq
chakk - 4 freq
chikks - 6 freq
cok - 1 freq
cheis - 1 freq
cace - 1 freq
caig - 1 freq
cusa - 1 freq
’casey - 1 freq
chows - 1 freq
cuckoo - 9 freq
coagh - 1 freq
causeway - 3 freq
“cos - 1 freq
chuggy - 1 freq
chasie - 1 freq
cous - 1 freq
cac - 1 freq
cses - 2 freq
caese - 2 freq
chos - 1 freq
chugs - 1 freq
“cuckoo - 4 freq
‘case - 1 freq
‘cock - 1 freq
…cos - 1 freq
‘cause - 7 freq
“cause - 2 freq
caius - 3 freq
’cause - 2 freq
“cheeky - 1 freq
“cowk - 2 freq
chacks - 1 freq
chackie - 2 freq
chuko - 1 freq
chu-ko - 3 freq
chise - 2 freq
‘cos - 4 freq
’cheeky - 1 freq
cag - 1 freq
cheeeeese - 1 freq
‘cus - 1 freq
ckykzz - 1 freq
chaz - 19 freq
coo’s - 3 freq
czazo - 1 freq
casa - 1 freq
ciesx - 1 freq
chegs - 3 freq
ckhhyys - 1 freq
csez - 1 freq
cossye - 1 freq
cossy - 1 freq
cox’ - 1 freq
cheezy - 1 freq
cxoj - 1 freq
cos' - 1 freq
ckwg - 1 freq
cashe - 1 freq
chacha - 1 freq
chase' - 1 freq
cheuch - 2 freq
cec - 1 freq
chek - 5 freq
'cheugh' - 1 freq
cig - 1 freq
ccxic - 1 freq
cheeko' - 2 freq
chck - 1 freq
ciky - 1 freq
cookoo - 1 freq
cuhj - 1 freq
chag - 1 freq
casc - 1 freq
cach - 1 freq
'cach - 1 freq
MetaPhone code - SS
says - 2414 freq
sees - 192 freq
sea's - 15 freq
size - 294 freq
'seuse - 1 freq
zis - 2 freq
so's - 87 freq
seas - 57 freq
cease - 10 freq
sauce - 51 freq
sais - 676 freq
say's - 136 freq
soss - 15 freq
si's - 3 freq
seys - 81 freq
sae's - 29 freq
cissie - 7 freq
sass - 5 freq
ciseau - 1 freq
sows - 2 freq
's's - 1 freq
sez - 30 freq
saace - 5 freq
suss - 7 freq
ses - 6 freq
saes - 10 freq
saws - 20 freq
susy - 5 freq
see's - 13 freq
sesse - 1 freq
susie - 38 freq
sus - 1 freq
cis - 26 freq
hce's - 6 freq
ciss - 1 freq
syze - 5 freq
wcis - 1 freq
so-so - 1 freq
so''s - 1 freq
sousse - 1 freq
sis - 5 freq
suzie - 5 freq
suze - 1 freq
sassy - 1 freq
sos - 4 freq
'says - 1 freq
soo's - 3 freq
say-so - 1 freq
sayz - 2 freq
sehz - 2 freq
sehze - 1 freq
siz - 3 freq
sze - 1 freq
sas - 4 freq
saaws - 1 freq
seize - 2 freq
öses - 1 freq
'sus - 1 freq
susi - 12 freq
suez - 13 freq
sews - 1 freq
soos - 3 freq
sie's - 3 freq
says- - 1 freq
sissie - 1 freq
sies - 3 freq
sissy - 3 freq
sehs - 1 freq
zeus - 12 freq
øses - 4 freq
'sas - 1 freq
sowsie - 1 freq
saies - 1 freq
soucie - 1 freq
sae-say - 1 freq
'see's - 1 freq
sizzie - 1 freq
’ses - 1 freq
see'z - 1 freq
zoos - 2 freq
sess - 3 freq
suys - 1 freq
saas - 8 freq
“size - 1 freq
suzy - 24 freq
see-saw - 1 freq
se's - 1 freq
sys - 2 freq
see-saa - 1 freq
sussie - 1 freq
sauzee - 1 freq
xz - 4 freq
xs - 2 freq
zs - 5 freq
sz - 5 freq
suzzy - 3 freq
siis - 1 freq
xsy - 1 freq
xci - 1 freq
soz - 6 freq
xhz - 1 freq
zzzs - 1 freq
xoiwz - 1 freq
xehws - 1 freq
saucy - 1 freq
zzz's - 1 freq
zzzzzzzz's - 1 freq
xxz - 1 freq
sozz - 2 freq
xzo - 1 freq
so’s - 1 freq
hzso - 1 freq
sece - 1 freq
susa - 3 freq
zuaz - 1 freq
suis - 1 freq
zss - 1 freq
zys - 1 freq
zse - 1 freq
zhws - 1 freq
sose - 9 freq
xhs - 1 freq
souce - 1 freq
Manually curated - CIS
Time to execute Levenshtein function - 0.182599 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
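As a rough illustration of the definition above, here is a minimal Python sketch of the distance, plus a small helper that ranks candidate words by it. The function names `levenshtein` and `nearest` and the `max_distance` parameter are made up for this example, and the sample words are a handful taken from the result list; this is not the code the site itself runs.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(target, words, max_distance=1):
    """Return (word, distance) pairs within max_distance, closest first."""
    scored = [(word, levenshtein(target, word)) for word in words]
    kept = [pair for pair in scored if pair[1] <= max_distance]
    return sorted(kept, key=lambda pair: (pair[1], pair[0]))


# A few words from the result list, used only as sample input.
sample = ["cis", "eis", "ciss", "is", "cosy", "his"]
print(nearest("cis", sample))
```

With `max_distance=1` the helper keeps words that are one edit away from the query and drops `cosy`, which needs two edits.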
Time to execute Double Levenshtein function - 0.315048 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
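The double-weighting idea can be sketched in a few lines of Python. The helper names (`strip_vowels`, `double_levenshtein`) are made up for this example, and the vowel set is an assumption: treating `y` as a vowel happens to reproduce the scores listed above for `cas`, `his` and `cosy`, but the site's own vowel list is not stated.

```python
VOWELS = set("aeiouy")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance using a rolling row of the edit matrix."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character that is in VOWELS, keeping everything else."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    skeleton = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + skeleton


for word in ["cas", "his", "cosy"]:
    print(word, double_levenshtein("cis", word))
```

A vowel change such as `cis` to `cas` costs one edit in the first pass and nothing in the second, while a consonant change such as `cis` to `his` is counted in both passes.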
Time to execute SoundEx function - 0.028187 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
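For readers who want to see how a word ends up as a code like C200, the following is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map consonants to digits, collapse adjacent equal digits, let `h` and `w` pass without separating equal digits, and pad to four characters. The function name `soundex` and the decision to ignore non-letters such as apostrophes are assumptions of this sketch; the implementation behind the page may differ in edge cases.

```python
# Digit groups of the standard American Soundex table.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no a-z letters."""
    # Assumption: apostrophes, hyphens and other non a-z characters are skipped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of equal digits
        code = CODES.get(ch, "")
        if code and code != last_code:
            result += code
            if len(result) == 4:
                break
        last_code = code  # a vowel resets the run, so a repeat is coded again
    return result.ljust(4, "0")


for word in ["cis", "check", "causeway", "czech"]:
    print(word, soundex(word))
```

All four sample words come from the SoundEx list above, and each reduces to a `C` followed by a single `2`, padded with zeros.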
Time to execute MetaPhone function - 0.037242 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
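A lookup of this kind can be pictured as a plain dictionary from surface form to lemma. The sketch below is only an illustration: the table name `LEMMA_TABLE`, the two helper functions, and every entry in the table are hypothetical and are not taken from the corpus lexicon.

```python
from typing import List, Optional

# Hypothetical entries for illustration only; the real lexicon is not shown on the page.
LEMMA_TABLE = {
    "cis": "CIS",
    "ciss": "CIS",
    "hoose": "HOUSE",
    "hoos": "HOUSE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the lemma for word, or None when the word is not covered."""
    return LEMMA_TABLE.get(word.lower())


def variants_of(word: str) -> List[str]:
    """List every table entry that shares a lemma with word."""
    lemma = lookup_lemma(word)
    if lemma is None:
        return []
    return sorted(form for form, target in LEMMA_TABLE.items() if target == lemma)


print(lookup_lemma("cis"), variants_of("cis"))
print(lookup_lemma("ahint"))
```

Because the table is a direct key lookup rather than a comparison against every word, it is fast, and words missing from the table simply return nothing.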