A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to cer- in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in parentheses)
cer- (0) - 1 freq
cer (1) - 1 freq
ger- (1) - 1 freq
cerd (1) - 7 freq
cern (1) - 1 freq
cert (1) - 1 freq
ceri (1) - 1 freq
her- (1) - 1 freq
clere (2) - 3 freq
eery (2) - 2 freq
terr (2) - 9 freq
cherm (2) - 5 freq
herm (2) - 51 freq
yerk (2) - 3 freq
nerr (2) - 9 freq
tern (2) - 1 freq
cede (2) - 1 freq
hers (2) - 85 freq
carl (2) - 13 freq
tery (2) - 1 freq
werd (2) - 1 freq
merc (2) - 2 freq
sere (2) - 3 freq
hera (2) - 12 freq
fere (2) - 18 freq
Double Levenshtein (distance in parentheses)
cer- (0) - 1 freq
ceri (2) - 1 freq
her- (2) - 1 freq
cern (2) - 1 freq
cert (2) - 1 freq
cer (2) - 1 freq
ger- (2) - 1 freq
cerd (2) - 7 freq
ceres (3) - 11 freq
curl (3) - 12 freq
cr (3) - 8 freq
card (3) - 61 freq
curb (3) - 1 freq
caers (3) - 1 freq
c- (3) - 2 freq
cora (3) - 3 freq
caer (3) - 5 freq
corp (3) - 51 freq
cure (3) - 54 freq
cart (3) - 8 freq
cory (3) - 2 freq
carp (3) - 1 freq
curd (3) - 5 freq
sor- (3) - 1 freq
cun- (3) - 1 freq
SoundEx code - C600
chair - 219 freq
car - 413 freq
cry - 369 freq
cairry - 212 freq
caur - 125 freq
care - 465 freq
cheer - 154 freq
chyre - 6 freq
cairrie - 35 freq
chureh - 1 freq
crew - 111 freq
cur - 6 freq
cower - 12 freq
cairy - 16 freq
cheery - 43 freq
coorie - 29 freq
crewe - 2 freq
choir - 16 freq
core - 54 freq
curry - 32 freq
cheerio - 65 freq
carry - 45 freq
cairie - 4 freq
cherry - 25 freq
cure - 54 freq
courie - 6 freq
crie - 7 freq
cherr - 2 freq
caur' - 1 freq
coor - 4 freq
corrie - 55 freq
ceri - 1 freq
craw - 67 freq
'care - 1 freq
craa - 40 freq
cherie - 2 freq
cerry - 7 freq
'cairrie - 1 freq
cheerie - 11 freq
craw-ye - 1 freq
cair - 3 freq
croo - 7 freq
ceoor - 1 freq
cra - 2 freq
cheereeoo - 1 freq
car'y - 15 freq
crooooooo - 1 freq
crow - 10 freq
carey - 2 freq
cariy - 1 freq
chore - 11 freq
'cheerio - 4 freq
cherrie - 3 freq
care' - 2 freq
cory - 2 freq
'cairry - 2 freq
cora - 3 freq
core' - 1 freq
cor - 4 freq
cura - 1 freq
caer - 5 freq
chere - 1 freq
carrie - 11 freq
chawer - 2 freq
chairiie - 1 freq
ca'ry - 1 freq
crö - 4 freq
chair' - 2 freq
cher - 6 freq
'cheerio' - 3 freq
cuir - 2 freq
cairrieaw - 1 freq
cheir - 9 freq
'car - 1 freq
cre - 3 freq
coo'r - 1 freq
crya - 1 freq
chaer - 1 freq
cer - 1 freq
chorey - 1 freq
caeiro - 1 freq
cree - 3 freq
caar - 1 freq
cour - 1 freq
curia - 1 freq
courrie - 1 freq
char - 2 freq
cray - 1 freq
‘coeur - 1 freq
‘cheery - 1 freq
crye - 1 freq
cooer - 1 freq
cairo - 2 freq
crue - 1 freq
cheirie - 1 freq
cer- - 1 freq
cara - 3 freq
caurie - 2 freq
carr - 3 freq
caerry - 2 freq
’cry - 1 freq
carew - 1 freq
cawr - 3 freq
“carry - 1 freq
“craw - 3 freq
chiru - 1 freq
coar - 3 freq
'core' - 1 freq
“cheerio - 1 freq
chree - 3 freq
curie - 1 freq
coyr - 20 freq
cr - 8 freq
‘car’ - 1 freq
'ciar - 1 freq
chrhowe - 1 freq
chorie - 1 freq
coeur” - 1 freq
cría - 1 freq
corey - 1 freq
cheery” - 1 freq
cary - 1 freq
corri - 1 freq
'cry' - 1 freq
currie - 1 freq
csr - 1 freq
ckr - 1 freq
MetaPhone code - SR
sair - 786 freq
sure - 1001 freq
sorry - 501 freq
saur - 12 freq
sour - 7 freq
sir- - 4 freq
sir - 358 freq
soor - 96 freq
'sir' - 1 freq
'sorry - 14 freq
saire - 4 freq
sare - 30 freq
sairy - 5 freq
sore - 53 freq
seer - 39 freq
'sair - 2 freq
'sir - 7 freq
sairie - 22 freq
sur - 23 freq
sorr - 1 freq
wycer - 6 freq
suir - 11 freq
soarry - 4 freq
ceri - 1 freq
sorrow - 29 freq
sairrie - 3 freq
sorra - 43 freq
syria - 12 freq
soiree - 5 freq
x-ray - 7 freq
sarah - 40 freq
cerry - 7 freq
soir - 1 freq
sire - 8 freq
siura - 1 freq
ceoor - 1 freq
soary - 3 freq
soarey - 1 freq
soaree - 1 freq
sour' - 1 freq
soar - 10 freq
sear - 2 freq
sara - 16 freq
zero - 9 freq
'sure - 6 freq
sure' - 2 freq
surrey - 2 freq
sorr-ee - 1 freq
zerah - 1 freq
sorrie - 2 freq
sooer - 1 freq
zarah - 1 freq
sarry - 2 freq
sere - 3 freq
soarro - 3 freq
soaroo - 2 freq
sierra - 1 freq
sirrah - 1 freq
sar - 1 freq
serr - 9 freq
sorrae - 2 freq
zara - 3 freq
cer - 1 freq
suree - 1 freq
ser - 19 freq
suire - 2 freq
‘sorry - 11 freq
‘sure - 3 freq
sor- - 1 freq
sri - 11 freq
xr - 3 freq
cer- - 1 freq
“sorry - 10 freq
“sure - 1 freq
seer' - 1 freq
soeur - 1 freq
sru - 1 freq
“zero - 1 freq
'sare - 1 freq
sari - 3 freq
sre - 2 freq
“sairie - 1 freq
—sorry - 1 freq
‘sair - 2 freq
sr - 3 freq
soirée - 1 freq
zr - 2 freq
zrr - 1 freq
zrae - 1 freq
sair’ - 1 freq
“sorry - 1 freq
sir” - 1 freq
sarahw - 1 freq
seery - 2 freq
soory - 2 freq
'sarah - 1 freq
hzr - 1 freq
ssre - 1 freq
zry - 1 freq
‘sorry - 1 freq
siro - 1 freq
xre - 1 freq
señor - 1 freq
Manually curated - CER-
Time to execute Levenshtein function - 0.210259 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
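As an illustration of the definition, here is a minimal Python sketch of the distance using the standard two-row dynamic programming approach. It is not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty string costs i deletions
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # replace or keep
            ))
        previous = current
    return previous[-1]


print(levenshtein("cer-", "cerd"))   # one replacement
print(levenshtein("cer-", "clere"))  # one insertion plus one replacement
```

The two calls reproduce the distances listed above for cerd (1) and clere (2).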
Time to execute Double Levenshtein function - 0.370252 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, then adds the two distances together, giving double weight to consonants.
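The idea can be sketched in Python as below. This is a reading of the description, not the site's own code: the names `double_levenshtein` and `strip_vowels` are made up for the example, and treating only a, e, i, o, u as vowels is an assumption.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def strip_vowels(word: str) -> str:
    """Return the word with its vowels removed, e.g. 'cer-' becomes 'cr-'."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("cer-", "cerd"))   # 1 on full words + 1 without vowels
print(double_levenshtein("cer-", "ceres"))  # 2 on full words + 1 without vowels
```

Under these assumptions the sketch agrees with the Double Levenshtein results listed above, for example cerd (2) and ceres (3).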
Time to execute SoundEx function - 0.027565 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
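To show how spellings collapse to one code, here is a minimal Python sketch of the classic American Soundex rules (keep the first letter, map consonants to digits, skip repeats, pad to four characters). It is an illustration only; the site's own implementation may differ in details, and ignoring non-letter characters such as the hyphen is an assumption.

```python
# Consonant groups and their Soundex digits.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]  # assumption: drop everything else
    if not letters:
        return ""
    code = letters[0].upper()
    previous_digit = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit:
            if digit != previous_digit:
                code += digit
            previous_digit = digit
        elif ch not in "hw":
            # Vowels (and y) separate repeated codes; h and w do not.
            previous_digit = ""
        if len(code) == 4:
            break
    return (code + "000")[:4]


for word in ("cer-", "chair", "cheerio", "ckr"):
    print(word, soundex(word))
```

Each of the four words in the loop comes out as C600, the code shown in the SoundEx results above.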
Time to execute MetaPhone function - 0.037155 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
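A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is hypothetical: the table name `LEXICON`, the function `lookup_lemma`, the lowercase keys and the sample entries are placeholders for illustration, not the contents of the real lexicon.

```python
from typing import Optional

# Hypothetical sample entries; the real hand-built lexicon is not reproduced here.
LEXICON = {
    "cer-": "CER-",
    "example-variant": "EXAMPLE-LEMMA",
    "exampel-variant": "EXAMPLE-LEMMA",  # a typo mapped to the same lemma
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for a word, or None when the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("cer-"))
print(lookup_lemma("not-in-the-table"))  # uncovered words give None
```

A dictionary lookup takes constant time on average, which fits with this method having the smallest execution time of the five reported above.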