A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to zoe in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
zoe (0) - 4 freq
soe (1) - 1 freq
yoe (1) - 4 freq
zob (1) - 1 freq
zfe (1) - 1 freq
poe (1) - 114 freq
zo (1) - 5 freq
voe (1) - 26 freq
foe (1) - 9 freq
coe (1) - 1 freq
woe (1) - 9 freq
zoo (1) - 40 freq
roe (1) - 15 freq
loe (1) - 15 freq
zse (1) - 1 freq
ze (1) - 4 freq
zqe (1) - 1 freq
doe (1) - 6 freq
toe (1) - 21 freq
moe (1) - 2 freq
oe (1) - 13 freq
goe (1) - 1 freq
joe (1) - 105 freq
zje (1) - 1 freq
zone (1) - 21 freq
zoe (0) - 4 freq
zoo (1) - 40 freq
ze (1) - 4 freq
zo (1) - 5 freq
zyo (2) - 1 freq
ooze (2) - 2 freq
zone (2) - 21 freq
hoe (2) - 2 freq
oz (2) - 14 freq
eze (2) - 1 freq
zje (2) - 1 freq
ozo (2) - 1 freq
zi (2) - 3 freq
uzo (2) - 1 freq
za (2) - 3 freq
ize (2) - 1 freq
zu (2) - 4 freq
z (2) - 119 freq
zio (2) - 1 freq
yoze (2) - 1 freq
aze (2) - 2 freq
joe (2) - 105 freq
zy (2) - 4 freq
soe (2) - 1 freq
zfe (2) - 1 freq
SoundEx code - Z000
zoo - 40 freq
zha - 1 freq
z - 119 freq
zoe - 4 freq
zz - 1 freq
'z' - 1 freq
z - 3 freq
zi - 3 freq
zu - 4 freq
z - 30 freq
zc - 3 freq
zcxz - 1 freq
zw - 2 freq
zs - 5 freq
ze - 4 freq
zy - 4 freq
zg - 6 freq
za - 3 freq
zk - 3 freq
zq - 6 freq
zh - 1 freq
zio - 1 freq
zkjg - 1 freq
zzzs - 1 freq
zz - 6 freq
zzg - 1 freq
zo - 5 freq
zzz's - 1 freq
zzzzzzzz's - 1 freq
zzzzzz - 1 freq
zsk - 1 freq
zzzzzzzz - 1 freq
zx - 1 freq
ziw - 1 freq
zgz - 1 freq
zqco - 1 freq
zyo - 1 freq
zje - 1 freq
zj - 1 freq
zss - 1 freq
zse - 1 freq
zkqx - 1 freq
zqe - 1 freq
MetaPhone code - S
see - 5831 freq
say - 3637 freq
sae - 4643 freq
'sae - 49 freq
saw - 1024 freq
'so - 46 freq
sea - 833 freq
so - 4266 freq
'see - 36 freq
saa - 360 freq
s - 791 freq
hyze - 2 freq
s'a - 7 freq
's - 76 freq
ss - 22 freq
s'aw - 2 freq
x - 544 freq
sea- - 5 freq
s' - 4 freq
xi - 13 freq
xii - 10 freq
wyse - 7 freq
soe - 1 freq
yyyaaaassss - 1 freq
sie - 41 freq
wyce - 66 freq
wyss - 1 freq
xiu - 1 freq
xiii - 3 freq
xx - 79 freq
xxi - 1 freq
xxii - 1 freq
xxiii - 1 freq
si - 41 freq
sah - 2 freq
si' - 1 freq
'x - 2 freq
soo - 28 freq
sey - 62 freq
see' - 4 freq
-sae - 1 freq
és - 1 freq
sow - 17 freq
sea' - 4 freq
''s - 1 freq
zoo - 40 freq
sue - 12 freq
sa - 61 freq
ce - 7 freq
se - 90 freq
sou - 6 freq
sahha - 1 freq
y'se - 3 freq
sce - 3 freq
ys - 4 freq
sshh - 4 freq
y'see - 1 freq
cih - 2 freq
soa - 3 freq
hce - 1 freq
'sea - 3 freq
sw - 3 freq
ceo - 3 freq
'say - 7 freq
ws - 6 freq
'°s - 1 freq
hs - 5 freq
sei - 94 freq
'sie - 1 freq
z - 119 freq
say' - 3 freq
'saw - 1 freq
sooo - 5 freq
zoe - 4 freq
suy - 1 freq
ïs - 260 freq
hïs - 331 freq
sa' - 2 freq
saiw - 1 freq
sai - 1 freq
zz - 1 freq
ös - 20 freq
'z' - 1 freq
'saw' - 1 freq
'-s' - 1 freq
say'i - 1 freq
xxxxx - 1 freq
sew - 9 freq
sae' - 1 freq
-so - 1 freq
ssshh - 1 freq
xx' - 1 freq
- 3 freq
xxx - 486 freq
xxxx - 3 freq
soy - 2 freq
sse - 27 freq
øse - 11 freq
z - 3 freq
s - 1 freq
zi - 3 freq
ci - 4 freq
-s - 1 freq
x' - 1 freq
'say' - 1 freq
øs - 5 freq
¢s - 2 freq
see- - 1 freq
saie - 1 freq
zu - 4 freq
süß - 1 freq
so' - 2 freq
su - 8 freq
s - 7757 freq
sae - 2 freq
see - 1 freq
'see' - 1 freq
s - 11 freq
say - 4 freq
see - 7 freq
s - 2 freq
sss - 1 freq
hsae - 1 freq
z - 30 freq
se - 4 freq
sae - 20 freq
so - 20 freq
sea - 1 freq
see - 7 freq
so - 1 freq
seo - 1 freq
so - 15 freq
s - 1 freq
say - 1 freq
ssshh - 1 freq
sci - 1 freq
hsi - 2 freq
ssh - 1 freq
ssh - 1 freq
's - 2 freq
saw - 2 freq
s - 19 freq
sua - 1 freq
si - 4 freq
sse - 1 freq
sa - 3 freq
se - 1 freq
so - 7 freq
sa - 1 freq
wse - 1 freq
zw - 2 freq
xh - 6 freq
‘s - 4 freq
xe - 4 freq
ze - 4 freq
zy - 4 freq
za - 3 freq
xo - 10 freq
xu - 3 freq
ysi - 1 freq
ssshhhhhh - 2 freq
“cei - 1 freq
cei - 1 freq
hz - 4 freq
zh - 1 freq
zio - 1 freq
soae - 1 freq
sshhhhh - 1 freq
’s - 9 freq
soooooo - 2 freq
sooooooooo - 1 freq
zz - 6 freq
zo - 5 freq
wci - 2 freq
xw - 4 freq
h's - 2 freq
soooo - 3 freq
zzzzzz - 1 freq
soooooooo - 1 freq
soooòoo - 1 freq
£'s - 1 freq
zzzzzzzz - 1 freq
seaaa - 1 freq
sooooooo - 2 freq
xua - 2 freq
ysw - 1 freq
“sae - 1 freq
“ce - 1 freq
são - 1 freq
xohw - 1 freq
ziw - 1 freq
hss - 1 freq
xa - 3 freq
syy - 1 freq
cy - 4 freq
ysoe - 1 freq
ssyy - 1 freq
xy - 1 freq
siu - 1 freq
sssa - 8 freq
xyyu - 1 freq
s'aww - 1 freq
yz - 1 freq
xxhw - 1 freq
xihy - 1 freq
wza - 1 freq
cea - 1 freq
ciu - 1 freq
hyz - 2 freq
seee - 1 freq
s-e - 1 freq
xxx… - 1 freq
cee - 2 freq
ZOE
Time to execute Levenshtein function - 0.307041 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.662657 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.057587 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.095438 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.