A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to zoo in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
zoo (0) - 49 freq
soo (1) - 29 freq
joo (1) - 3 freq
moo (1) - 175 freq
zob (1) - 1 freq
coo (1) - 190 freq
yoo (1) - 14 freq
zoe (1) - 4 freq
zo (1) - 5 freq
ioo (1) - 1 freq
ooo (1) - 23 freq
zto (1) - 1 freq
zio (1) - 1 freq
voo (1) - 3 freq
goo (1) - 5 freq
zlo (1) - 1 freq
loo (1) - 29 freq
doo (1) - 126 freq
zyo (1) - 1 freq
zoot (1) - 2 freq
'oo (1) - 10 freq
hoo (1) - 1076 freq
poo (1) - 18 freq
zoos (1) - 2 freq
foo (1) - 753 freq
zoo (0) - 49 freq
zoe (1) - 4 freq
zio (1) - 1 freq
zo (1) - 5 freq
zyo (1) - 1 freq
zool (2) - 7 freq
oo (2) - 422 freq
noo (2) - 5800 freq
too (2) - 1030 freq
boo (2) - 58 freq
zoom (2) - 19 freq
roo (2) - 79 freq
woo (2) - 19 freq
yooz (2) - 12 freq
oz (2) - 15 freq
uzo (2) - 1 freq
zy (2) - 4 freq
zi (2) - 3 freq
ozo (2) - 1 freq
foo (2) - 753 freq
zu (2) - 4 freq
za (2) - 3 freq
ze (2) - 4 freq
z (2) - 119 freq
yoo (2) - 14 freq
SoundEx code - Z000
zoo - 49 freq
zha - 1 freq
z - 119 freq
zoe - 4 freq
zz - 1 freq
'z' - 1 freq
z - 3 freq
zi - 3 freq
zu - 4 freq
z - 30 freq
zc - 3 freq
zcxz - 1 freq
zw - 2 freq
zs - 5 freq
ze - 4 freq
zy - 4 freq
zg - 6 freq
za - 3 freq
zk - 3 freq
zq - 6 freq
zh - 1 freq
zio - 1 freq
zkjg - 1 freq
zzzs - 1 freq
zz - 6 freq
zzg - 1 freq
zo - 5 freq
zzz's - 1 freq
zzzzzzzz's - 1 freq
zzzzzz - 1 freq
zsk - 1 freq
zzzzzzzz - 1 freq
zx - 1 freq
ziw - 1 freq
zgz - 1 freq
zqco - 1 freq
zyo - 1 freq
zje - 1 freq
zj - 1 freq
zss - 1 freq
zse - 1 freq
zkqx - 1 freq
zqe - 1 freq
MetaPhone code - S
see - 5912 freq
say - 3705 freq
sae - 4672 freq
'sae - 49 freq
saw - 1038 freq
'so - 48 freq
sea - 853 freq
so - 4382 freq
'see - 36 freq
saa - 360 freq
s - 793 freq
hyze - 2 freq
s'a - 7 freq
's - 78 freq
ss - 20 freq
s'aw - 2 freq
x - 544 freq
sea- - 5 freq
s' - 4 freq
xi - 13 freq
xii - 10 freq
wyse - 8 freq
soe - 1 freq
yyyaaaassss - 1 freq
sie - 41 freq
wyce - 68 freq
wyss - 1 freq
xiu - 1 freq
xiii - 3 freq
xx - 79 freq
xxi - 1 freq
xxii - 1 freq
xxiii - 1 freq
si - 43 freq
sah - 2 freq
si' - 1 freq
'x - 2 freq
soo - 29 freq
sey - 62 freq
see' - 4 freq
-sae - 1 freq
és - 1 freq
sow - 17 freq
sea' - 4 freq
''s - 1 freq
zoo - 49 freq
sue - 12 freq
sa - 62 freq
ce - 7 freq
se - 90 freq
sou - 6 freq
sahha - 1 freq
y'se - 3 freq
sce - 3 freq
ys - 4 freq
sshh - 4 freq
y'see - 1 freq
cih - 2 freq
soa - 3 freq
hce - 1 freq
'sea - 3 freq
sw - 3 freq
w's - 1 freq
'say - 8 freq
ceo - 3 freq
ws - 6 freq
'°s - 1 freq
hs - 5 freq
sei - 94 freq
'sie - 1 freq
z - 119 freq
say' - 3 freq
'saw - 1 freq
sooo - 5 freq
zoe - 4 freq
suy - 1 freq
ïs - 260 freq
hïs - 331 freq
sa' - 2 freq
saiw - 1 freq
sai - 1 freq
zz - 1 freq
ös - 20 freq
'z' - 1 freq
'saw' - 1 freq
'-s' - 1 freq
say'i - 1 freq
xxxxx - 1 freq
sew - 9 freq
sae' - 1 freq
-so - 1 freq
ssshh - 1 freq
xx' - 1 freq
- 3 freq
xxx - 486 freq
xxxx - 3 freq
soy - 2 freq
sse - 27 freq
øse - 11 freq
z - 3 freq
s - 1 freq
zi - 3 freq
ci - 4 freq
-s - 1 freq
x' - 1 freq
'say' - 1 freq
øs - 5 freq
¢s - 2 freq
see- - 1 freq
saie - 1 freq
zu - 4 freq
süß - 1 freq
so' - 2 freq
su - 8 freq
s - 7783 freq
sae - 2 freq
see - 1 freq
'see' - 1 freq
s - 11 freq
say - 4 freq
see - 7 freq
s - 2 freq
sss - 1 freq
hsae - 1 freq
z - 30 freq
se - 4 freq
sae - 20 freq
so - 20 freq
sea - 1 freq
see - 8 freq
so - 1 freq
seo - 1 freq
so - 15 freq
s - 1 freq
say - 1 freq
ssshh - 1 freq
sci - 1 freq
hsi - 2 freq
ssh - 1 freq
ssh - 1 freq
's - 2 freq
saw - 2 freq
s - 19 freq
sua - 1 freq
si - 4 freq
sse - 1 freq
sa - 3 freq
se - 1 freq
so - 7 freq
sa - 1 freq
wse - 1 freq
zw - 2 freq
xh - 6 freq
‘s - 4 freq
xe - 4 freq
ze - 4 freq
zy - 4 freq
za - 3 freq
xo - 10 freq
xu - 3 freq
ysi - 1 freq
ssshhhhhh - 2 freq
“cei - 1 freq
cei - 1 freq
hz - 4 freq
zh - 1 freq
zio - 1 freq
soae - 1 freq
sshhhhh - 1 freq
’s - 9 freq
soooooo - 2 freq
sooooooooo - 1 freq
zz - 6 freq
zo - 5 freq
wci - 2 freq
xw - 4 freq
h's - 2 freq
soooo - 3 freq
zzzzzz - 1 freq
soooooooo - 1 freq
soooòoo - 1 freq
£'s - 1 freq
zzzzzzzz - 1 freq
seaaa - 1 freq
sooooooo - 2 freq
xua - 2 freq
ysw - 1 freq
“sae - 1 freq
“ce - 1 freq
são - 1 freq
xohw - 1 freq
ziw - 1 freq
hss - 1 freq
xa - 3 freq
syy - 1 freq
cy - 4 freq
ysoe - 1 freq
ssyy - 1 freq
xy - 1 freq
siu - 1 freq
sssa - 8 freq
xyyu - 1 freq
s'aww - 1 freq
yz - 1 freq
xxhw - 1 freq
xihy - 1 freq
wza - 1 freq
cea - 1 freq
ciu - 1 freq
hyz - 2 freq
seee - 1 freq
s-e - 1 freq
xxx… - 1 freq
cee - 2 freq
ZOO
Time to execute Levenshtein function - 0.698489 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.187618 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027509 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.175355 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001109 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.