A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to zy in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
zy (0) - 4 freq
zx (1) - 1 freq
dy (1) - 236 freq
zm (1) - 1 freq
jy (1) - 2 freq
zv (1) - 3 freq
cy (1) - 4 freq
py (1) - 3 freq
zj (1) - 1 freq
zt (1) - 2 freq
ny (1) - 8 freq
zr (1) - 2 freq
zc (1) - 3 freq
zk (1) - 3 freq
iy (1) - 3 freq
zys (1) - 1 freq
wy (1) - 11 freq
ry (1) - 7 freq
vy (1) - 3 freq
y (1) - 155 freq
z (1) - 119 freq
-y (1) - 6 freq
zyo (1) - 1 freq
zb (1) - 3 freq
zw (1) - 2 freq
Double Levenshtein
zy (0) - 4 freq
ze (1) - 4 freq
zyo (1) - 1 freq
za (1) - 3 freq
zu (1) - 4 freq
zi (1) - 3 freq
z (1) - 119 freq
zo (1) - 5 freq
xy (2) - 1 freq
dy (2) - 236 freq
oy (2) - 5 freq
zl (2) - 3 freq
zg (2) - 6 freq
ozo (2) - 1 freq
az (2) - 1 freq
ay (2) - 2480 freq
eze (2) - 1 freq
zp (2) - 6 freq
my (2) - 2946 freq
zm (2) - 1 freq
yy (2) - 1 freq
by (2) - 4567 freq
ey (2) - 141 freq
uy (2) - 4 freq
zj (2) - 1 freq
SoundEx code - Z000
zoo - 40 freq
zha - 1 freq
z - 119 freq
zoe - 4 freq
zz - 1 freq
'z' - 1 freq
z - 3 freq
zi - 3 freq
zu - 4 freq
z - 30 freq
zc - 3 freq
zcxz - 1 freq
zw - 2 freq
zs - 5 freq
ze - 4 freq
zy - 4 freq
zg - 6 freq
za - 3 freq
zk - 3 freq
zq - 6 freq
zh - 1 freq
zio - 1 freq
zkjg - 1 freq
zzzs - 1 freq
zz - 6 freq
zzg - 1 freq
zo - 5 freq
zzz's - 1 freq
zzzzzzzz's - 1 freq
zzzzzz - 1 freq
zsk - 1 freq
zzzzzzzz - 1 freq
zx - 1 freq
ziw - 1 freq
zgz - 1 freq
zqco - 1 freq
zyo - 1 freq
zje - 1 freq
zj - 1 freq
zss - 1 freq
zse - 1 freq
zkqx - 1 freq
zqe - 1 freq
MetaPhone code - S
see - 5882 freq
say - 3683 freq
sae - 4643 freq
'sae - 49 freq
saw - 1028 freq
'so - 47 freq
sea - 844 freq
so - 4339 freq
'see - 36 freq
saa - 360 freq
s - 793 freq
hyze - 2 freq
s'a - 7 freq
's - 78 freq
ss - 20 freq
s'aw - 2 freq
x - 544 freq
sea- - 5 freq
s' - 4 freq
xi - 13 freq
xii - 10 freq
wyse - 7 freq
soe - 1 freq
yyyaaaassss - 1 freq
sie - 41 freq
wyce - 66 freq
wyss - 1 freq
xiu - 1 freq
xiii - 3 freq
xx - 79 freq
xxi - 1 freq
xxii - 1 freq
xxiii - 1 freq
si - 41 freq
sah - 2 freq
si' - 1 freq
'x - 2 freq
soo - 28 freq
sey - 62 freq
see' - 4 freq
-sae - 1 freq
és - 1 freq
sow - 17 freq
sea' - 4 freq
''s - 1 freq
zoo - 40 freq
sue - 12 freq
sa - 62 freq
ce - 7 freq
se - 90 freq
sou - 6 freq
sahha - 1 freq
y'se - 3 freq
sce - 3 freq
ys - 4 freq
sshh - 4 freq
y'see - 1 freq
cih - 2 freq
soa - 3 freq
hce - 1 freq
'sea - 3 freq
sw - 3 freq
w's - 1 freq
'say - 8 freq
ceo - 3 freq
ws - 6 freq
'°s - 1 freq
hs - 5 freq
sei - 94 freq
'sie - 1 freq
z - 119 freq
say' - 3 freq
'saw - 1 freq
sooo - 5 freq
zoe - 4 freq
suy - 1 freq
ïs - 260 freq
hïs - 331 freq
sa' - 2 freq
saiw - 1 freq
sai - 1 freq
zz - 1 freq
ös - 20 freq
'z' - 1 freq
'saw' - 1 freq
'-s' - 1 freq
say'i - 1 freq
xxxxx - 1 freq
sew - 9 freq
sae' - 1 freq
-so - 1 freq
ssshh - 1 freq
xx' - 1 freq
- 3 freq
xxx - 486 freq
xxxx - 3 freq
soy - 2 freq
sse - 27 freq
øse - 11 freq
z - 3 freq
s - 1 freq
zi - 3 freq
ci - 4 freq
-s - 1 freq
x' - 1 freq
'say' - 1 freq
øs - 5 freq
¢s - 2 freq
see- - 1 freq
saie - 1 freq
zu - 4 freq
süß - 1 freq
so' - 2 freq
su - 8 freq
s - 7757 freq
sae - 2 freq
see - 1 freq
'see' - 1 freq
s - 11 freq
say - 4 freq
see - 7 freq
s - 2 freq
sss - 1 freq
hsae - 1 freq
z - 30 freq
se - 4 freq
sae - 20 freq
so - 20 freq
sea - 1 freq
see - 7 freq
so - 1 freq
seo - 1 freq
so - 15 freq
s - 1 freq
say - 1 freq
ssshh - 1 freq
sci - 1 freq
hsi - 2 freq
ssh - 1 freq
ssh - 1 freq
's - 2 freq
saw - 2 freq
s - 19 freq
sua - 1 freq
si - 4 freq
sse - 1 freq
sa - 3 freq
se - 1 freq
so - 7 freq
sa - 1 freq
wse - 1 freq
zw - 2 freq
xh - 6 freq
‘s - 4 freq
xe - 4 freq
ze - 4 freq
zy - 4 freq
za - 3 freq
xo - 10 freq
xu - 3 freq
ysi - 1 freq
ssshhhhhh - 2 freq
“cei - 1 freq
cei - 1 freq
hz - 4 freq
zh - 1 freq
zio - 1 freq
soae - 1 freq
sshhhhh - 1 freq
’s - 9 freq
soooooo - 2 freq
sooooooooo - 1 freq
zz - 6 freq
zo - 5 freq
wci - 2 freq
xw - 4 freq
h's - 2 freq
soooo - 3 freq
zzzzzz - 1 freq
soooooooo - 1 freq
soooòoo - 1 freq
£'s - 1 freq
zzzzzzzz - 1 freq
seaaa - 1 freq
sooooooo - 2 freq
xua - 2 freq
ysw - 1 freq
“sae - 1 freq
“ce - 1 freq
são - 1 freq
xohw - 1 freq
ziw - 1 freq
hss - 1 freq
xa - 3 freq
syy - 1 freq
cy - 4 freq
ysoe - 1 freq
ssyy - 1 freq
xy - 1 freq
siu - 1 freq
sssa - 8 freq
xyyu - 1 freq
s'aww - 1 freq
yz - 1 freq
xxhw - 1 freq
xihy - 1 freq
wza - 1 freq
cea - 1 freq
ciu - 1 freq
hyz - 2 freq
seee - 1 freq
s-e - 1 freq
xxx… - 1 freq
cee - 2 freq
ZY
Time to execute Levenshtein function - 0.193548 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
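As a rough illustration of the distance described here, the following is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # prev[j] is the distance between the empty prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # replace ca with cb, or match
            ))
        prev = cur
    return prev[-1]


for neighbour in ("zy", "zx", "y", "zys"):
    print(neighbour, levenshtein("zy", neighbour))
```

With the query word zy, this gives 0 for zy itself and 1 for zx, y and zys, in line with the scores listed for those words.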
Time to execute Double Levenshtein function - 0.345542 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
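The doubled calculation can be sketched in Python as below. This is an illustration under assumptions, not the site's implementation: the names `strip_vowels` and `double_levenshtein` are invented for the example, and y is treated as a vowel because that is consistent with the scores listed for zy (for instance ze scoring 1 and xy scoring 2).

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving only the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for neighbour in ("ze", "xy", "ozo", "zl"):
    print(neighbour, double_levenshtein("zy", neighbour))
```

A change of vowel costs 1 (ze), while a change of consonant costs 2 (xy, zl), because the consonant difference is counted in both passes.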
Time to execute SoundEx function - 0.036692 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
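For readers who want to see how a code such as Z000 arises, here is a minimal Python sketch of the standard American Soundex rules. It is an assumption that the page uses these rules; the function name `soundex` and the handling of non-letters (they are skipped) are choices made for the example.

```python
def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no letters."""
    # Consonant groups that share a digit; vowels, h, w and y have no digit.
    codes = {}
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in letters:
            codes[ch] = digit

    # Keep only the plain letters a-z, lower-cased.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    prev = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        code = codes.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated digit can recur
    # Pad with zeros and cut to one letter plus three digits.
    return (result + "000")[:4]


for w in ("zy", "zoo", "zcxz", "zkjg"):
    print(w, soundex(w))
```

All four words come out as Z000: the vowels carry no digit, and c, x, k, j and g share the digit 2 with the initial z, so they are collapsed into it.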
Time to execute MetaPhone function - 0.046815 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001165 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
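A lookup of this kind can be sketched in Python as a plain dictionary from word form to lemma. Everything in the example is hypothetical: the entries in `LEXICON` are invented for illustration and are not taken from the corpus lexicon, and the function names are assumptions.

```python
# Hypothetical entries for illustration only; not the corpus's real lexicon.
LEXICON = {
    "ahint": "ahint",
    "ahin": "ahint",
    "ahent": "ahint",
}


def lookup_lemma(word: str):
    """Return the lemma for a word form, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str):
    """List the other word forms that share a lemma with the given word."""
    lemma = lookup_lemma(word)
    if lemma is None:
        return []
    key = word.lower()
    return sorted(w for w, l in LEXICON.items() if l == lemma and w != key)


print(lookup_lemma("ahin"))
print(variants_of("ahin"))
print(lookup_lemma("zy"))
```

Because the table is a direct key lookup, it needs no distance calculation at all, which fits the very short execution time reported for this method. A word missing from the table simply returns no result.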