A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to xii in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in brackets
xii (0) - 10 freq
xi (1) - 13 freq
fii (1) - 1 freq
ii (1) - 70 freq
xiii (1) - 3 freq
iii (1) - 32 freq
xni (1) - 1 freq
xvii (1) - 1 freq
dii (1) - 1 freq
xci (1) - 1 freq
hii (1) - 1 freq
xji (1) - 1 freq
vii (1) - 16 freq
xvi (1) - 3 freq
xix (1) - 1 freq
xxi (1) - 1 freq
xiq (1) - 1 freq
tii (1) - 1 freq
xiv (1) - 4 freq
xiu (1) - 1 freq
wii (1) - 10 freq
xxii (1) - 1 freq
jik (2) - 1 freq
oki (2) - 5 freq
rid (2) - 313 freq
Double Levenshtein - distance in brackets
xii (0) - 10 freq
xiii (1) - 3 freq
xiu (1) - 1 freq
xi (1) - 13 freq
ix (2) - 15 freq
xa (2) - 3 freq
xu (2) - 3 freq
oxi (2) - 2 freq
ii (2) - 70 freq
xxii (2) - 1 freq
xy (2) - 1 freq
xo (2) - 10 freq
xe (2) - 4 freq
xua (2) - 2 freq
x (2) - 544 freq
aix (2) - 42 freq
xya (2) - 1 freq
xiv (2) - 4 freq
wii (2) - 10 freq
dii (2) - 1 freq
xci (2) - 1 freq
xvii (2) - 1 freq
xni (2) - 1 freq
fii (2) - 1 freq
iii (2) - 32 freq
SoundEx code - X000
x - 544 freq
xi - 13 freq
xii - 10 freq
xiu - 1 freq
xiii - 3 freq
xx - 79 freq
xxi - 1 freq
xxii - 1 freq
xxiii - 1 freq
'x - 2 freq
xxxxx - 1 freq
xx' - 1 freq
xxx - 486 freq
xxxx - 3 freq
x' - 1 freq
xja - 1 freq
xsyi - 1 freq
xz - 4 freq
xs - 2 freq
xsx - 1 freq
xh - 6 freq
xe - 4 freq
xzc - 1 freq
xsqzw - 1 freq
xo - 10 freq
xu - 3 freq
xwu - 1 freq
xsy - 1 freq
xkg - 1 freq
xci - 1 freq
xji - 1 freq
xc - 2 freq
xw - 4 freq
xgj - 1 freq
xxz - 1 freq
xzo - 1 freq
xzx - 1 freq
xwe - 1 freq
xcssa - 1 freq
xk - 1 freq
xua - 2 freq
xgkz - 1 freq
xohw - 1 freq
xju - 1 freq
xa - 3 freq
xje - 1 freq
xxxk - 1 freq
xya - 1 freq
xy - 1 freq
xq - 2 freq
xyyu - 1 freq
xjuiu - 1 freq
xqu - 2 freq
xxhw - 1 freq
xihy - 1 freq
xxg - 1 freq
xxx… - 1 freq
MetaPhone code - S
see - 5882 freq
say - 3683 freq
sae - 4643 freq
'sae - 49 freq
saw - 1028 freq
'so - 47 freq
sea - 844 freq
so - 4339 freq
'see - 36 freq
saa - 360 freq
s - 793 freq
hyze - 2 freq
s'a - 7 freq
's - 78 freq
ss - 20 freq
s'aw - 2 freq
x - 544 freq
sea- - 5 freq
s' - 4 freq
xi - 13 freq
xii - 10 freq
wyse - 7 freq
soe - 1 freq
yyyaaaassss - 1 freq
sie - 41 freq
wyce - 66 freq
wyss - 1 freq
xiu - 1 freq
xiii - 3 freq
xx - 79 freq
xxi - 1 freq
xxii - 1 freq
xxiii - 1 freq
si - 41 freq
sah - 2 freq
si' - 1 freq
'x - 2 freq
soo - 28 freq
sey - 62 freq
see' - 4 freq
-sae - 1 freq
és - 1 freq
sow - 17 freq
sea' - 4 freq
''s - 1 freq
zoo - 40 freq
sue - 12 freq
sa - 62 freq
ce - 7 freq
se - 90 freq
sou - 6 freq
sahha - 1 freq
y'se - 3 freq
sce - 3 freq
ys - 4 freq
sshh - 4 freq
y'see - 1 freq
cih - 2 freq
soa - 3 freq
hce - 1 freq
'sea - 3 freq
sw - 3 freq
w's - 1 freq
'say - 8 freq
ceo - 3 freq
ws - 6 freq
'°s - 1 freq
hs - 5 freq
sei - 94 freq
'sie - 1 freq
z - 119 freq
say' - 3 freq
'saw - 1 freq
sooo - 5 freq
zoe - 4 freq
suy - 1 freq
ïs - 260 freq
hïs - 331 freq
sa' - 2 freq
saiw - 1 freq
sai - 1 freq
zz - 1 freq
ös - 20 freq
'z' - 1 freq
'saw' - 1 freq
'-s' - 1 freq
say'i - 1 freq
xxxxx - 1 freq
sew - 9 freq
sae' - 1 freq
-so - 1 freq
ssshh - 1 freq
xx' - 1 freq
- 3 freq
xxx - 486 freq
xxxx - 3 freq
soy - 2 freq
sse - 27 freq
øse - 11 freq
z - 3 freq
s - 1 freq
zi - 3 freq
ci - 4 freq
-s - 1 freq
x' - 1 freq
'say' - 1 freq
øs - 5 freq
¢s - 2 freq
see- - 1 freq
saie - 1 freq
zu - 4 freq
süß - 1 freq
so' - 2 freq
su - 8 freq
s - 7757 freq
sae - 2 freq
see - 1 freq
'see' - 1 freq
s - 11 freq
say - 4 freq
see - 7 freq
s - 2 freq
sss - 1 freq
hsae - 1 freq
z - 30 freq
se - 4 freq
sae - 20 freq
so - 20 freq
sea - 1 freq
see - 7 freq
so - 1 freq
seo - 1 freq
so - 15 freq
s - 1 freq
say - 1 freq
ssshh - 1 freq
sci - 1 freq
hsi - 2 freq
ssh - 1 freq
ssh - 1 freq
's - 2 freq
saw - 2 freq
s - 19 freq
sua - 1 freq
si - 4 freq
sse - 1 freq
sa - 3 freq
se - 1 freq
so - 7 freq
sa - 1 freq
wse - 1 freq
zw - 2 freq
xh - 6 freq
‘s - 4 freq
xe - 4 freq
ze - 4 freq
zy - 4 freq
za - 3 freq
xo - 10 freq
xu - 3 freq
ysi - 1 freq
ssshhhhhh - 2 freq
“cei - 1 freq
cei - 1 freq
hz - 4 freq
zh - 1 freq
zio - 1 freq
soae - 1 freq
sshhhhh - 1 freq
’s - 9 freq
soooooo - 2 freq
sooooooooo - 1 freq
zz - 6 freq
zo - 5 freq
wci - 2 freq
xw - 4 freq
h's - 2 freq
soooo - 3 freq
zzzzzz - 1 freq
soooooooo - 1 freq
soooòoo - 1 freq
£'s - 1 freq
zzzzzzzz - 1 freq
seaaa - 1 freq
sooooooo - 2 freq
xua - 2 freq
ysw - 1 freq
“sae - 1 freq
“ce - 1 freq
são - 1 freq
xohw - 1 freq
ziw - 1 freq
hss - 1 freq
xa - 3 freq
syy - 1 freq
cy - 4 freq
ysoe - 1 freq
ssyy - 1 freq
xy - 1 freq
siu - 1 freq
sssa - 8 freq
xyyu - 1 freq
s'aww - 1 freq
yz - 1 freq
xxhw - 1 freq
xihy - 1 freq
wza - 1 freq
cea - 1 freq
ciu - 1 freq
hyz - 2 freq
seee - 1 freq
s-e - 1 freq
xxx… - 1 freq
cee - 2 freq
Manually curated - XII
Time to execute Levenshtein function - 0.209769 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
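To make the definition concrete, here is a minimal Python sketch of the standard dynamic-programming calculation. It is an illustration only and not necessarily how this site computes the distance; the function name `levenshtein` is an assumption chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character edits that turn a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a, or keep it if equal
            ))
        previous = current
    return previous[-1]


for word in ("xii", "xi", "xiii", "rid"):
    print(word, levenshtein("xii", word))
```

For these four words the sketch gives 0, 1, 1 and 2, the same distances shown in brackets in the Levenshtein listing for xii.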
Time to execute Double Levenshtein function - 0.322317 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
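A rough Python sketch of that idea follows. The names `double_levenshtein` and `strip_vowels` are assumptions made for the example, and so is the vowel set: treating y as a vowel is a guess that happens to reproduce the scores in the listing for xii (for instance xy and xya at 2), not something the page states.

```python
VOWELS = set("aeiouy")  # assumption: y counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("xiii", "ii", "xiv", "xy"):
    print(word, double_levenshtein("xii", word))
```

Under these assumptions xiii scores 1 (only a vowel differs), while ii, xiv and xy each score 2, so a changed consonant is penalised in both passes.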
Time to execute SoundEx function - 0.027964 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
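As an illustration, the sketch below implements the commonly described American Soundex rules in Python: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent letters with the same digit, and pad to four characters. It is a sketch under those assumptions and may differ in edge cases from the function this site actually calls; the name `soundex` is chosen for the example.

```python
def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no a-z letters."""
    codes = {}
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in letters:
            codes[ch] = digit

    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    prev = codes.get(letters[0], "")  # digit of the previous coded letter
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = codes.get(ch, "")  # vowels and y have no digit
        if digit and digit != prev:
            result += digit
        prev = digit  # a vowel resets prev, so a repeated digit can follow
    return (result + "000")[:4]


for word in ("xii", "xsx", "xohw", "Robert", "Rupert"):
    print(word, soundex(word))
```

With these rules xii, xsx and xohw all reduce to X000, the code shown in the SoundEx listing, because x, s, k, g and similar letters share the digit 2 with the initial x and are collapsed into it. Robert and Rupert both give R163.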
Time to execute MetaPhone function - 0.039937 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000900 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
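In code, a lookup of this kind can be as simple as a dictionary. The Python sketch below is hypothetical: the names `LEXICON` and `curated_lemma` are invented for the example, the pairing of xii with XII is taken from the result shown on this page, and the second entry is a made-up typo included only to show the shape of the table.

```python
from typing import Optional

# Hand-maintained table: surface form -> lemma.
LEXICON = {
    "xii": "XII",       # pairing shown in the results on this page
    "ahintt": "ahint",  # hypothetical typo entry, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("xii"))
print(curated_lemma("unlisted"))
```

Returning None for a missing word reflects the note above that not all words are covered. It also suggests why this method is the fastest of the five in the timings: it does a single table lookup rather than comparing against every word in the corpus.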