A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to o in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
o (0) - 56605 freq
(1) - 4 freq
on (1) - 18872 freq
(1) - 3 freq
(1) - 3 freq
ot (1) - 17 freq
(1) - 1 freq
(1) - 1 freq
(1) - 3 freq
uo (1) - 4 freq
(1) - 2 freq
of (1) - 4117 freq
(1) - 20 freq
(1) - 4 freq
os (1) - 11 freq
v (1) - 266 freq
to (1) - 4164 freq
k (1) - 205 freq
no (1) - 10287 freq
(1) - 1 freq
(1) - 4 freq
(1) - 2 freq
(1) - 1 freq
(1) - 3 freq
d (1) - 462 freq
o (0) - 56605 freq
oe (1) - 13 freq
oy (1) - 5 freq
yo (1) - 15 freq
ou (1) - 17 freq
u (1) - 523 freq
oi (1) - 18 freq
eo (1) - 6 freq
y (1) - 155 freq
uo (1) - 4 freq
a (1) - 92602 freq
e (1) - 4634 freq
i (1) - 18870 freq
oo (1) - 422 freq
ao (1) - 1 freq
io (1) - 7 freq
oa (1) - 14 freq
z (2) - 119 freq
c (2) - 465 freq
od (2) - 8 freq
(2) - 3 freq
ai (2) - 30 freq
(2) - 2 freq
ov (2) - 59 freq
lo (2) - 32 freq
SoundEx code - O000
o - 56605 freq
oh - 1188 freq
o' - 2284 freq
'oh - 123 freq
'o - 35 freq
oo - 422 freq
owe - 26 freq
oe - 13 freq
ow - 12 freq
ou - 17 freq
ooh - 37 freq
'oi - 1 freq
'oo - 10 freq
ooooooooh - 1 freq
'ohhhhhh - 2 freq
ohhhh - 1 freq
ohhhhh-hoooooo - 2 freq
'ohhhhh-hoooooo' - 1 freq
'ooooooh - 1 freq
oi - 18 freq
oooh - 8 freq
oh-oh - 1 freq
'oy - 1 freq
oa - 14 freq
oui - 5 freq
owey - 1 freq
ohhh - 5 freq
ohh - 3 freq
o- - 1 freq
oy - 5 freq
oa' - 1 freq
'o' - 33 freq
'oo' - 7 freq
'ooo' - 1 freq
ooow - 1 freq
oow - 6 freq
oo' - 17 freq
ooo - 23 freq
o'' - 4 freq
ooie - 1 freq
oohya - 1 freq
ooooh - 7 freq
ooh-ooaaaaah - 1 freq
oaaah-oaaah - 1 freq
oo'ae - 1 freq
o'a - 4 freq
- 1 freq
'ow-w-w - 1 freq
owa - 1 freq
oooo - 4 freq
ooy - 1 freq
oye - 1 freq
ooooo - 2 freq
oweeeee - 1 freq
'oweee - 2 freq
'oa' - 1 freq
'ow - 3 freq
'oh' - 1 freq
ohe - 1 freq
oooooh - 3 freq
oo-oo - 3 freq
oooooo - 1 freq
oh - 23 freq
oh - 91 freq
oi - 2 freq
o - 6 freq
oh - 2 freq
o - 7 freq
oooooooooo - 4 freq
oa - 2 freq
oo - 2 freq
ohio - 3 freq
oo - 3 freq
oh-a - 1 freq
oww - 1 freq
oioo - 1 freq
ooooo - 1 freq
o - 5 freq
ooo - 1 freq
ow-ow-ow-ow-oooyah - 1 freq
ooie - 1 freq
ou - 1 freq
ow - 1 freq
ou - 1 freq
oh - 4 freq
- 73 freq
ooohhh - 1 freq
“o - 1 freq
oee - 1 freq
-o - 1 freq
- 1 freq
oooooooahhhhhhhh - 1 freq
owieeeeee - 1 freq
‘ow’ - 1 freq
“oh - 4 freq
- 1 freq
oohhh - 2 freq
‘o - 1 freq
oweeee - 1 freq
ouh - 1 freq
ooa - 1 freq
oooooooh - 1 freq
ooohh - 1 freq
MetaPhone code - O
o - 56605 freq
oh - 1188 freq
o' - 2284 freq
'oh - 123 freq
'o - 35 freq
oo - 422 freq
oe - 13 freq
ow - 12 freq
ou - 17 freq
ooh - 37 freq
'oi - 1 freq
'oo - 10 freq
ooooooooh - 1 freq
'ohhhhhh - 2 freq
ohhhh - 1 freq
'ooooooh - 1 freq
oi - 18 freq
oooh - 8 freq
oh-oh - 1 freq
'oy - 1 freq
oa - 14 freq
oui - 5 freq
ohhh - 5 freq
ohh - 3 freq
o- - 1 freq
oy - 5 freq
oa' - 1 freq
'o' - 33 freq
'oo' - 7 freq
'ooo' - 1 freq
ooow - 1 freq
oow - 6 freq
oo' - 17 freq
ooo - 23 freq
o'' - 4 freq
ooie - 1 freq
ooooh - 7 freq
ooh-ooaaaaah - 1 freq
oaaah-oaaah - 1 freq
oo'ae - 1 freq
o'a - 4 freq
- 1 freq
'ow-w-w - 1 freq
oooo - 4 freq
ooy - 1 freq
ooooo - 2 freq
'oa' - 1 freq
'ow - 3 freq
'oh' - 1 freq
oooooh - 3 freq
oo-oo - 3 freq
oooooo - 1 freq
oh - 23 freq
oh - 91 freq
oi - 2 freq
o - 6 freq
oh - 2 freq
o - 7 freq
oooooooooo - 4 freq
oa - 2 freq
oo - 2 freq
oo - 3 freq
oh-a - 1 freq
oww - 1 freq
oioo - 1 freq
ooooo - 1 freq
o - 5 freq
ooo - 1 freq
ooie - 1 freq
ou - 1 freq
ow - 1 freq
ou - 1 freq
oh - 4 freq
- 73 freq
ooohhh - 1 freq
“o - 1 freq
oee - 1 freq
-o - 1 freq
- 1 freq
oooooooahhhhhhhh - 1 freq
‘ow’ - 1 freq
“oh - 4 freq
- 1 freq
oohhh - 2 freq
‘o - 1 freq
ouh - 1 freq
ooa - 1 freq
oooooooh - 1 freq
ooohh - 1 freq
O
o - 56605 freq
of - 4117 freq
ae - 5555 freq
o' - 2284 freq
Time to execute Levenshtein function - 0.171506 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.326711 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028905 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038179 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.004990 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.