A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to ot in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ot (0) - 17 freq
dot (1) - 47 freq
vot (1) - 1 freq
o' (1) - 2284 freq
jot (1) - 13 freq
oo (1) - 422 freq
od (1) - 8 freq
qt (1) - 5 freq
oot (1) - 13916 freq
ov (1) - 59 freq
oq (1) - 4 freq
fot (1) - 6 freq
mot (1) - 8 freq
ft (1) - 19 freq
og (1) - 10 freq
on (1) - 18872 freq
kt (1) - 3 freq
or (1) - 9305 freq
o (1) - 56605 freq
sot (1) - 6 freq
oi (1) - 18 freq
ou (1) - 17 freq
rt (1) - 44 freq
oa (1) - 14 freq
oe (1) - 13 freq
ot (0) - 17 freq
oit (1) - 1 freq
out (1) - 790 freq
oto (1) - 1 freq
oat (1) - 12 freq
et (1) - 256 freq
oot (1) - 13916 freq
t (1) - 5648 freq
yt (1) - 12 freq
at (1) - 20437 freq
it (1) - 33301 freq
ut (1) - 9 freq
pt (2) - 5 freq
wot (2) - 4 freq
lot (2) - 800 freq
toe (2) - 22 freq
ate (2) - 117 freq
jt (2) - 20 freq
ct (2) - 8 freq
ta (2) - 2534 freq
ots (2) - 2 freq
pot (2) - 209 freq
hot (2) - 206 freq
ªt (2) - 6 freq
otq (2) - 2 freq
SoundEx code - O300
oot - 13916 freq
out - 790 freq
othe - 4 freq
o't - 277 freq
ooty - 21 freq
oot-d'ye - 1 freq
owt - 10 freq
oath - 13 freq
odd - 142 freq
'oot - 12 freq
ode - 13 freq
od't - 4 freq
ot - 17 freq
oat - 12 freq
o'd - 13 freq
ootae - 90 freq
-odd - 3 freq
oota - 54 freq
ootwi - 17 freq
oawthe - 1 freq
owed - 13 freq
owet - 1 freq
oit - 1 freq
ooadaa - 1 freq
owte - 2 freq
outta - 5 freq
od - 8 freq
oottae - 1 freq
ooto - 85 freq
oed - 6 freq
oot' - 4 freq
ootd - 2 freq
owd - 1 freq
o'tay - 1 freq
out' - 2 freq
'out - 2 freq
ootdae - 5 freq
o't' - 1 freq
owid - 7 freq
oatae - 1 freq
ott - 4 freq
oda - 5 freq
oot-o'-e-way - 2 freq
oot- - 1 freq
‘oot - 5 freq
oo'd - 1 freq
’out - 2 freq
out-the-wey - 1 freq
out-waw - 1 freq
“out - 1 freq
’oot - 45 freq
oot-the-wey - 1 freq
oeht - 1 freq
'oot' - 1 freq
odo - 1 freq
ootta - 2 freq
“oot - 1 freq
othha - 1 freq
oad - 2 freq
‘ootwi - 2 freq
‘ode - 1 freq
outty - 2 freq
oudey - 1 freq
oid - 1 freq
o'the - 7 freq
o’the - 8 freq
o'dee - 1 freq
o'at - 1 freq
oodie - 1 freq
outwi - 1 freq
oto - 1 freq
o’t - 2 freq
oot… - 1 freq
'ootwi' - 1 freq
outa - 6 freq
MetaPhone code - OT
oot - 13916 freq
out - 790 freq
o't - 277 freq
ooty - 21 freq
owt - 10 freq
odd - 142 freq
'oot - 12 freq
ode - 13 freq
ot - 17 freq
oat - 12 freq
o'd - 13 freq
ootae - 90 freq
-odd - 3 freq
oota - 54 freq
oit - 1 freq
ooadaa - 1 freq
owte - 2 freq
outta - 5 freq
od - 8 freq
oottae - 1 freq
ooto - 85 freq
oed - 6 freq
oot' - 4 freq
owd - 1 freq
o'tay - 1 freq
out' - 2 freq
'out - 2 freq
o't' - 1 freq
oatae - 1 freq
ott - 4 freq
oda - 5 freq
oot- - 1 freq
‘oot - 5 freq
oo'd - 1 freq
’out - 2 freq
“out - 1 freq
’oot - 45 freq
oeht - 1 freq
'oot' - 1 freq
odo - 1 freq
ootta - 2 freq
“oot - 1 freq
oad - 2 freq
‘ode - 1 freq
outty - 2 freq
oudey - 1 freq
oid - 1 freq
o'dee - 1 freq
o'at - 1 freq
oodie - 1 freq
oto - 1 freq
o’t - 2 freq
oot… - 1 freq
outa - 6 freq
OT
Time to execute Levenshtein function - 0.220222 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
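As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not necessarily how this site computes the distance; the function name levenshtein is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("ot", "oot"))  # one insertion
print(levenshtein("ot", "dot"))  # one insertion
```

Only one previous row is kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.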
Time to execute Double Levenshtein function - 0.354719 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
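The idea can be sketched in Python as follows. This is an assumption-laden illustration, not the site's code: the names double_levenshtein and strip_vowels are invented here, and the vowel set (a, e, i, o, u, y) is a guess, although it does reproduce the distances listed above for words such as oit, yt, pt and wot.

```python
VOWELS = set("aeiouy")  # assumed vowel set; the site's actual set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every assumed vowel, leaving the consonant skeleton."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("ot", "oit"))  # vowel change only: counted once
print(double_levenshtein("ot", "pt"))   # consonant change: counted twice
```

Because a vowel-only difference disappears in the second pass, spelling variants that differ just in their vowels stay close, while a changed consonant is penalised twice.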
Time to execute SoundEx function - 0.028527 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
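For readers who want to see how such a code is built, below is a minimal Python sketch of the widely described American Soundex rules (keep the first letter, map the remaining consonants to digits, collapse repeats, pad to four characters). It is an illustration only and may differ in edge cases from whatever implementation produced the list above; the function name soundex and the handling of non-letters are assumptions.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no a-z letters."""
    # Assumption: apostrophes, hyphens and other non a-z characters are ignored.
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        if ch not in "hw":  # h and w do not separate repeated consonant codes
            last = digit
    return (code + "000")[:4]  # pad with zeros, truncate to four characters


print(soundex("ot"))   # O300
print(soundex("oot"))  # O300
```

Under these rules ot, oot and odd all share the code O300, which is why they are grouped together.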
Time to execute MetaPhone function - 0.041664 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001042 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
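A lookup of this kind can be sketched as a plain dictionary in Python. The entries and the names LEXICON, lemma_for and variants_of below are hypothetical, invented for illustration, and are not taken from the site's actual lexicon.

```python
from typing import List, Optional

# Hypothetical hand-made entries: surface form -> lemma.
LEXICON = {
    "oot": "oot",
    "'oot": "oot",
    "oot'": "oot",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> List[str]:
    """List every curated surface form that points at the given lemma."""
    return sorted(form for form, target in LEXICON.items() if target == lemma)


print(lemma_for("'oot"))   # oot
print(lemma_for("ablow"))  # None, because this toy table does not cover it
```

A dictionary lookup does no string comparison across the vocabulary, which fits the very short execution time reported for this method.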