A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to oov in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
oov (0) - 8 freq
pov (1) - 1 freq
ov (1) - 59 freq
rov (1) - 1 freq
oos (1) - 1 freq
ooa (1) - 1 freq
oot (1) - 13919 freq
obv (1) - 4 freq
oor (1) - 3999 freq
ooo (1) - 23 freq
mov (1) - 3 freq
oop (1) - 2 freq
oow (1) - 6 freq
ook (1) - 2 freq
oof (1) - 6 freq
oo (1) - 422 freq
ool (1) - 8 freq
gov (1) - 39 freq
oo' (1) - 17 freq
nov (1) - 4 freq
oon (1) - 2 freq
cov (1) - 1 freq
oove (1) - 1 freq
aov (1) - 1 freq
bov (1) - 1 freq
Double Levenshtein
oov (0) - 8 freq
oove (1) - 1 freq
aov (1) - 1 freq
ov (1) - 59 freq
voe (2) - 26 freq
yoove (2) - 2 freq
vo (2) - 8 freq
voo (2) - 3 freq
yav (2) - 1 freq
moov (2) - 3 freq
ooh (2) - 37 freq
av (2) - 161 freq
eev (2) - 5 freq
vou (2) - 5 freq
ove (2) - 1 freq
yuv (2) - 15 freq
yev (2) - 7 freq
ev (2) - 3 freq
uv (2) - 12 freq
v (2) - 266 freq
auv (2) - 2 freq
yv (2) - 1 freq
youv (2) - 1 freq
yiv (2) - 59 freq
oob (2) - 1 freq
SoundEx code - O100
off - 546 freq
of - 4120 freq
oaf - 51 freq
ouf - 1 freq
oof - 6 freq
oaff - 110 freq
'of - 17 freq
oo've - 3 freq
oba - 2 freq
obey - 8 freq
ov - 59 freq
obe - 2 freq
oaap - 1 freq
ob - 8 freq
oaffy - 3 freq
oaffae - 2 freq
ope - 3 freq
oboe - 1 freq
ouff - 1 freq
oop - 2 freq
ove - 1 freq
ofhe - 1 freq
owef - 1 freq
o'view - 2 freq
owf - 1 freq
offae - 1 freq
ovey - 1 freq
'of' - 1 freq
oob - 1 freq
ohff - 1 freq
offy - 2 freq
“of - 6 freq
op - 7 freq
offie - 2 freq
‘of - 3 freq
’ope - 2 freq
offay - 1 freq
oov - 8 freq
’of - 3 freq
owuhv - 1 freq
opo - 1 freq
obvo - 1 freq
ooof - 1 freq
opyu - 1 freq
obf - 1 freq
obv - 4 freq
obbv - 1 freq
ohp - 1 freq
ofh - 1 freq
opui - 1 freq
obh - 2 freq
oup - 1 freq
oupa - 1 freq
oove - 1 freq
opp - 2 freq
obp - 1 freq
MetaPhone code - OF
off - 546 freq
of - 4120 freq
oaf - 51 freq
ouf - 1 freq
oof - 6 freq
oaff - 110 freq
'of - 17 freq
oo've - 3 freq
ov - 59 freq
oaffy - 3 freq
oaffae - 2 freq
ouff - 1 freq
ove - 1 freq
o'view - 2 freq
owf - 1 freq
offae - 1 freq
ovey - 1 freq
'of' - 1 freq
ohff - 1 freq
offy - 2 freq
“of - 6 freq
offie - 2 freq
‘of - 3 freq
offay - 1 freq
oov - 8 freq
’of - 3 freq
ooof - 1 freq
ofh - 1 freq
oove - 1 freq
Manually curated - OOV
Time to execute Levenshtein function - 0.228728 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
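As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the site's own code, and the function name `levenshtein` is chosen for this example only.

```python
def levenshtein(a, b):
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the prefix of a seen so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("oov", "pov"))
print(levenshtein("oov", "voe"))
```

With the words from the listing, `levenshtein("oov", "pov")` gives 1 and `levenshtein("oov", "voe")` gives 2, in line with the distances shown in brackets.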
Time to execute Double Levenshtein function - 0.385333 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
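One possible reading of this description, sketched in Python. The names `strip_vowels` and `double_levenshtein` are invented for this example, and treating y as a vowel is an assumption; it happens to agree with listed distances such as yoove (2) and yav (2), but the site's actual vowel set is not stated.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel


def levenshtein(a, b):
    """Plain Levenshtein distance using two rolling rows."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word):
    """Return the word with every vowel removed (the consonant skeleton)."""
    return "".join(ch for ch in word.lower() if ch not in VOWELS)


def double_levenshtein(a, b):
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("oov", "aov"))
print(double_levenshtein("oov", "ooh"))
```

Under these assumptions, aov scores 1 (only a vowel differs, so the second pass adds nothing) while ooh scores 2 (the differing consonant is counted in both passes).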
Time to execute SoundEx function - 0.030836 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
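A minimal Python sketch of the classic American Soundex rules, offered as an illustration and not as the site's own implementation. The names `CODES` and `soundex` are assumptions made for this example. The first letter is kept, later consonants are mapped to digits, repeated codes are collapsed, and the result is padded to four characters.

```python
# Digit groups of the classic Soundex table; vowels, y, h and w have no digit.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for letter in letters:
        CODES[letter] = digit


def soundex(word):
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Ignore apostrophes, quotes and any other non-ASCII-letter characters.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("oov"))
print(soundex("off"))
```

Both `soundex("oov")` and `soundex("off")` give `O100`, the code shown in the listing, which is why off, of and oov land in the same group.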
Time to execute MetaPhone function - 0.037578 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000861 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
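A lookup of this kind can be sketched as a plain dictionary in Python. The entries below and the names `LEXICON` and `lookup_lemma` are invented for illustration and are not taken from the corpus lexicon.

```python
# Hypothetical hand-built lexicon: surface form -> lemma.
# These entries are placeholders, not data from the corpus.
LEXICON = {
    "hoose": "hoose",
    "hooses": "hoose",
    "hoos": "hoose",  # treated here as a spelling variant
}


def lookup_lemma(word):
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Hooses"))
print(lookup_lemma("ahint"))
```

A dictionary lookup does no string comparison at all, which fits the very short execution time reported for this method. The cost is coverage: any word missing from the table returns nothing.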