A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to occa in Corpus

Levenshtein
occa (0) - 1 freq
ocna (1) - 1 freq
orca (1) - 7 freq
occam (1) - 1 freq
ocbm (2) - 1 freq
oncy (2) - 4 freq
wica (2) - 1 freq
obda (2) - 1 freq
cea (2) - 1 freq
csa (2) - 1 freq
ocht (2) - 317 freq
scc (2) - 1 freq
oncia (2) - 1 freq
ccp (2) - 3 freq
“ca (2) - 1 freq
wicca (2) - 1 freq
oka (2) - 1 freq
sicca (2) - 4 freq
acce (2) - 1 freq
oa (2) - 14 freq
hcc (2) - 1 freq
ca (2) - 194 freq
yca (2) - 2 freq
unca (2) - 12 freq
ora (2) - 1 freq
Double Levenshtein
occa (0) - 1 freq
cocoa (2) - 3 freq
ccea (2) - 1 freq
acce (2) - 1 freq
ecc (2) - 1 freq
ecce (2) - 1 freq
coca (2) - 1 freq
cc (2) - 19 freq
cce (2) - 1 freq
orca (2) - 7 freq
ocna (2) - 1 freq
iccy (2) - 1 freq
occam (2) - 1 freq
echa (3) - 2 freq
cia (3) - 3 freq
reca (3) - 6 freq
'caa (3) - 1 freq
cra (3) - 2 freq
oecd (3) - 1 freq
cha (3) - 19 freq
bacca (3) - 2 freq
macca (3) - 12 freq
oic (3) - 1 freq
cva (3) - 1 freq
oocha (3) - 3 freq
SoundEx code - O200
och - 706 freq
'och - 119 freq
oose - 13 freq
oags - 2 freq
o's - 48 freq
okay - 67 freq
ouch - 24 freq
'ouch - 2 freq
og - 10 freq
oohs - 1 freq
ok - 116 freq
ox - 25 freq
'okay - 8 freq
oqo-oh - 1 freq
ochie - 8 freq
oak - 16 freq
oosie - 3 freq
ock - 1 freq
oz - 15 freq
oq - 4 freq
ooohs - 1 freq
ooze - 3 freq
'ox - 1 freq
o'is - 31 freq
os - 11 freq
ouija - 7 freq
ooies - 1 freq
owes - 4 freq
'owch - 1 freq
'ok - 6 freq
oic - 1 freq
ouk - 16 freq
o'z - 6 freq
owse - 5 freq
osc - 3 freq
oag - 2 freq
ooooooch - 1 freq
o'skye - 1 freq
oki - 5 freq
oozie - 1 freq
ook - 2 freq
oossie - 1 freq
oos - 1 freq
okey - 1 freq
oasy - 1 freq
og' - 1 freq
owis - 1 freq
‘och - 13 freq
“och - 19 freq
ox-eye - 1 freq
oiss - 1 freq
oys - 3 freq
oes - 2 freq
“ox - 1 freq
“ok - 11 freq
ouse - 1 freq
‘okay - 1 freq
‘o-o-o-o-k-k-k-k - 1 freq
—och - 1 freq
“okay - 9 freq
oese - 4 freq
'oose' - 1 freq
'oyez - 1 freq
oyez' - 1 freq
oaks - 1 freq
’ok - 1 freq
’okay - 2 freq
oxj - 1 freq
oxhi - 1 freq
oj - 2 freq
oooocha - 1 freq
oc - 2 freq
okjjx - 1 freq
“och - 1 freq
oega - 1 freq
oceiyi - 1 freq
ouks - 2 freq
oxo - 1 freq
ooocha - 2 freq
oocha - 3 freq
oougg - 1 freq
oooosh - 1 freq
oxi - 2 freq
okaaay… - 1 freq
ojo - 3 freq
ohhsqj - 1 freq
ouzo - 1 freq
okqyuy - 1 freq
oka - 1 freq
ougo - 1 freq
o'whisky - 1 freq
oeq - 1 freq
osz - 1 freq
ojxg - 1 freq
oxsie - 1 freq
oiseau - 1 freq
oqy - 1 freq
ojjz - 1 freq
ozo - 1 freq
oajo - 1 freq
ohohj - 1 freq
oxsi - 1 freq
o’s - 1 freq
oaq - 1 freq
ooekk - 1 freq
oasz - 1 freq
oaj - 1 freq
oigj - 1 freq
okj - 1 freq
oche - 1 freq
occa - 1 freq
ozg - 1 freq
oeg - 1 freq
oszy - 1 freq
oqc - 1 freq
MetaPhone code - OKK
occa - 1 freq
oqc - 1 freq
Manually curated
OCCA
Time to execute Levenshtein function - 0.189088 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
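The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is an assumption chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a, or keep it if equal
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("occa", "orca", "occam", "ocht", "ca"):
        print(word, levenshtein("occa", word))
```

Run against the query word, this gives the same distances as the Levenshtein list above for the words tried here: 0 for occa, 1 for orca and occam, 2 for ocht and ca.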
Time to execute Double Levenshtein function - 0.342572 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
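A rough sketch of that idea in Python follows. It is not the site's implementation: the names `strip_vowels` and `double_levenshtein` are invented for the example, and the vowel set `aeiouy` is an assumption, chosen because counting y as a vowel reproduces the distance of 2 listed for iccy.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance between a and b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # replacement or match
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any punctuation."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("orca", "cocoa", "iccy", "echa"):
        print(word, double_levenshtein("occa", word))
```

For orca, the full-word distance is 1 and the consonant-only distance (cc against rc) is also 1, giving the total of 2 shown in the Double Levenshtein list.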
Time to execute SoundEx function - 0.028409 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
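To illustrate how words such as occa, och and okay all end up under the code O200, here is a minimal sketch of the common American Soundex rules in Python. It is an assumption that the site follows these rules exactly; the function name `soundex` and the choice to skip every character outside a-z (apostrophes, hyphens, quotation marks) are made for this example.

```python
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus up to three digits."""
    # Assumption: anything outside a-z is skipped
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    code = first.upper()
    last_digit = SOUNDEX_CODES.get(first, "")
    for ch in letters[1:]:
        if ch in ("h", "w"):
            continue  # h and w do not separate letters with the same code
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != last_digit:
            code += digit
        last_digit = digit  # a vowel resets this, so a repeated code counts again
    return (code + "000")[:4]  # pad with zeros, cut to four characters


if __name__ == "__main__":
    for word in ("occa", "och", "okay", "o'whisky"):
        print(word, soundex(word))
```

Each of the four sample words encodes to O200: the first letter is kept, the c, k or s sound contributes a single 2, and the rest is zero padding.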
Time to execute MetaPhone function - 0.037623 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000902 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
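A lookup of this kind could be as simple as a dictionary keyed by word form. The sketch below is purely illustrative: the entries in `LEMMA_LOOKUP` are made up for the example rather than taken from the corpus lexicon, and the name `curated_lemma` is hypothetical.

```python
from typing import Optional

# Hypothetical entries for illustration only; not the site's actual lexicon.
LEMMA_LOOKUP = {
    "sonsie": "sonsie",
    "sonsy": "sonsie",
    "sonsey": "sonsie",
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the hand-assigned lemma, or None when the word is not covered."""
    return LEMMA_LOOKUP.get(word.lower())


if __name__ == "__main__":
    print(curated_lemma("Sonsy"))
    print(curated_lemma("occa"))
```

Returning `None` for an unknown word mirrors the note above that not all words are covered.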