A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to eric in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
eric (0) - 63 freq
erica (1) - 1 freq
'eric (1) - 1 freq
elic (1) - 2 freq
enric (1) - 1 freq
epic (1) - 20 freq
erik (1) - 2 freq
eric' (1) - 1 freq
eri (1) - 1 freq
erin (1) - 9 freq
ehic (1) - 1 freq
ric (1) - 10 freq
erie (1) - 1 freq
teri (2) - 4 freq
verie (2) - 23 freq
werit (2) - 1 freq
driv (2) - 12 freq
mrc (2) - 1 freq
rig (2) - 45 freq
dreic (2) - 1 freq
evie (2) - 8 freq
elc (2) - 5 freq
enin (2) - 4 freq
err (2) - 40 freq
grip (2) - 123 freq
Double Levenshtein (combined distance in brackets)
eric (0) - 63 freq
erica (1) - 1 freq
ric (1) - 10 freq
rice (2) - 106 freq
rec (2) - 1 freq
'eric (2) - 1 freq
rac (2) - 1 freq
rico (2) - 2 freq
rc (2) - 3 freq
arc (2) - 7 freq
elic (2) - 2 freq
erie (2) - 1 freq
erik (2) - 2 freq
eri (2) - 1 freq
erin (2) - 9 freq
epic (2) - 20 freq
enric (2) - 1 freq
ehic (2) - 1 freq
eric' (2) - 1 freq
evoc (3) - 1 freq
ern (3) - 14 freq
dic (3) - 1 freq
erne (3) - 4 freq
erke (3) - 1 freq
erse (3) - 269 freq
SoundEx code - E620
erse - 269 freq
ears - 111 freq
erack - 1 freq
eares - 1 freq
eros - 16 freq
erik - 2 freq
ere's - 279 freq
eries - 1 freq
erchie - 59 freq
eers - 9 freq
err's - 1 freq
erch - 3 freq
errs - 1 freq
eariwig - 2 freq
eras - 2 freq
er's - 20 freq
eirse - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
'eric's - 1 freq
ear's - 1 freq
eric's - 8 freq
ere''s - 1 freq
eer's - 1 freq
eoorse - 2 freq
eross - 1 freq
erika - 1 freq
eeyore's - 1 freq
euros - 7 freq
'ere's - 35 freq
eariewigs - 1 freq
eariewig - 2 freq
'eureka - 1 freq
ﬂers - 1 freq
erk - 5 freq
'ears - 8 freq
ee-ers - 1 freq
erek - 1 freq
eirs - 3 freq
errows - 1 freq
ersei - 1 freq
erica - 1 freq
ersh - 1 freq
'er's - 1 freq
ehrs - 2 freq
erz - 3 freq
erza - 1 freq
errza - 2 freq
earse - 4 freq
’ers - 4 freq
eres - 1 freq
erss - 3 freq
ers - 25 freq
‘erchie - 1 freq
eurasia - 1 freq
“eros - 1 freq
eurig - 1 freq
‘ears - 3 freq
ergo - 1 freq
’erse - 1 freq
ersch - 1 freq
ersche - 1 freq
eriwazqu - 1 freq
eurohoschie - 1 freq
‘erse’ - 3 freq
erse' - 1 freq
erikg - 1 freq
erke - 1 freq
'eres - 1 freq
eeeerrrrrs - 1 freq
MetaPhone code - ERK
erack - 1 freq
erik - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
erika - 1 freq
'eureka - 1 freq
erk - 5 freq
erek - 1 freq
erica - 1 freq
eurig - 1 freq
ergo - 1 freq
erke - 1 freq
Manually curated lemma - ERIC
Time to execute Levenshtein function - 0.311272 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
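As an illustration, the definition above can be sketched in Python with the usual row-by-row dynamic programme. This is a minimal sketch, not the code this page runs; the function name levenshtein is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word as the row so the working list stays small.
    if len(a) < len(b):
        a, b = b, a
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(
                prev[j] + 1,         # delete ca
                cur[j - 1] + 1,      # insert cb
                prev[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        prev = cur
    return prev[-1]


# Distances for a few of the neighbours of "eric" listed above.
print(levenshtein("eric", "erica"))  # 1
print(levenshtein("eric", "epic"))   # 1
print(levenshtein("eric", "grip"))   # 2
```

The printed values agree with the bracketed distances in the Levenshtein results for those words.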
Time to execute Double Levenshtein function - 0.502432 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
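A rough Python sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u (the page does not say how y or accented letters are treated), and the names strip_vowels and double_levenshtein are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: only these five letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost))
        prev = cur
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels and keep everything else, apostrophes included."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("eric", "erica"))  # 1 + 0 = 1
print(double_levenshtein("eric", "rice"))   # 2 + 0 = 2
print(double_levenshtein("eric", "erse"))   # 2 + 1 = 3
```

Under that vowel assumption the sums agree with the Double Levenshtein scores shown for erica, rice and erse: a word that differs only in its vowels is penalised once, while a changed consonant is counted in both passes.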
Time to execute SoundEx function - 0.060609 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
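For readers who want to see how such a code is built, here is a minimal Python sketch of the standard four-character American Soundex rules. It is an illustration only: the function name soundex and the choice to ignore anything outside a-z are assumptions for the example, and the page's own implementation may differ in detail.

```python
# Consonant groups used by standard Soundex; vowels, h, w and y get no digit.
GROUPS = {"1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r"}
CODES = {ch: digit for digit, letters in GROUPS.items() for ch in letters}


def soundex(word: str) -> str:
    """Return the first letter plus three digits, padded with zeros."""
    # Assumption: characters outside a-z (apostrophes, quotes) are ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        # Emit a digit only when it differs from the previous letter's digit.
        if digit and digit != prev:
            code += digit
            if len(code) == 4:
                break
        # h and w do not separate two letters with the same digit; vowels do.
        if ch not in "hw":
            prev = digit
    return code.ljust(4, "0")


print(soundex("eric"))    # E620
print(soundex("erse"))    # E620
print(soundex("eureka"))  # E620
```

All three words collapse to E620, which is why erse and eureka appear alongside eric in the SoundEx results even though their Levenshtein distances are larger.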
Time to execute MetaPhone function - 0.036317 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000832 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
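In Python terms, a lookup of this kind could be as simple as a dictionary. The sketch below is hypothetical: only the eric to ERIC pairing comes from the results on this page, the second entry is invented to show a spelling variant, and the names LEXICON and curated_lemma are made up for the example.

```python
from typing import Optional

# Hand-made lexicon mapping surface forms to lemmas.
LEXICON = {
    "eric": "ERIC",    # pairing shown in the results above
    "eric's": "ERIC",  # hypothetical entry, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("eric"))   # ERIC
print(curated_lemma("ablow"))  # None, not in this toy lexicon
```

A plain dictionary lookup involves no string comparison against the rest of the corpus, which fits with this method having the shortest execution time of the five.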