A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to erek in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
erek (0) - 1 freq
erik (1) - 2 freq
ere (1) - 287 freq
rerek (1) - 1 freq
rek (1) - 2 freq
erey (1) - 1 freq
derek (1) - 40 freq
trek (1) - 13 freq
brek (1) - 131 freq
eek (1) - 1 freq
ere' (1) - 3 freq
eres (1) - 1 freq
erk (1) - 5 freq
daek (2) - 18 freq
dreh (2) - 2 freq
erect (2) - 3 freq
eves (2) - 1 freq
pree (2) - 30 freq
errs (2) - 1 freq
ers (2) - 25 freq
ewe (2) - 3 freq
sek (2) - 4 freq
drem (2) - 7 freq
eros (2) - 16 freq
grei (2) - 1 freq
Double Levenshtein
erek (0) - 1 freq
erk (1) - 5 freq
rek (1) - 2 freq
erik (1) - 2 freq
reek (2) - 167 freq
rok (2) - 1 freq
reik (2) - 15 freq
raek (2) - 1 freq
erke (2) - 1 freq
ark (2) - 12 freq
eureka (2) - 3 freq
riek (2) - 2 freq
rik (2) - 6 freq
reyk (2) - 1 freq
irk (2) - 2 freq
yerk (2) - 3 freq
rk (2) - 2 freq
ruk (2) - 4 freq
reak (2) - 1 freq
erika (2) - 1 freq
derek (2) - 40 freq
erey (2) - 1 freq
ere' (2) - 3 freq
eres (2) - 1 freq
uerk (2) - 1 freq
SoundEx code - E620
erse - 269 freq
ears - 111 freq
erack - 1 freq
eares - 1 freq
eros - 16 freq
erik - 2 freq
ere's - 279 freq
eries - 1 freq
erchie - 59 freq
eers - 9 freq
err's - 1 freq
erch - 3 freq
errs - 1 freq
eariwig - 2 freq
eras - 2 freq
er's - 20 freq
eirse - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
'eric's - 1 freq
ear's - 1 freq
eric's - 8 freq
ere''s - 1 freq
eer's - 1 freq
eoorse - 2 freq
eross - 1 freq
erika - 1 freq
eeyore's - 1 freq
euros - 7 freq
'ere's - 35 freq
eariewigs - 1 freq
eariewig - 2 freq
'eureka - 1 freq
ﬂers - 1 freq
erk - 5 freq
'ears - 8 freq
ee-ers - 1 freq
erek - 1 freq
eirs - 3 freq
errows - 1 freq
ersei - 1 freq
erica - 1 freq
ersh - 1 freq
'er's - 1 freq
ehrs - 2 freq
erz - 3 freq
erza - 1 freq
errza - 2 freq
earse - 4 freq
’ers - 4 freq
eres - 1 freq
erss - 3 freq
ers - 25 freq
‘erchie - 1 freq
eurasia - 1 freq
“eros - 1 freq
eurig - 1 freq
‘ears - 3 freq
ergo - 1 freq
’erse - 1 freq
ersch - 1 freq
ersche - 1 freq
eriwazqu - 1 freq
eurohoschie - 1 freq
‘erse’ - 3 freq
erse' - 1 freq
erikg - 1 freq
erke - 1 freq
'eres - 1 freq
eeeerrrrrs - 1 freq
MetaPhone code - ERK
erack - 1 freq
erik - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
erika - 1 freq
'eureka - 1 freq
erk - 5 freq
erek - 1 freq
erica - 1 freq
eurig - 1 freq
ergo - 1 freq
erke - 1 freq
Manually curated
EREK
Time to execute Levenshtein function - 0.199622 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
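As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` and the small `corpus` dictionary (a handful of words and frequencies copied from the results list) are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


# A few words and frequencies taken from the results list (illustrative subset).
corpus = {"erik": 2, "ere": 287, "derek": 40, "erect": 3, "daek": 18}

# Rank by distance to the query, breaking ties alphabetically.
neighbours = sorted(corpus, key=lambda w: (levenshtein("erek", w), w))
```

Sorting the candidate words by this distance gives the kind of nearest-neighbour list shown in the Levenshtein results.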
Time to execute Double Levenshtein function - 0.340108 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
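The two-pass idea can be sketched in Python as follows. This is a guess at the approach, not the site's actual code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o and u as vowels is an assumption (it does agree with the scores listed above, such as derek at 2 and erk at 1).

```python
VOWELS = set("aeiou")  # assumption: y counts as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop a, e, i, o, u and keep every other character."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))
```

Under these assumptions, a change to a vowel costs 1 while a change to a consonant costs 2, which is why erik stays at distance 1 from erek but derek moves from 1 to 2.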
Time to execute SoundEx function - 0.027382 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
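To show how a code such as E620 comes about, here is a minimal Python sketch of the commonly described American Soundex rules. It may differ in edge cases from the implementation used on this site; the function name `soundex` and the choice to ignore any character outside a-z are assumptions.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if there are no letters."""
    # Assumption: apostrophes, hyphens and non-ASCII characters are skipped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated code counts again
    return result.ljust(4, "0")
```

With this sketch, erek, eric, erse and eureka all map to E620, which matches how they are grouped in the SoundEx list.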
Time to execute MetaPhone function - 0.036756 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000818 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
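A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are invented purely for illustration and are not taken from the site's lexicon; the names `LEXICON` and `lookup_lemma` are likewise hypothetical.

```python
from typing import Optional

# Hypothetical entries, invented for illustration only.
LEXICON = {
    "hoose": "hoose",
    "hooses": "hoose",  # inflected form
    "hoos": "hoose",    # treated here as a spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())
```

Returning `None` for a missing word reflects the note above that not all words are covered; a caller could then fall back to one of the automatic methods.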