A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eureka in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
eureka (0) - 3 freq
'eureka (1) - 1 freq
burka (2) - 1 freq
erek (2) - 1 freq
erika (2) - 1 freq
europa (2) - 7 freq
afrika (3) - 5 freq
bure (3) - 3 freq
errza (3) - 2 freq
furra (3) - 42 freq
era (3) - 30 freq
egress (3) - 2 freq
pure (3) - 676 freq
cura (3) - 1 freq
surest (3) - 1 freq
erect (3) - 3 freq
aurora (3) - 10 freq
eliska (3) - 1 freq
cure't (3) - 2 freq
eugene (3) - 1 freq
duress (3) - 4 freq
sure's (3) - 3 freq
puka (3) - 1 freq
'freya (3) - 3 freq
curia (3) - 1 freq
eureka (0) - 3 freq
erika (2) - 1 freq
erek (2) - 1 freq
'eureka (2) - 1 freq
rek (3) - 2 freq
erk (3) - 5 freq
erik (3) - 2 freq
roka (3) - 3 freq
erke (3) - 1 freq
europa (3) - 7 freq
burka (3) - 1 freq
eire (4) - 3 freq
oure (4) - 7 freq
wurk (4) - 73 freq
ork (4) - 2 freq
riek (4) - 2 freq
ebka (4) - 1 freq
ure (4) - 5 freq
derek (4) - 40 freq
trek (4) - 13 freq
yerk (4) - 3 freq
eyre (4) - 7 freq
euro (4) - 21 freq
lurk (4) - 2 freq
burke (4) - 13 freq
SoundEx code - E620
erse - 270 freq
ears - 112 freq
erack - 1 freq
eares - 1 freq
eros - 16 freq
erik - 2 freq
ere's - 279 freq
eries - 1 freq
erchie - 59 freq
eers - 9 freq
err's - 1 freq
erch - 3 freq
errs - 1 freq
eariwig - 2 freq
eras - 2 freq
er's - 20 freq
eirse - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
'eric's - 1 freq
ear's - 1 freq
eric's - 8 freq
ere''s - 1 freq
eer's - 1 freq
eoorse - 2 freq
eross - 1 freq
erika - 1 freq
eeyore's - 1 freq
euros - 7 freq
'ere's - 35 freq
eariewigs - 1 freq
eariewig - 2 freq
'eureka - 1 freq
¬‚ers - 1 freq
erk - 5 freq
'ears - 8 freq
ee-ers - 1 freq
erek - 1 freq
eirs - 3 freq
errows - 1 freq
ersei - 1 freq
erica - 1 freq
ersh - 1 freq
'er's - 1 freq
ehrs - 2 freq
erz - 3 freq
erza - 1 freq
errza - 2 freq
earse - 4 freq
€™ers - 4 freq
eres - 1 freq
erss - 3 freq
ers - 25 freq
€˜erchie - 1 freq
eurasia - 1 freq
€œeros - 1 freq
eurig - 1 freq
€˜ears - 3 freq
ergo - 1 freq
€™erse - 1 freq
ersch - 1 freq
ersche - 1 freq
eriwazqu - 1 freq
eurohoschie - 1 freq
‘erse’ - 3 freq
erse' - 1 freq
erikg - 1 freq
erke - 1 freq
'eres - 1 freq
eeeerrrrrs - 1 freq
MetaPhone code - ERK
erack - 1 freq
erik - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
erika - 1 freq
'eureka - 1 freq
erk - 5 freq
erek - 1 freq
erica - 1 freq
eurig - 1 freq
ergo - 1 freq
erke - 1 freq
EUREKA
Time to execute Levenshtein function - 0.263052 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.347101 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028339 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037113 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000918 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.