A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to erke in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
erke (0) - 1 freq
erle (1) - 4 freq
werke (1) - 1 freq
erse (1) - 269 freq
ere (1) - 287 freq
erae (1) - 1 freq
erie (1) - 1 freq
erk (1) - 5 freq
elke (1) - 3 freq
erne (1) - 4 freq
eake (1) - 1 freq
eke (1) - 3 freq
verge (2) - 9 freq
trie (2) - 2 freq
er (2) - 627 freq
erts (2) - 11 freq
else (2) - 723 freq
ease (2) - 91 freq
irkt (2) - 1 freq
fere (2) - 18 freq
™rme (2) - 1 freq
brk (2) - 1 freq
earse (2) - 4 freq
ake (2) - 7 freq
elkie (2) - 1 freq
erke (0) - 1 freq
erk (1) - 5 freq
rake (2) - 31 freq
ork (2) - 2 freq
erik (2) - 2 freq
rk (2) - 2 freq
uerk (2) - 1 freq
ark (2) - 12 freq
roke (2) - 1 freq
erika (2) - 1 freq
erek (2) - 1 freq
irk (2) - 2 freq
ryke (2) - 4 freq
yerk (2) - 3 freq
uarke (2) - 1 freq
erae (2) - 1 freq
erie (2) - 1 freq
arky (2) - 1 freq
ere (2) - 287 freq
erse (2) - 269 freq
erle (2) - 4 freq
werke (2) - 1 freq
erne (2) - 4 freq
elke (2) - 3 freq
eke (2) - 3 freq
SoundEx code - E620
erse - 269 freq
ears - 111 freq
erack - 1 freq
eares - 1 freq
eros - 16 freq
erik - 2 freq
ere's - 279 freq
eries - 1 freq
erchie - 59 freq
eers - 9 freq
err's - 1 freq
erch - 3 freq
errs - 1 freq
eariwig - 2 freq
eras - 2 freq
er's - 20 freq
eirse - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
'eric's - 1 freq
ear's - 1 freq
eric's - 8 freq
ere''s - 1 freq
eer's - 1 freq
eoorse - 2 freq
eross - 1 freq
erika - 1 freq
eeyore's - 1 freq
euros - 7 freq
'ere's - 35 freq
eariewigs - 1 freq
eariewig - 2 freq
'eureka - 1 freq
¬‚ers - 1 freq
erk - 5 freq
'ears - 8 freq
ee-ers - 1 freq
erek - 1 freq
eirs - 3 freq
errows - 1 freq
ersei - 1 freq
erica - 1 freq
ersh - 1 freq
'er's - 1 freq
ehrs - 2 freq
erz - 3 freq
erza - 1 freq
errza - 2 freq
earse - 4 freq
€™ers - 4 freq
eres - 1 freq
erss - 3 freq
ers - 25 freq
€˜erchie - 1 freq
eurasia - 1 freq
€œeros - 1 freq
eurig - 1 freq
€˜ears - 3 freq
ergo - 1 freq
€™erse - 1 freq
ersch - 1 freq
ersche - 1 freq
eriwazqu - 1 freq
eurohoschie - 1 freq
‘erse’ - 3 freq
erse' - 1 freq
erikg - 1 freq
erke - 1 freq
'eres - 1 freq
eeeerrrrrs - 1 freq
MetaPhone code - ERK
erack - 1 freq
erik - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
erika - 1 freq
'eureka - 1 freq
erk - 5 freq
erek - 1 freq
erica - 1 freq
eurig - 1 freq
ergo - 1 freq
erke - 1 freq
ERKE
Time to execute Levenshtein function - 0.334728 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.760407 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030521 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.098511 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000944 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.