A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to erk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
erk (0) - 5 freq
erd (1) - 11 freq
verk (1) - 2 freq
serk (1) - 3 freq
uerk (1) - 1 freq
eck (1) - 425 freq
perk (1) - 28 freq
derk (1) - 99 freq
erz (1) - 3 freq
eek (1) - 1 freq
ek (1) - 3 freq
elk (1) - 5 freq
merk (1) - 62 freq
irk (1) - 2 freq
brk (1) - 1 freq
ert (1) - 18 freq
ery (1) - 15 freq
err (1) - 40 freq
erf (1) - 2 freq
ork (1) - 2 freq
ere (1) - 287 freq
werk (1) - 8 freq
er (1) - 627 freq
yerk (1) - 3 freq
berk (1) - 3 freq
erk (0) - 5 freq
yerk (1) - 3 freq
ork (1) - 2 freq
erke (1) - 1 freq
erik (1) - 2 freq
erek (1) - 1 freq
irk (1) - 2 freq
rk (1) - 2 freq
ark (1) - 12 freq
uerk (1) - 1 freq
erm (2) - 66 freq
eek (2) - 1 freq
rok (2) - 1 freq
eri (2) - 1 freq
arky (2) - 1 freq
verk (2) - 2 freq
jerk (2) - 8 freq
herk (2) - 3 freq
serk (2) - 3 freq
erd (2) - 11 freq
erika (2) - 1 freq
reik (2) - 15 freq
rek (2) - 2 freq
airk (2) - 3 freq
reak (2) - 1 freq
SoundEx code - E620
erse - 269 freq
ears - 111 freq
erack - 1 freq
eares - 1 freq
eros - 16 freq
erik - 2 freq
ere's - 279 freq
eries - 1 freq
erchie - 59 freq
eers - 9 freq
err's - 1 freq
erch - 3 freq
errs - 1 freq
eariwig - 2 freq
eras - 2 freq
er's - 20 freq
eirse - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
'eric's - 1 freq
ear's - 1 freq
eric's - 8 freq
ere''s - 1 freq
eer's - 1 freq
eoorse - 2 freq
eross - 1 freq
erika - 1 freq
eeyore's - 1 freq
euros - 7 freq
'ere's - 35 freq
eariewigs - 1 freq
eariewig - 2 freq
'eureka - 1 freq
¬‚ers - 1 freq
erk - 5 freq
'ears - 8 freq
ee-ers - 1 freq
erek - 1 freq
eirs - 3 freq
errows - 1 freq
ersei - 1 freq
erica - 1 freq
ersh - 1 freq
'er's - 1 freq
ehrs - 2 freq
erz - 3 freq
erza - 1 freq
errza - 2 freq
earse - 4 freq
€™ers - 4 freq
eres - 1 freq
erss - 3 freq
ers - 25 freq
€˜erchie - 1 freq
eurasia - 1 freq
€œeros - 1 freq
eurig - 1 freq
€˜ears - 3 freq
ergo - 1 freq
€™erse - 1 freq
ersch - 1 freq
ersche - 1 freq
eriwazqu - 1 freq
eurohoschie - 1 freq
‘erse’ - 3 freq
erse' - 1 freq
erikg - 1 freq
erke - 1 freq
'eres - 1 freq
eeeerrrrrs - 1 freq
MetaPhone code - ERK
erack - 1 freq
erik - 2 freq
eureka - 3 freq
eric - 63 freq
'eric - 1 freq
eric' - 1 freq
erika - 1 freq
'eureka - 1 freq
erk - 5 freq
erek - 1 freq
erica - 1 freq
eurig - 1 freq
ergo - 1 freq
erke - 1 freq
ERK
Time to execute Levenshtein function - 0.214693 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.338748 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029046 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036959 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.003343 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.