A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to krem in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated

Levenshtein (distance in brackets)
krem (0) - 1 freq
krehm (1) - 1 freq
drem (1) - 7 freq
kreme (1) - 1 freq
kre (1) - 1 freq
crem (1) - 2 freq
rem (1) - 2 freq
frem (1) - 2 freq
rpm (2) - 1 freq
'em (2) - 43 freq
tre (2) - 5 freq
drum (2) - 72 freq
creme (2) - 1 freq
kteu (2) - 2 freq
gred (2) - 2 freq
keer (2) - 1 freq
bukrem (2) - 1 freq
dem (2) - 722 freq
giem (2) - 1 freq
kemp (2) - 46 freq
drems (2) - 1 freq
skrek (2) - 4 freq
craem (2) - 1 freq
neem (2) - 22 freq
ream (2) - 4 freq

Double Levenshtein (combined distance in brackets)
krem (0) - 1 freq
kreme (1) - 1 freq
karm (2) - 2 freq
krmu (2) - 1 freq
karam (2) - 1 freq
frem (2) - 2 freq
akram (2) - 1 freq
rem (2) - 2 freq
krehm (2) - 1 freq
kre (2) - 1 freq
drem (2) - 7 freq
crem (2) - 2 freq
prum (3) - 1 freq
remo (3) - 1 freq
karep (3) - 1 freq
ker (3) - 24 freq
korea (3) - 6 freq
cram (3) - 2 freq
rm (3) - 11 freq
brom (3) - 1 freq
kyre (3) - 1 freq
nrm (3) - 3 freq
krap (3) - 1 freq
kmm (3) - 1 freq
bram (3) - 1 freq
SoundEx code - K650
kirn - 33 freq
karma - 5 freq
karen - 114 freq
kern - 3 freq
kernow - 2 freq
korma - 3 freq
karm - 2 freq
kerry-on - 2 freq
kerry-in - 1 freq
kairn - 1 freq
kerryin - 12 freq
króna - 1 freq
krone - 2 freq
kerryoan - 1 freq
kkerryin - 1 freq
kerrion - 1 freq
koarn - 1 freq
kyerryin - 1 freq
kerryeen - 1 freq
karn - 7 freq
kerryon - 1 freq
karine - 2 freq
korean - 8 freq
kieran - 26 freq
kerryan - 1 freq
kerrien - 3 freq
kerriein - 1 freq
koran - 1 freq
karohna - 1 freq
karoona - 1 freq
krmu - 1 freq
kcrommie - 19 freq
karam - 1 freq
keiran - 3 freq
krem - 1 freq
krehm - 1 freq
kreme - 1 freq
MetaPhone code - KRM
grimy - 2 freq
grimm - 4 freq
cream - 136 freq
groom - 17 freq
grim - 51 freq
crime - 70 freq
'crime - 1 freq
grime - 5 freq
garm - 1 freq
karma - 5 freq
crum - 1 freq
crem - 2 freq
crumb - 7 freq
'crime' - 2 freq
creamy - 18 freq
cram - 2 freq
gruim - 2 freq
créme - 2 freq
korma - 3 freq
karm - 2 freq
crimbo - 6 freq
gram - 10 freq
carmo' - 1 freq
grame - 3 freq
crimea - 4 freq
gairm - 4 freq
graeme - 19 freq
crambo - 1 freq
crumey - 2 freq
'cream - 1 freq
craem - 1 freq
craemy - 1 freq
quorum - 1 freq
gorm - 1 freq
groam - 1 freq
‘gorm - 1 freq
crame - 1 freq
goram - 1 freq
creme - 1 freq
krmu - 1 freq
‘crime - 1 freq
hcurmi - 1 freq
cream’ - 1 freq
karam - 1 freq
qrme - 1 freq
cromw - 1 freq
creeme - 1 freq
krem - 1 freq
krehm - 1 freq
kreme - 1 freq

Manually curated
KREM
Time to execute Levenshtein function - 0.290254 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
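The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits (insert, delete, replace) turning a into b."""
    # previous[j] = distance between the processed prefix of a and the first j chars of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if they match)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("krem", "krehm", "drum", "bukrem"):
        print(word, levenshtein("krem", word))
```

Run against words from the list above, it gives the same distances as the page shows for krehm (1), drum (2) and bukrem (2).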
Time to execute Double Levenshtein function - 0.434468 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
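A rough sketch of that idea in Python follows. The names `double_levenshtein` and `strip_vowels` are invented for this example, and the vowel set (`aeiou` by default) is an assumption: the page does not say which letters the site treats as vowels, so results for words containing letters such as y may differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,                 # delete
                               current[j - 1] + 1,              # insert
                               previous[j - 1] + (ca != cb)))   # replace or keep
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = "aeiou") -> str:
    """Drop every vowel; the vowel set is an assumption, not taken from the site."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str, vowels: str = "aeiou") -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    skeleton = levenshtein(strip_vowels(a, vowels), strip_vowels(b, vowels))
    return full + skeleton


if __name__ == "__main__":
    for word in ("kreme", "krehm", "karm", "frem"):
        print(word, double_levenshtein("krem", word))
```

Under these assumptions kreme scores 1 (the extra vowel costs nothing in the second pass), while krehm, karm and frem each score 2, in line with the list above.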
Time to execute SoundEx function - 0.027628 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
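To show how a code such as K650 comes about, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters with the same digit, and pad to four characters. It is an illustration only; the site's own implementation may differ in edge cases, and dropping non-ASCII letters is an assumption made here for simplicity.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no ASCII letters."""
    # Assumption: punctuation and non-ASCII letters are simply ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two consonants with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit after a vowel is kept
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("krem", "kirn", "karma", "kerry-on"):
        print(word, soundex(word))
```

All four sample words come out as K650, which is why they share a bucket in the SoundEx list above.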
Time to execute MetaPhone function - 0.036778 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000870 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
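A lookup of this kind can be pictured as a plain dictionary. The sketch below is purely illustrative: the name `LEXICON`, the function `lemma_for` and the three entries are hypothetical stand-ins, not the site's actual table.

```python
from typing import Optional

# Hypothetical hand-made entries: each spelling variant points at one lemma.
LEXICON = {
    "krem": "KREM",
    "krehm": "KREM",
    "kreme": "KREM",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None when the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("Krehm"))    # found via its lower-cased form
    print(lemma_for("unknown"))  # None: not every word has an entry
```

A dictionary lookup does no computation on the word itself, which fits with this method having the smallest execution time listed above.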