A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to criminals in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
criminals (0) - 12 freq
criminall (1) - 1 freq
criminal (1) - 38 freq
creiminal (2) - 1 freq
originals (2) - 3 freq
criminally (2) - 1 freq
criminoal (2) - 1 freq
creeminals (2) - 1 freq
criminaluk (2) - 1 freq
cardinals (3) - 3 freq
creeminal (3) - 2 freq
liminal (3) - 2 freq
chemicals (3) - 12 freq
criminabil (3) - 1 freq
terminals (3) - 2 freq
critical (3) - 19 freq
rinnals (3) - 1 freq
nominals (3) - 1 freq
original (3) - 117 freq
tribunals (3) - 1 freq
crïmnals (3) - 4 freq
ordinals (3) - 3 freq
britnats (4) - 2 freq
originates (4) - 2 freq
writings (4) - 2 freq
criminals (0) - 12 freq
creeminals (2) - 1 freq
criminal (2) - 38 freq
criminall (2) - 1 freq
criminoal (3) - 1 freq
creiminal (3) - 1 freq
criminaluk (3) - 1 freq
criminally (3) - 1 freq
terminals (4) - 2 freq
creeminal (4) - 2 freq
cardinals (4) - 3 freq
originals (4) - 3 freq
ordinals (5) - 3 freq
crïmnals (5) - 4 freq
crennels (5) - 1 freq
crumnilt (5) - 1 freq
tribunals (5) - 1 freq
ceramians (5) - 1 freq
cardinalis (5) - 1 freq
nominals (5) - 1 freq
chemicals (5) - 12 freq
criminabil (5) - 1 freq
rinnals (5) - 1 freq
caramels (6) - 4 freq
ceremonial (6) - 1 freq
SoundEx code - C655
cruinin - 1 freq
crounin - 2 freq
crooning - 2 freq
croonin - 10 freq
charmin - 8 freq
criminal - 38 freq
churnin - 9 freq
crownin - 2 freq
ceremony - 18 freq
carmen - 2 freq
criminals - 12 freq
charming - 3 freq
crinin - 2 freq
creeminal - 2 freq
ceremonious - 1 freq
curnin - 1 freq
chairman - 12 freq
ceremoniously - 3 freq
cairryin-on - 1 freq
cranin - 1 freq
chairmin - 5 freq
cernunnos - 2 freq
carmen' - 1 freq
ceremonie - 3 freq
crowning - 1 freq
chermometer - 1 freq
crammin - 1 freq
croonan - 1 freq
crownan - 1 freq
crewmen - 1 freq
criminoal - 1 freq
ceremonies - 2 freq
creeminals - 1 freq
creiminal - 1 freq
€˜croonin - 1 freq
charmingly - 1 freq
crimond - 4 freq
ceramians - 1 freq
chermin - 1 freq
criminabil - 1 freq
criminall - 1 freq
chirmin - 1 freq
careenin - 1 freq
ceremonial - 1 freq
chairmanship - 1 freq
carmunnock - 7 freq
'carmunnock - 1 freq
carmunnock's - 1 freq
criminally - 1 freq
chairwoman - 1 freq
corinmain - 2 freq
criminaluk - 1 freq
carmond - 4 freq
ciaraninaa - 2 freq
carmoney - 2 freq
MetaPhone code - KRMNLS
criminals - 12 freq
crïmnals - 4 freq
creeminals - 1 freq
CRIMINALS
Time to execute Levenshtein function - 0.185453 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.384394 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029099 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038984 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001023 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.