A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to crimes in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
crimes (0) - 14 freq
crymes (1) - 1 freq
chimes (1) - 8 freq
climes (1) - 6 freq
crises (1) - 1 freq
crime (1) - 70 freq
crimea (1) - 4 freq
cries (1) - 101 freq
rimer (2) - 1 freq
chices (2) - 1 freq
chines (2) - 7 freq
criet (2) - 1 freq
arises (2) - 5 freq
crikey (2) - 5 freq
prices (2) - 34 freq
crisps (2) - 34 freq
haimes (2) - 3 freq
times (2) - 934 freq
crame (2) - 1 freq
rises (2) - 42 freq
grime (2) - 5 freq
drames (2) - 7 freq
trims (2) - 6 freq
rives (2) - 4 freq
'crime (2) - 1 freq
crimes (0) - 14 freq
crymes (1) - 1 freq
crams (2) - 2 freq
crums (2) - 1 freq
crimea (2) - 4 freq
cries (2) - 101 freq
chimes (2) - 8 freq
crime (2) - 70 freq
climes (2) - 6 freq
crises (2) - 1 freq
crumey (3) - 2 freq
creme (3) - 1 freq
craves (3) - 2 freq
cranes (3) - 7 freq
clims (3) - 1 freq
crisis (3) - 31 freq
crumbs (3) - 44 freq
crits (3) - 1 freq
rims (3) - 3 freq
coires (3) - 1 freq
cims (3) - 1 freq
crimp (3) - 1 freq
cram's (3) - 1 freq
criss (3) - 5 freq
cramps (3) - 3 freq
SoundEx code - C652
cronies - 41 freq
crammassy - 1 freq
crying - 16 freq
carnage - 17 freq
crunchin - 11 freq
crunches - 4 freq
crunch - 20 freq
crummock - 4 freq
crums - 1 freq
cramasie - 1 freq
crannies - 4 freq
creenged - 2 freq
cairngorm - 6 freq
corrieneuchin - 7 freq
croons - 14 freq
carnegie's - 2 freq
carnegie - 8 freq
crinched - 3 freq
currency - 16 freq
cherms - 1 freq
coronach - 4 freq
crunkled - 1 freq
crinklet - 1 freq
crimson - 16 freq
crimes - 14 freq
cornucopia - 1 freq
crinkled - 3 freq
crank - 7 freq
curns - 127 freq
cringe - 33 freq
creams - 9 freq
cairngill - 1 freq
corncrakes - 1 freq
crankums - 1 freq
charms - 8 freq
cairngorms - 6 freq
chronicles - 9 freq
chronicle - 15 freq
crinklit - 1 freq
chirms - 2 freq
crammasie - 3 freq
crunk - 5 freq
crunker - 1 freq
crunkin - 1 freq
crunchity - 1 freq
curing - 1 freq
crankin - 3 freq
chronic - 7 freq
cornice - 1 freq
cranny's - 1 freq
cranked - 1 freq
cranks - 4 freq
crummies - 1 freq
cairnsmore - 2 freq
'crammasy' - 1 freq
carmichael - 2 freq
carmichael's - 3 freq
carmichaels - 1 freq
cairns - 13 freq
cringin - 3 freq
crunkit - 3 freq
crunched - 3 freq
cronies' - 1 freq
carron's - 1 freq
cranes - 7 freq
corns - 1 freq
carnegies - 1 freq
cranshed - 1 freq
cram's - 1 freq
creamy-coloured - 2 freq
creamiest - 1 freq
cramassie-coloured - 1 freq
cairry-ons - 1 freq
churns - 4 freq
cringes - 1 freq
cramsh'' - 1 freq
chronos' - 1 freq
charon's - 3 freq
ceramic - 2 freq
currans - 1 freq
crang - 2 freq
crammicks - 1 freq
creenge - 19 freq
crayons - 4 freq
curn's - 1 freq
cramshin - 1 freq
craunch - 1 freq
crannog - 2 freq
crummocks - 1 freq
cairry-oan's - 1 freq
crunchy - 4 freq
crunchan - 1 freq
carry-ons - 1 freq
cranachan - 3 freq
carnies - 2 freq
coherence - 2 freq
cairnsmill - 9 freq
cornceres - 1 freq
cringed - 2 freq
carnoustie - 1 freq
corn-stooks - 1 freq
crinchin - 1 freq
cranns - 1 freq
cramson-faced - 1 freq
crymes - 1 freq
cramson - 2 freq
crams - 2 freq
cornish - 48 freq
cramassie - 1 freq
crannogs - 1 freq
cheerieness - 1 freq
crinkelt - 1 freq
crinkles - 1 freq
chronos - 1 freq
cronus - 1 freq
carynx - 1 freq
crinklin - 1 freq
creinge - 3 freq
corynoch - 1 freq
crunklin - 2 freq
cronykil - 2 freq
crankery - 1 freq
cairrying - 3 freq
chairms - 1 freq
crinch - 1 freq
€œcarrying - 2 freq
carngranny - 2 freq
cornes - 1 freq
corneas - 1 freq
crankum - 1 freq
crans - 1 freq
crannachan - 4 freq
crunkles - 1 freq
crouns - 4 freq
cornis - 1 freq
crowns - 2 freq
cormack - 5 freq
caring - 5 freq
cairnies - 1 freq
cringers - 2 freq
cringewirthy - 1 freq
carmichaelus - 1 freq
cornicin - 1 freq
croneus's - 1 freq
corncrake - 2 freq
crunkelt - 1 freq
chirring - 2 freq
crynes - 1 freq
€œcrunchie - 1 freq
crummoch - 1 freq
cramasay - 1 freq
cheeriness - 1 freq
carrying - 2 freq
crankit - 1 freq
currencies - 1 freq
creenges - 1 freq
cranachan's - 1 freq
cran-cran - 1 freq
creengin-crabbit - 1 freq
€˜creengin-crabbit - 1 freq
churnstaffs - 1 freq
cairnstoon - 2 freq
cream's - 1 freq
cormackdavie - 6 freq
ciaranmacairt - 7 freq
ciaranmcmenamin - 1 freq
carnesure - 1 freq
crunks - 1 freq
cheering - 2 freq
ciaranxyz - 1 freq
cranachanbooks - 4 freq
carmichaelmovies - 1 freq
ciaranastewart - 1 freq
carmic - 2 freq
carnock - 1 freq
cromecast - 1 freq
cranky - 1 freq
“charnock” - 1 freq
cornishnews - 1 freq
cringing - 1 freq
cringeworthy - 1 freq
carn-swallae - 1 freq
caramac - 1 freq
chroniclelive - 1 freq
carryonkeith - 1 freq
MetaPhone code - KRMS
crumbs - 44 freq
crammassy - 1 freq
grimace - 3 freq
crums - 1 freq
cramasie - 1 freq
crimes - 14 freq
creams - 9 freq
grooms - 3 freq
crammasie - 3 freq
crummies - 1 freq
'crammasy' - 1 freq
gramsci - 1 freq
cram's - 1 freq
groom's - 2 freq
gruims - 1 freq
gormiess - 1 freq
crymes - 1 freq
crams - 2 freq
cramassie - 1 freq
gorms - 2 freq
cramasay - 1 freq
grams - 2 freq
cream's - 1 freq
CRIMES
Time to execute Levenshtein function - 0.307433 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.480341 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034896 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040095 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000955 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.