A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to crop in Corpus

Methods (results are listed below in this order): Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
crop (0) - 23 freq
crok (1) - 1 freq
cros (1) - 1 freq
crow (1) - 10 freq
chop (1) - 25 freq
croup (1) - 2 freq
crep (1) - 4 freq
coop (1) - 1 freq
crops (1) - 19 freq
croo (1) - 7 freq
rop (1) - 6 freq
drop (1) - 39 freq
prop (1) - 14 freq
cop (1) - 4 freq
crap (1) - 85 freq
fro (2) - 3 freq
droo (2) - 1 freq
rap (2) - 20 freq
ckp (2) - 1 freq
chos (2) - 1 freq
cowp (2) - 64 freq
creep (2) - 31 freq
pro (2) - 11 freq
choo (2) - 3 freq
cyp (2) - 1 freq
Double Levenshtein (distance in brackets)
crop (0) - 23 freq
crep (1) - 4 freq
croup (1) - 2 freq
crap (1) - 85 freq
chop (2) - 25 freq
crok (2) - 1 freq
cop (2) - 4 freq
creep (2) - 31 freq
corp (2) - 51 freq
creip (2) - 2 freq
carp (2) - 1 freq
prop (2) - 14 freq
crepe (2) - 1 freq
coop (2) - 1 freq
cros (2) - 1 freq
crow (2) - 10 freq
drop (2) - 39 freq
crops (2) - 19 freq
rop (2) - 6 freq
croo (2) - 7 freq
csp (3) - 2 freq
crony (3) - 5 freq
coup (3) - 25 freq
acroo (3) - 1 freq
cusp (3) - 3 freq
SoundEx code - C610
creep - 31 freq
criv - 2 freq
cheeri-bye - 1 freq
crab - 32 freq
corbie - 27 freq
chirp - 2 freq
corp - 51 freq
creepy - 30 freq
chirpy - 3 freq
crep - 4 freq
crave - 13 freq
curve - 24 freq
crib - 7 freq
crap - 85 freq
crieff - 7 freq
crappy - 6 freq
cheerfu - 4 freq
croup - 2 freq
creip - 2 freq
craib - 5 freq
crub - 3 freq
curvy - 3 freq
curfew - 2 freq
crop - 23 freq
carefu - 23 freq
crabbie - 2 freq
cribbie - 1 freq
creepie - 9 freq
crowpie - 1 freq
cheery-bye - 1 freq
carve - 5 freq
curb - 1 freq
carefou - 1 freq
carefu' - 1 freq
choirboy - 2 freq
crepe - 1 freq
'creep - 1 freq
cheeribye - 1 freq
'creepy - 1 freq
carpe - 2 freq
‘creepy - 1 freq
cruive - 1 freq
carp - 1 freq
corpy - 1 freq
corfu - 3 freq
cairve - 1 freq
cuervo - 1 freq
cerfoo - 1 freq
crieffy - 1 freq
crbi - 1 freq
craufy - 1 freq
cherrybye - 1 freq
MetaPhone code - KRP
grip - 125 freq
creep - 31 freq
corp - 51 freq
grup - 52 freq
group - 346 freq
creepy - 30 freq
gripe - 5 freq
crep - 4 freq
crap - 85 freq
grippy - 9 freq
grippie - 7 freq
graip - 5 freq
crappy - 6 freq
croup - 2 freq
grape - 8 freq
creip - 2 freq
kreepee - 1 freq
crop - 23 freq
gruppie - 2 freq
'grip - 1 freq
karep - 1 freq
grippy' - 1 freq
creepie - 9 freq
krapp - 1 freq
'group - 1 freq
crowpie - 1 freq
grappa - 3 freq
groop - 1 freq
krap - 1 freq
crepe - 1 freq
'creep - 1 freq
greep - 1 freq
groupie - 3 freq
gruip - 1 freq
gruppy - 1 freq
'creepy - 1 freq
carpe - 2 freq
‘creepy - 1 freq
grupp - 2 freq
carp - 1 freq
corpy - 1 freq
qrp - 1 freq
qirip - 1 freq
Manually curated - CROP
Time to execute Levenshtein function - 0.300662 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
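The definition above can be sketched in a few lines of Python. This is only an illustration using the standard dynamic-programming approach; the site's own implementation is not shown on this page, and the function name `levenshtein` is chosen here for convenience.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits (replace, insert, delete) turning a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the first i-1 letters of a
    # and the first j letters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete a letter from a
                current[j - 1] + 1,      # insert a letter into a
                previous[j - 1] + cost,  # replace a letter (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("crop", "crow"))
print(levenshtein("crop", "creep"))
```

The two calls correspond to entries in the Levenshtein list: crow at distance 1 and creep at distance 2.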
Time to execute Double Levenshtein function - 0.784645 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
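A minimal sketch of that idea in Python follows. The names `levenshtein`, `strip_vowels` and `double_levenshtein` are chosen for this example, and the vowel set is an assumption: treating y as a vowel alongside a, e, i, o, u is inferred from the listed results (crony scores 3, which only works out if y is dropped), not stated by the page.

```python
def levenshtein(a: str, b: str) -> int:
    """Standard edit distance: replacements, insertions and deletions."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y is treated as a vowel (inferred from the listed scores).
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Return the word with every vowel removed, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("crop", "crep"))  # vowel change only
print(double_levenshtein("crop", "chop"))  # consonant change counts twice
```

A vowel-only change such as crop to crep costs 1, while a consonant change such as crop to chop costs 2, which matches the ordering in the Double Levenshtein list.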
Time to execute SoundEx function - 0.081796 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
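As an illustration, here is a sketch of the classic American Soundex rules in Python. It is not the site's own code, which is not shown here, and the function name `soundex` is chosen for this example. Non-letters are skipped as an assumption, so that forms with a leading apostrophe or a hyphen can still be encoded.

```python
def soundex(word: str) -> str:
    """Encode a word as its first letter plus three digits (classic Soundex)."""
    # Consonant groups that share a digit; vowels, h, w and y have no digit.
    codes = {}
    for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in group:
            codes[ch] = digit

    # Assumption: anything outside a-z (apostrophes, hyphens) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so repeats across vowels count
        if len(result) == 4:
            break
    return (result + "000")[:4]  # pad short codes with zeros


print(soundex("crop"))
print(soundex("corbie"))
```

Under these rules crop and corbie both encode to C610, the code shown at the head of the SoundEx list.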
Time to execute MetaPhone function - 0.087064 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001002 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
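A lookup of this kind can be sketched as a plain dictionary in Python. The entries and the names `LEMMA_LOOKUP` and `lemma_for` are hypothetical and chosen for illustration; the real lexicon and its format are not shown on this page.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real lexicon is hand-built
# and far larger.
LEMMA_LOOKUP = {
    "crop": "CROP",
    "crops": "CROP",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEMMA_LOOKUP.get(word.lower())


print(lemma_for("crops"))
print(lemma_for("sonsie"))
```

Returning None for a missing word reflects the note that not all words are covered. A dictionary lookup takes constant time on average, which fits with this being the fastest of the five timings.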