A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to corp in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
corp (0) - 51 freq
corn (1) - 104 freq
coup (1) - 25 freq
cop (1) - 4 freq
cora (1) - 3 freq
comp (1) - 3 freq
coap (1) - 2 freq
cowp (1) - 64 freq
cor (1) - 4 freq
core (1) - 54 freq
cord (1) - 18 freq
corps (1) - 13 freq
coop (1) - 1 freq
carp (1) - 1 freq
copp (1) - 1 freq
cors (1) - 1 freq
cork (1) - 31 freq
corpy (1) - 1 freq
cory (1) - 2 freq
tori (2) - 1 freq
arp (2) - 1 freq
compy (2) - 1 freq
dop (2) - 1 freq
poop (2) - 3 freq
mrp (2) - 1 freq
corp (0) - 51 freq
carp (1) - 1 freq
corpy (1) - 1 freq
cop (2) - 4 freq
cory (2) - 2 freq
croup (2) - 2 freq
cork (2) - 31 freq
crap (2) - 85 freq
crep (2) - 4 freq
crop (2) - 23 freq
carpe (2) - 2 freq
corn (2) - 104 freq
copp (2) - 1 freq
cors (2) - 1 freq
cowp (2) - 64 freq
coap (2) - 2 freq
comp (2) - 3 freq
cora (2) - 3 freq
coup (2) - 25 freq
cor (2) - 4 freq
coop (2) - 1 freq
core (2) - 54 freq
corps (2) - 13 freq
cord (2) - 18 freq
coard (3) - 3 freq
SoundEx code - C610
creep - 31 freq
criv - 2 freq
cheeri-bye - 1 freq
crab - 32 freq
corbie - 27 freq
chirp - 2 freq
corp - 51 freq
creepy - 30 freq
chirpy - 3 freq
crep - 4 freq
crave - 13 freq
curve - 24 freq
crib - 7 freq
crap - 85 freq
crieff - 7 freq
crappy - 6 freq
cheerfu - 4 freq
croup - 2 freq
creip - 2 freq
craib - 5 freq
crub - 3 freq
curvy - 3 freq
curfew - 2 freq
crop - 23 freq
carefu - 23 freq
crabbie - 2 freq
cribbie - 1 freq
creepie - 9 freq
crowpie - 1 freq
cheery-bye - 1 freq
carve - 5 freq
curb - 1 freq
carefou - 1 freq
carefu' - 1 freq
choirboy - 2 freq
crepe - 1 freq
'creep - 1 freq
cheeribye - 1 freq
'creepy - 1 freq
carpe - 2 freq
€˜creepy - 1 freq
cruive - 1 freq
carp - 1 freq
corpy - 1 freq
corfu - 3 freq
cairve - 1 freq
cuervo - 1 freq
cerfoo - 1 freq
crieffy - 1 freq
crbi - 1 freq
craufy - 1 freq
cherrybye - 1 freq
MetaPhone code - KRP
grip - 125 freq
creep - 31 freq
corp - 51 freq
grup - 52 freq
group - 346 freq
creepy - 30 freq
gripe - 5 freq
crep - 4 freq
crap - 85 freq
grippy - 9 freq
grippie - 7 freq
graip - 5 freq
crappy - 6 freq
croup - 2 freq
grape - 8 freq
creip - 2 freq
kreepee - 1 freq
crop - 23 freq
gruppie - 2 freq
'grip - 1 freq
karep - 1 freq
grippy' - 1 freq
creepie - 9 freq
krapp - 1 freq
'group - 1 freq
crowpie - 1 freq
grappa - 3 freq
groop - 1 freq
krap - 1 freq
crepe - 1 freq
'creep - 1 freq
greep - 1 freq
groupie - 3 freq
gruip - 1 freq
gruppy - 1 freq
'creepy - 1 freq
carpe - 2 freq
€˜creepy - 1 freq
grupp - 2 freq
carp - 1 freq
corpy - 1 freq
qrp - 1 freq
qirip - 1 freq
CORP
Time to execute Levenshtein function - 0.499382 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.735483 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.080170 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.088920 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000927 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.