A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example: sonsie

Similar words to corp in Corpus

Results are listed for each method in turn: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated.
Levenshtein
corp (0) - 51 freq
corps (1) - 13 freq
coop (1) - 1 freq
core (1) - 54 freq
cowp (1) - 63 freq
cop (1) - 4 freq
cory (1) - 2 freq
cord (1) - 18 freq
coap (1) - 2 freq
cors (1) - 1 freq
copp (1) - 1 freq
carp (1) - 1 freq
cora (1) - 3 freq
corpy (1) - 1 freq
cork (1) - 31 freq
coup (1) - 25 freq
corn (1) - 102 freq
cor (1) - 3 freq
comp (1) - 3 freq
pomp (2) - 2 freq
coapy (2) - 2 freq
cox (2) - 5 freq
work (2) - 654 freq
cur (2) - 6 freq
wop (2) - 1 freq
Double Levenshtein
corp (0) - 51 freq
corpy (1) - 1 freq
carp (1) - 1 freq
corn (2) - 102 freq
cork (2) - 31 freq
cor (2) - 3 freq
coup (2) - 25 freq
crep (2) - 4 freq
crap (2) - 83 freq
carpe (2) - 2 freq
crop (2) - 23 freq
comp (2) - 3 freq
croup (2) - 2 freq
cowp (2) - 63 freq
core (2) - 54 freq
cora (2) - 3 freq
coop (2) - 1 freq
cop (2) - 4 freq
corps (2) - 13 freq
coap (2) - 2 freq
cory (2) - 2 freq
cord (2) - 18 freq
cors (2) - 1 freq
copp (2) - 1 freq
co-op (3) - 15 freq
SoundEx code - C610
creep - 30 freq
criv - 2 freq
cheeri-bye - 1 freq
crab - 32 freq
corbie - 27 freq
chirp - 2 freq
corp - 51 freq
creepy - 29 freq
chirpy - 3 freq
crep - 4 freq
crave - 13 freq
curve - 24 freq
crib - 7 freq
crap - 83 freq
crieff - 7 freq
crappy - 6 freq
cheerfu - 4 freq
croup - 2 freq
creip - 2 freq
craib - 5 freq
crub - 3 freq
curvy - 3 freq
curfew - 2 freq
crop - 23 freq
carefu - 23 freq
crabbie - 2 freq
cribbie - 1 freq
creepie - 9 freq
crowpie - 1 freq
cheery-bye - 1 freq
carve - 5 freq
curb - 1 freq
carefou - 1 freq
carefu' - 1 freq
choirboy - 2 freq
crepe - 1 freq
'creep - 1 freq
cheeribye - 1 freq
'creepy - 1 freq
carpe - 2 freq
‘creepy - 1 freq
cruive - 1 freq
carp - 1 freq
corpy - 1 freq
corfu - 3 freq
cairve - 1 freq
cuervo - 1 freq
cerfoo - 1 freq
crieffy - 1 freq
crbi - 1 freq
craufy - 1 freq
cherrybye - 1 freq
MetaPhone code - KRP
grip - 123 freq
creep - 30 freq
corp - 51 freq
grup - 52 freq
group - 332 freq
creepy - 29 freq
gripe - 5 freq
crep - 4 freq
crap - 83 freq
grippy - 9 freq
grippie - 7 freq
graip - 5 freq
crappy - 6 freq
croup - 2 freq
grape - 8 freq
creip - 2 freq
kreepee - 1 freq
crop - 23 freq
gruppie - 2 freq
'grip - 1 freq
karep - 1 freq
grippy' - 1 freq
creepie - 9 freq
krapp - 1 freq
'group - 1 freq
crowpie - 1 freq
grappa - 3 freq
groop - 1 freq
krap - 1 freq
crepe - 1 freq
'creep - 1 freq
greep - 1 freq
groupie - 3 freq
gruip - 1 freq
gruppy - 1 freq
'creepy - 1 freq
carpe - 2 freq
‘creepy - 1 freq
grupp - 2 freq
carp - 1 freq
corpy - 1 freq
qrp - 1 freq
qirip - 1 freq
Manually curated
CORP
Time to execute Levenshtein function - 0.501165 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
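As an illustration of the definition above, here is a minimal Python sketch of the distance using the standard row-by-row dynamic programming method. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and
    deletions needed to turn word a into word b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace (or keep) the character
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("corp", "corps", "work"):
        print(word, levenshtein("corp", word))
```

With this sketch, "corps" is one edit away from "corp" and "work" is two, matching the figures in the Levenshtein list for corp.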
Time to execute Double Levenshtein function - 1.095649 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
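The idea can be sketched in Python as follows. This is a guess at the behaviour, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating "y" as a vowel is an assumption inferred from the scores listed for corp (corpy scores 1, which only works if the y is dropped).

```python
# Assumption: "y" counts as a vowel (inferred from the listed scores).
VOWELS = set("aeiouy")


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one row at a time."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("corpy", "corn", "crap", "co-op"):
        print(word, double_levenshtein("corp", word))
```

Under these assumptions a vowel-only change such as corp to carp costs 1, while a consonant change such as corp to corn costs 2, which is why words like crap and crop rank closer here than in the plain Levenshtein list.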
Time to execute SoundEx function - 0.102217 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
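For readers who want to see how a code such as C610 comes about, here is a small Python sketch of the common American Soundex rules: keep the first letter, encode the following consonants as digits, collapse adjacent repeats, and pad to four characters. It is an illustrative implementation, not the function this site calls, and skipping non-letter characters such as apostrophes and hyphens is an assumption.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code of a word ('' if no letters)."""
    # Assumption: anything that is not a-z is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit:
            if digit != last:   # adjacent letters with the same digit count once
                code += digit
            last = digit
        elif ch not in "hw":
            last = ""           # a vowel separates repeated digits; h and w do not
        if len(code) == 4:
            break
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("corp", "creep", "choirboy", "cheeri-bye"):
        print(word, soundex(word))
```

Because only the first letter and the consonant classes survive, words as different as corp, choirboy and cheeri-bye all collapse to C610.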
Time to execute MetaPhone function - 0.102191 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000775 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
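A lookup of this kind can be sketched as a plain dictionary in Python. The table below is hypothetical: only the corp to CORP pairing appears on this page, the other entries are invented for illustration, and the names `LEMMAS` and `lookup_lemma` are not taken from the site.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping word forms to lemmas.
# Only "corp" -> "CORP" is taken from the page; the rest are invented examples.
LEMMAS = {
    "corp": "CORP",
    "corps": "CORP",   # invented entry: a spelling variation
    "corpp": "CORP",   # invented entry: an obvious typo
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    # Assumption: lookups ignore case and surrounding quote marks.
    key = word.lower().strip("'\u2018\u2019")
    return LEMMAS.get(key)


if __name__ == "__main__":
    for word in ("corp", "'Corps", "sonsie"):
        print(word, lookup_lemma(word))
```

A dictionary lookup does no computation over the rest of the vocabulary, which fits the very short execution time reported for this method, and returning None reflects the fact that not all words are covered.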