A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to cedars in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
cedars (0) - 1 freq
cedar (1) - 4 freq
redars (1) - 1 freq
ledars (1) - 1 freq
pedlars (2) - 1 freq
lears (2) - 2 freq
cigars (2) - 5 freq
ledar (2) - 1 freq
cheddars (2) - 1 freq
leears (2) - 6 freq
years (2) - 1172 freq
nears (2) - 3 freq
tears (2) - 307 freq
kedar (2) - 1 freq
dears (2) - 12 freq
velars (2) - 1 freq
pedals (2) - 3 freq
cesar (2) - 2 freq
cesare (2) - 1 freq
clears (2) - 19 freq
ears (2) - 112 freq
chars (2) - 1 freq
cars (2) - 100 freq
gears (2) - 15 freq
czars (2) - 2 freq

Double Levenshtein (combined distance in brackets)
cedars (0) - 1 freq
cedar (2) - 4 freq
ledars (2) - 1 freq
redars (2) - 1 freq
chars (3) - 1 freq
clears (3) - 19 freq
cars (3) - 100 freq
coars (3) - 1 freq
dears (3) - 12 freq
czars (3) - 2 freq
radars (3) - 1 freq
cigars (3) - 5 freq
calders (4) - 2 freq
cheers (4) - 219 freq
riders (4) - 8 freq
edders (4) - 1 freq
coads (4) - 1 freq
curers (4) - 1 freq
cinders (4) - 7 freq
leaders (4) - 30 freq
daurs (4) - 2 freq
tudors (4) - 2 freq
colors (4) - 1 freq
anders (4) - 1 freq
cds (4) - 20 freq

SoundEx code - C362
cattrick - 1 freq
chitters - 7 freq
catharsis - 1 freq
cottaries - 1 freq
citris - 3 freq
citrus - 1 freq
cedars - 1 freq
cottars - 19 freq
citrusy - 1 freq
cottar-hooses - 1 freq
catter's - 1 freq
cutter's - 1 freq
cheeters - 1 freq
cataracts - 1 freq
cataract - 5 freq
cottars' - 3 freq
cottar's - 1 freq
cottar-hoose - 5 freq
citric - 2 freq
cheaters - 1 freq
cathersmcg - 5 freq
cheddars - 1 freq

MetaPhone code - STRS
storeys - 3 freq
stairs - 209 freq
stars - 106 freq
stories - 359 freq
stours - 3 freq
steers - 11 freq
staris - 1 freq
stress - 41 freq
story's - 1 freq
soutar's - 6 freq
stares - 41 freq
staurs - 3 freq
stirs - 6 freq
stores - 30 freq
citris - 3 freq
strays - 5 freq
citrus - 1 freq
cedars - 1 freq
citrusy - 1 freq
straws - 6 freq
star's - 1 freq
steirs - 4 freq
strae's - 1 freq
strahs - 1 freq
steer's - 1 freq
satires - 5 freq
stars' - 1 freq
sters - 1 freq
straes - 3 freq
stores-' - 1 freq
stoor's - 1 freq
strauss - 93 freq
setter's - 1 freq
saeter's - 1 freq
steeers - 1 freq
satyrs - 3 freq
stair's - 2 freq
sterrs - 7 freq
stories' - 3 freq
soutars' - 1 freq
sottars - 3 freq
storeys' - 1 freq
stres - 1 freq
suitors - 4 freq
strassa' - 1 freq
stors - 1 freq
souders - 1 freq
storees - 1 freq
straas - 1 freq
'stars' - 1 freq

Manually curated
CEDARS

Time to execute Levenshtein function - 0.244126 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
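
The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming method, not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    # prev[j] is the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        prev = curr
    return prev[-1]


print(levenshtein("cedars", "cedar"))   # 1
print(levenshtein("cedars", "cigars"))  # 2
```

The two printed values agree with the distances listed for cedar and cigars in the Levenshtein results.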
Time to execute Double Levenshtein function - 0.438506 milliseconds
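In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with their vowels removed, and adds the two distances together, giving double weight to consonants. For example, cedar is 1 edit from cedars as written and 1 edit with vowels removed (cdr against cdrs), giving 2.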
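
A rough Python sketch of this double pass follows. It assumes the vowels removed are a, e, i, o and u (the exact set is not stated on this page), and the names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("cedars", "cedar"))  # 2
print(double_levenshtein("cedars", "cars"))   # 3
```

Under that vowel assumption the sketch gives the same scores as the Double Levenshtein results for cedar (2) and cars (3).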
Time to execute SoundEx function - 0.033919 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
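
To show how different spellings end up with the same code, here is a small Python sketch of the commonly described American Soundex rules. It is an illustration only and may differ in edge cases from the function this site calls; the names `CODES` and `soundex` are chosen for the example.

```python
# Consonant groups that share a Soundex digit.
CODES = {}
for digit, group in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for letter in group:
        CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """First letter plus up to three digits, padded with zeros to length 4."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit  # a vowel resets prev, so a repeated digit after it counts again
    return (code + "000")[:4]


print(soundex("cedars"))   # C362
print(soundex("cottars"))  # C362
```

Both words reduce to C, then 3 for d or t, 6 for r and 2 for s, which is why they appear together in the SoundEx results.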
Time to execute MetaPhone function - 0.039226 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, one that does a better job of matching words and names that sound similar.
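
The full Metaphone rule set is long, so the Python sketch below implements only a small subset of it: just enough to show why cedars, stars and citrus all share the code STRS. It is deliberately incomplete (no TH, SH, CH, PH, GH or silent-letter rules, and y is simply dropped), and the name `metaphone_subset` is made up for the example.

```python
VOWELS = set("aeiou")
SOFTENERS = {"e", "i", "y"}  # letters that turn a preceding c into an s sound


def metaphone_subset(word: str) -> str:
    """Encode a word with a few Metaphone rules only; NOT the full algorithm."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    out = []
    for i, ch in enumerate(letters):
        nxt = letters[i + 1] if i + 1 < len(letters) else ""
        if i > 0 and ch == letters[i - 1] and ch != "c":
            continue  # doubled letters are encoded once
        if ch in VOWELS:
            if i == 0:
                out.append(ch.upper())  # vowels are kept only at the start
        elif ch == "y":
            continue  # simplification: y is always dropped in this sketch
        elif ch == "c":
            out.append("S" if nxt in SOFTENERS else "K")
        elif ch == "d":
            out.append("T")  # d and t share one code
        else:
            out.append(ch.upper())  # simplification: other consonants pass through
    return "".join(out)


for w in ["cedars", "stars", "citrus", "souders"]:
    print(w, metaphone_subset(w))  # each prints STRS
```

Because c before e or i becomes S and d becomes T, cedars lands among words beginning with st, which explains the very different neighbours in the MetaPhone results compared with the SoundEx ones.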
Time to execute Manually curated function - 0.000920 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
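
A lookup of this kind might look like the Python sketch below. The table entries are hypothetical placeholders, not the site's real lexicon, and the names `LEXICON` and `lookup_lemma` are invented for the example.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real hand-built table is not shown here.
LEXICON = {
    "cedars": "CEDARS",  # the page shows CEDARS for this query; treating it as the lemma is an assumption
    "cedar": "CEDAR",    # assumed entry
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("cedars"))
print(lookup_lemma("sonsie"))  # not in this toy table, so None
```

A plain dictionary lookup involves no per-word computation, which fits with this method having the shortest execution time listed above.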