A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to nearest in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
nearest (0) - 64 freq
naarest (1) - 1 freq
neatest (1) - 1 freq
neardest (1) - 1 freq
dearest (1) - 19 freq
earnest (2) - 13 freq
newest (2) - 15 freq
weakest (2) - 3 freq
neared (2) - 3 freq
nears (2) - 3 freq
ferest (2) - 1 freq
neast (2) - 1 freq
faarest (2) - 1 freq
foarest (2) - 1 freq
erest (2) - 10 freq
merest (2) - 6 freq
narrest (2) - 6 freq
naurest (2) - 1 freq
rarest (2) - 4 freq
neest (2) - 88 freq
clearest (2) - 4 freq
nearer (2) - 72 freq
meanest (2) - 1 freq
eares (2) - 1 freq
rerrest (2) - 1 freq
Double Levenshtein (combined distance in brackets)
nearest (0) - 64 freq
naarest (1) - 1 freq
naurest (2) - 1 freq
dearest (2) - 19 freq
neatest (2) - 1 freq
neardest (2) - 1 freq
merest (3) - 6 freq
narrest (3) - 6 freq
rarest (3) - 4 freq
erest (3) - 10 freq
noreast (3) - 2 freq
nurst (3) - 2 freq
unrest (3) - 6 freq
neest (3) - 88 freq
newest (3) - 15 freq
foarest (3) - 1 freq
ferest (3) - 1 freq
nears (3) - 3 freq
faarest (3) - 1 freq
neast (3) - 1 freq
rest (4) - 683 freq
fairest (4) - 7 freq
peeriest (4) - 14 freq
neirish (4) - 2 freq
warst (4) - 71 freq
SoundEx code - N623
nearest - 64 freq
nor-east - 21 freq
narrest - 6 freq
nurst - 2 freq
nearesthaund - 1 freq
nor'easter - 1 freq
norside - 1 freq
nourishit - 1 freq
nursed - 5 freq
narkit - 1 freq
nyirgit - 2 freq
nor'west - 1 freq
nouriced - 1 freq
near-shut - 2 freq
nor'aester - 2 freq
noreaster - 1 freq
nor-wastawa - 1 freq
naarest - 1 freq
nor-aist - 3 freq
nourished - 1 freq
nor-wast - 2 freq
norwasterly - 1 freq
nor-wasterd - 1 freq
narcotiks - 1 freq
nor-easters - 2 freq
narked - 1 freq
nor-westerly - 1 freq
nursit - 1 freq
naurest - 1 freq
naerestforthewicked - 1 freq
noreast - 2 freq
nareystoepoker - 1 freq
nursiedear - 1 freq
MetaPhone code - NRST
nearest - 64 freq
nor-east - 21 freq
narrest - 6 freq
nurst - 2 freq
norside - 1 freq
nursed - 5 freq
nouriced - 1 freq
naarest - 1 freq
nor-aist - 3 freq
nursit - 1 freq
naurest - 1 freq
noreast - 2 freq
Manually curated - NEAREST
near - 1153 freq
nearer - 72 freq
nearest - 64 freq
Time to execute Levenshtein function - 0.359214 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
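As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is simply chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or
    replacements needed to turn `a` into `b`."""
    # prev[j] = distance between the part of `a` seen so far and b[:j].
    # Before any character of `a` is read, that is j insertions.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    for word in ("nearest", "naarest", "dearest", "newest"):
        print(word, levenshtein("nearest", word))
```

Only two rows of the table are kept in memory at a time, which is enough because each cell depends only on the current and previous rows.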
Time to execute Double Levenshtein function - 0.788971 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
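The idea can be sketched in Python as follows. This is a guess at the approach rather than the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating only a, e, i, o and u as vowels (so that y counts as a consonant) is an assumption.

```python
VOWELS = set("aeiou")  # assumption: 'y' is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace all cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping consonants in their original order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("naurest", "dearest", "rest"):
        print(word, double_levenshtein("nearest", word))
```

Under these assumptions the sketch gives 2 for naurest, 2 for dearest and 4 for rest against nearest, matching the figures listed on this page. A vowel-only variant such as naurest is not penalised a second time, while a changed consonant counts twice.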
Time to execute SoundEx function - 0.027168 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
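For readers curious how a code such as N623 comes about, below is a small Python sketch of the commonly described American Soundex rules. It is an illustration only: the site's own implementation may differ in edge cases (for example in how h and w are handled), and dropping hyphens and apostrophes before encoding is an assumption made here.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, e.g. 'N623'."""
    # Assumption: anything that is not a-z (hyphens, apostrophes) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated digit is kept after it
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ("nearest", "nor-east", "nurst", "narkit"):
        print(word, soundex(word))
```

Because only the first letter and the next three consonant digits are kept, quite different words such as nearest and narkit end up with the same code, which is why the SoundEx list on this page is longer and looser than the others.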
Time to execute MetaPhone function - 0.098247 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000855 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
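A lookup of this kind might be sketched in Python as below. The table layout, the names `LEXICON`, `FREQ` and `related_forms`, and the choice of "near" as the lemma are all assumptions made for illustration; only the three word forms and their frequencies are taken from the results on this page.

```python
# Hypothetical hand-built table: surface form -> lemma (assumed layout).
LEXICON = {
    "near": "near",
    "nearer": "near",
    "nearest": "near",
}

# Corpus frequencies as shown in the results on this page.
FREQ = {"near": 1153, "nearer": 72, "nearest": 64}


def related_forms(word: str) -> list[str]:
    """Return every form sharing the word's lemma, most frequent first.

    Words missing from the table give an empty list, since the lexicon
    does not cover the whole corpus.
    """
    lemma = LEXICON.get(word.lower())
    if lemma is None:
        return []
    forms = [form for form, lem in LEXICON.items() if lem == lemma]
    return sorted(forms, key=lambda form: -FREQ.get(form, 0))


if __name__ == "__main__":
    print(related_forms("nearest"))
    print(related_forms("sonsie"))
```

A dictionary lookup involves no distance calculation at all, which fits with this method having the shortest execution time of the five.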