A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to nicolas in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
nicolas (0) - 1 freq
nicola's (1) - 16 freq
nicol's (1) - 2 freq
nicola (1) - 128 freq
nicholas (1) - 4 freq
nicoles (1) - 1 freq
'nicholas (2) - 1 freq
nicoll (2) - 5 freq
nicklaus (2) - 1 freq
nichols (2) - 27 freq
nikolai (2) - 2 freq
nicol (2) - 19 freq
nicola'd (2) - 1 freq
nicole (2) - 2 freq
colas (2) - 1 freq
nicola’s (2) - 1 freq
nikola (2) - 3 freq
nichts (3) - 156 freq
niklaus (3) - 1 freq
scolds (3) - 2 freq
idol's (3) - 4 freq
noas (3) - 2 freq
nicus (3) - 1 freq
finola (3) - 1 freq
nicholl (3) - 2 freq
Double Levenshtein
nicolas (0) - 1 freq
nicoles (1) - 1 freq
nicola (2) - 128 freq
nicholas (2) - 4 freq
nicol's (2) - 2 freq
nicola's (2) - 16 freq
nicole (3) - 2 freq
colas (3) - 1 freq
nicol (3) - 19 freq
nicoll (3) - 5 freq
nicklaus (3) - 1 freq
nichols (3) - 27 freq
uncles (4) - 19 freq
niches (4) - 1 freq
nirls (4) - 1 freq
nibals (4) - 1 freq
nicolytes (4) - 1 freq
coles (4) - 4 freq
nice's (4) - 2 freq
nic's (4) - 1 freq
incels (4) - 1 freq
nicolson (4) - 1 freq
niklaus (4) - 1 freq
nicola’s (4) - 1 freq
nicola'd (4) - 1 freq
SoundEx code - N242
nezahualcoyotl - 4 freq
nazi-lookin - 1 freq
nosey-like - 1 freq
neglectit - 7 freq
noiseless - 1 freq
necklace - 27 freq
nicolson - 1 freq
njal's - 4 freq
nosy-walkin' - 1 freq
neglect - 9 freq
niklaus - 1 freq
neglected - 6 freq
nicholas - 4 freq
'nicholas - 1 freq
negleckit - 3 freq
nicola's - 16 freq
negleck - 7 freq
negligence - 1 freq
negleckin - 1 freq
neglectin - 2 freq
nichols - 27 freq
nicolas - 1 freq
ænchils - 1 freq
nclusion - 1 freq
necklaces - 3 freq
nasals - 1 freq
neo-classical - 3 freq
neglekkit - 1 freq
neglek - 2 freq
nicholson's - 1 freq
‘negligent - 1 freq
nicholson - 1 freq
neglectfu - 1 freq
nickelson - 1 freq
neglecit - 1 freq
nicklaus - 1 freq
nicol's - 2 freq
nicolasturgeon - 94 freq
nicolejunemcka - 11 freq
nicolacharters - 1 freq
nogoals - 1 freq
nicoles - 1 freq
nicola’s - 1 freq
ncglasgreens - 1 freq
nigella's - 1 freq
nicolaesen's - 1 freq
noclikalinks - 1 freq
nkhlkjqj - 1 freq
nicolashatton - 1 freq
negligee - 1 freq
nicholsonn - 1 freq
nicklezard - 1 freq
MetaPhone code - NKLS
knuckles - 29 freq
necklace - 27 freq
niklaus - 1 freq
nicola's - 16 freq
nicolas - 1 freq
nicklaus - 1 freq
nicol's - 2 freq
nogoals - 1 freq
nicoles - 1 freq
nicola’s - 1 freq
Manually curated - NICOLAS
Time to execute Levenshtein function - 0.173432 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
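As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] = distance between the part of a processed so far
    # and the first j characters of b (initially: the empty prefix of a).
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("nicola", "nicholas", "nicole", "nichts"):
        print(word, levenshtein("nicolas", word))
```

Run against words from the listing, the sketch gives 1 for nicola and nicholas, 2 for nicole and 3 for nichts, in line with the distances shown in brackets.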
Time to execute Double Levenshtein function - 0.346539 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
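The following Python sketch shows one way the two passes could be combined. It is an assumption-laden illustration, not the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set (a, e, i, o, u and y) is a guess at what counts as a vowel here.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping consonants and punctuation in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


if __name__ == "__main__":
    for word in ("nicoles", "nicola", "nicole", "uncles"):
        print(word, double_levenshtein("nicolas", word))
```

For nicoles the vowel-less forms are identical (ncls), so only the single vowel change counts and the total is 1. For nicola the missing s is counted in both passes, giving 2.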
Time to execute SoundEx function - 0.027796 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
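To make the encoding concrete, here is a short Python sketch of the classic American Soundex rules (keep the first letter, map the remaining consonants to digits, collapse repeats, pad to four characters). It is an illustration only; the implementation behind this page may differ in edge cases, and the names `SOUNDEX_CODES` and `soundex` are chosen for the example.

```python
# Consonant groups used by classic Soundex; vowels, h, w and y get no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if no ASCII letters."""
    # Assumption: anything that is not a-z (apostrophes, hyphens,
    # accented characters) is simply ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    result = first.upper()
    previous = SOUNDEX_CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeated code is kept
    return (result + "000")[:4]  # pad with zeros, keep four characters


if __name__ == "__main__":
    for word in ("nicolas", "necklace", "nicholas", "njal's"):
        print(word, soundex(word))
```

All four sample words encode to N242, which is why necklace turns up alongside nicholas in the SoundEx results.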
Time to execute MetaPhone function - 0.036800 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000817 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
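A lookup of this kind can be sketched in Python as a plain dictionary. Everything in the example is hypothetical: the table contents, the lemma labels and the names `LEXICON` and `lookup_lemma` are made up for illustration and are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical hand-made entries: spelling variant -> lemma.
LEXICON = {
    "neglectit": "NEGLECT",
    "negleckit": "NEGLECT",
    "neglekkit": "NEGLECT",
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for word, or None if it is not covered."""
    return lexicon.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Negleckit"))  # covered by the sample table
    print(lookup_lemma("ahint"))      # not in the sample table
```

Because the table is built by hand, a miss (`None`) only means the word has not been curated yet, not that it has no variants.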