A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to nosey-like in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
nosey-like (0) - 1 freq
cosy-like (2) - 2 freq
sojer-like (3) - 1 freq
wise-like (3) - 3 freq
wyse-like (3) - 1 freq
body-like (3) - 1 freq
nippy-like (3) - 3 freq
some-like (3) - 1 freq
queer-like (4) - 5 freq
douce-like (4) - 8 freq
awfy-like (4) - 1 freq
canny-like (4) - 1 freq
uneasy-like (4) - 1 freq
coorse-like (4) - 1 freq
leddy-like (4) - 1 freq
hastylike (4) - 1 freq
nordic-like (4) - 1 freq
manly-like (4) - 1 freq
maze-like (4) - 1 freq
saw-like (4) - 1 freq
nosedive (4) - 1 freq
wyce-like (4) - 15 freq
biker-like (4) - 1 freq
somelike (4) - 2 freq
dour-like (4) - 1 freq
nosey-like (0) - 1 freq
cosy-like (3) - 2 freq
uneasy-like (4) - 1 freq
wyse-like (4) - 1 freq
wise-like (4) - 3 freq
nate-lik (5) - 1 freq
some-like (5) - 1 freq
nippy-like (5) - 3 freq
body-like (5) - 1 freq
roit-like (6) - 1 freq
crouse-like (6) - 1 freq
funny-like (6) - 2 freq
wiselike (6) - 6 freq
meek-like (6) - 1 freq
uncle-like (6) - 1 freq
joco-like (6) - 1 freq
quaet-like (6) - 2 freq
taen-like (6) - 3 freq
pensie-like (6) - 1 freq
cosh-lyke (6) - 1 freq
wice-like (6) - 3 freq
ivy-like (6) - 1 freq
sae-lik (6) - 2 freq
unco-like (6) - 4 freq
owre-like (6) - 1 freq
SoundEx code - N242
nezahualcoyotl - 4 freq
nazi-lookin - 1 freq
nosey-like - 1 freq
neglectit - 7 freq
noiseless - 1 freq
necklace - 28 freq
nicolson - 1 freq
njal's - 4 freq
nosy-walkin' - 1 freq
neglect - 9 freq
niklaus - 1 freq
neglected - 6 freq
nicholas - 4 freq
'nicholas - 1 freq
negleckit - 3 freq
nicola's - 16 freq
negleck - 7 freq
negligence - 1 freq
negleckin - 1 freq
neglectin - 2 freq
nichols - 27 freq
nicolas - 1 freq
ænchils - 1 freq
nclusion - 1 freq
necklaces - 3 freq
nasals - 1 freq
neo-classical - 3 freq
neglekkit - 1 freq
neglek - 2 freq
nicholson's - 1 freq
€˜negligent - 1 freq
nicholson - 1 freq
neglectfu - 1 freq
nickelson - 1 freq
neglecit - 1 freq
nicklaus - 1 freq
nicol's - 2 freq
nicolasturgeon - 94 freq
nicolejunemcka - 11 freq
nicolacharters - 1 freq
nogoals - 1 freq
nicoles - 1 freq
nicolaÂ’s - 1 freq
ncglasgreens - 1 freq
nigella's - 1 freq
nicolaesen's - 1 freq
noclikalinks - 1 freq
nkhlkjqj - 1 freq
nicolashatton - 1 freq
negligee - 1 freq
nicholsonn - 1 freq
nicklezard - 1 freq
MetaPhone code - NSLK
nosey-like - 1 freq
gnsalq - 1 freq
NOSEY-LIKE
Time to execute Levenshtein function - 0.233098 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.410858 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031396 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039356 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000943 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.