A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to nosy-walkin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
nosy-walkin' (1) - 1 freq
oot-walin (4) - 1 freq
hill-walkin (4) - 2 freq
boak-makkin (5) - 3 freq
nostalgia (5) - 9 freq
raipwalkin (5) - 1 freq
hillwalkin (5) - 4 freq
swallin (5) - 11 freq
nostalgic (5) - 8 freq
netwarkin (5) - 1 freq
sleep-walkin (5) - 3 freq
nasty-lookin (5) - 1 freq
stalkin (5) - 7 freq
rorywalker (5) - 11 freq
wide-waukin (5) - 1 freq
free-warkin (5) - 1 freq
no-cannin (5) - 1 freq
staalkin (5) - 2 freq
yalkin (5) - 2 freq
kin-walin (5) - 1 freq
walkin (5) - 299 freq
hard-warkin (5) - 3 freq
waalkin (6) - 6 freq
a-winkin (6) - 3 freq
€™s-jackie (6) - 1 freq
nosy-walkin' (2) - 1 freq
oot-walin (7) - 1 freq
hill-walkin (7) - 2 freq
nasty-lookin (7) - 1 freq
wide-waukin (8) - 1 freq
free-warkin (8) - 1 freq
stalkin (8) - 7 freq
staalkin (8) - 2 freq
walkin (8) - 299 freq
kin-walin (8) - 1 freq
swilken (8) - 1 freq
sleep-walkin (8) - 3 freq
raipwalkin (8) - 1 freq
nazi-lookin (8) - 1 freq
nosey-like (8) - 1 freq
netwarkin (8) - 1 freq
hillwalkin (8) - 4 freq
swallin (8) - 11 freq
non-livin (9) - 1 freq
nyowlan (9) - 1 freq
swillin (9) - 2 freq
nasty-looking (9) - 1 freq
non-work (9) - 1 freq
nestlin (9) - 2 freq
walkan (9) - 8 freq
SoundEx code - N242
nezahualcoyotl - 4 freq
nazi-lookin - 1 freq
nosey-like - 1 freq
neglectit - 7 freq
noiseless - 1 freq
necklace - 28 freq
nicolson - 1 freq
njal's - 4 freq
nosy-walkin' - 1 freq
neglect - 9 freq
niklaus - 1 freq
neglected - 6 freq
nicholas - 4 freq
'nicholas - 1 freq
negleckit - 3 freq
nicola's - 16 freq
negleck - 7 freq
negligence - 1 freq
negleckin - 1 freq
neglectin - 2 freq
nichols - 27 freq
nicolas - 1 freq
ænchils - 1 freq
nclusion - 1 freq
necklaces - 3 freq
nasals - 1 freq
neo-classical - 3 freq
neglekkit - 1 freq
neglek - 2 freq
nicholson's - 1 freq
€˜negligent - 1 freq
nicholson - 1 freq
neglectfu - 1 freq
nickelson - 1 freq
neglecit - 1 freq
nicklaus - 1 freq
nicol's - 2 freq
nicolasturgeon - 94 freq
nicolejunemcka - 11 freq
nicolacharters - 1 freq
nogoals - 1 freq
nicoles - 1 freq
nicolaÂ’s - 1 freq
ncglasgreens - 1 freq
nigella's - 1 freq
nicolaesen's - 1 freq
noclikalinks - 1 freq
nkhlkjqj - 1 freq
nicolashatton - 1 freq
negligee - 1 freq
nicholsonn - 1 freq
nicklezard - 1 freq
MetaPhone code - NSWLKN
nosy-walkin' - 1 freq
NOSY-WALKIN
Time to execute Levenshtein function - 0.210915 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.406856 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028375 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044204 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000921 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.