A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to naughty in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
naughty (0) - 8 freq
haughty (1) - 3 freq
naught (1) - 27 freq
draughty (2) - 1 freq
eaught (2) - 1 freq
saught (2) - 1 freq
caught (2) - 191 freq
nought (2) - 20 freq
naucht (2) - 3 freq
aught (2) - 4 freq
pauchty (2) - 2 freq
hauchty (2) - 1 freq
faught (2) - 2 freq
mauchty (2) - 1 freq
nighty (2) - 1 freq
noughte (2) - 1 freq
doughty (2) - 3 freq
taught (2) - 66 freq
laught (2) - 1 freq
sought (3) - 10 freq
draught (3) - 6 freq
eighty (3) - 12 freq
auchth (3) - 1 freq
laughs (3) - 56 freq
doughy (3) - 1 freq
naughty (0) - 8 freq
naught (1) - 27 freq
nighty (2) - 1 freq
nought (2) - 20 freq
noughte (2) - 1 freq
haughty (2) - 3 freq
doughty (3) - 3 freq
taught (3) - 66 freq
noght (3) - 1 freq
nyght (3) - 3 freq
nght (3) - 2 freq
laught (3) - 1 freq
night (3) - 923 freq
naucht (3) - 3 freq
eaught (3) - 1 freq
caught (3) - 191 freq
saught (3) - 1 freq
aught (3) - 4 freq
faught (3) - 2 freq
nmght (4) - 1 freq
dighty (4) - 2 freq
nightie (4) - 1 freq
oaght (4) - 1 freq
en'ught (4) - 1 freq
tought (4) - 3 freq
SoundEx code - N230
nixt - 473 freq
nicht - 2261 freq
next - 916 freq
night - 923 freq
nest - 81 freq
nocht - 209 freq
naisty - 9 freq
neist - 560 freq
nght - 2 freq
nmght - 1 freq
nosewhit - 1 freq
nakit - 30 freq
'next - 1 freq
naked - 21 freq
newest - 15 freq
nyackit - 1 freq
naysaid - 1 freq
nasty - 27 freq
nesty - 38 freq
nassed - 1 freq
nickt - 2 freq
nicked - 11 freq
nyakit - 3 freq
necht - 5 freq
neckit - 2 freq
nochty - 1 freq
neukt - 1 freq
noust - 2 freq
nicht' - 2 freq
nocket - 1 freq
nokket - 2 freq
nyakkit - 2 freq
'nixt - 1 freq
nosed - 6 freq
nekit - 2 freq
neist' - 1 freq
naukit - 6 freq
night' - 3 freq
neest - 88 freq
nickit - 4 freq
nigged - 1 freq
ncht - 5 freq
newsed - 6 freq
necd - 2 freq
naggit - 1 freq
nightah - 1 freq
naughty - 8 freq
ïnside - 2 freq
nashed - 4 freq
nagged - 3 freq
nyaakit - 4 freq
na-said - 1 freq
naught - 27 freq
nest - 1 freq
nacht - 2 freq
nought - 20 freq
'night' - 1 freq
nestie - 7 freq
noost - 4 freq
'nesty - 1 freq
neisty - 1 freq
nichtie - 6 freq
nighty - 1 freq
noughte - 1 freq
nichty - 3 freq
nyght - 3 freq
noght - 1 freq
necked - 1 freq
nugget - 8 freq
nuzzied - 1 freq
naist - 7 freq
noecht - 1 freq
niest - 2 freq
nockt - 1 freq
'nocht - 1 freq
naakit - 2 freq
negate - 2 freq
nicety - 1 freq
noaked - 1 freq
nycht - 4 freq
nochtie - 3 freq
nicht - 35 freq
nikket - 1 freq
nichit - 1 freq
neukit - 1 freq
neixt - 3 freq
neshed - 1 freq
night - 1 freq
neext - 1 freq
neast - 1 freq
ngt - 1 freq
nightie - 1 freq
necktie - 1 freq
next - 1 freq
neist - 1 freq
next - 1 freq
noked - 1 freq
nocht - 4 freq
naucht - 3 freq
naistie - 1 freq
na-sayed - 1 freq
nekst - 1 freq
noucht - 1 freq
nekid - 1 freq
nxt - 38 freq
nkt - 1 freq
naakid - 1 freq
n’est - 2 freq
nickety - 1 freq
nackety - 1 freq
nkst - 1 freq
next” - 1 freq
nest- - 1 freq
nkzd - 1 freq
njdy - 1 freq
nact - 1 freq
nzt - 1 freq
njoyed - 4 freq
nocked - 1 freq
nasuwt - 1 freq
night’ - 1 freq
neekid - 2 freq
MetaPhone code - NFT
night - 923 freq
nght - 2 freq
nifty - 3 freq
niftie - 1 freq
knight - 10 freq
night' - 3 freq
nightah - 1 freq
nevet - 1 freq
naughty - 8 freq
ïnvite - 3 freq
naught - 27 freq
nought - 20 freq
'night' - 1 freq
nighty - 1 freq
noughte - 1 freq
nyght - 3 freq
noght - 1 freq
n'avait - 1 freq
night - 1 freq
nightie - 1 freq
knifed - 1 freq
knight' - 1 freq
night’ - 1 freq
nvt - 1 freq
NAUGHTY
Time to execute Levenshtein function - 0.215775 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.496401 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027397 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037142 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000756 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.