A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to night in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
night (0) - 955 freq
noght (1) - 1 freq
nicht (1) - 2275 freq
nights (1) - 68 freq
nigh (1) - 18 freq
eight (1) - 69 freq
nyght (1) - 3 freq
ight (1) - 1 freq
right (1) - 1436 freq
nght (1) - 2 freq
hight (1) - 2 freq
knight (1) - 10 freq
light (1) - 300 freq
tight (1) - 75 freq
night' (1) - 3 freq
gight (1) - 3 freq
nmght (1) - 1 freq
wight (1) - 1 freq
dight (1) - 3 freq
fight (1) - 95 freq
nighty (1) - 1 freq
might (1) - 421 freq
sight (1) - 134 freq
nippt (2) - 5 freq
aicht (2) - 33 freq
night (0) - 955 freq
nght (1) - 2 freq
nighty (1) - 1 freq
nyght (1) - 3 freq
noght (1) - 1 freq
fight (2) - 95 freq
wight (2) - 1 freq
nmght (2) - 1 freq
nought (2) - 32 freq
might (2) - 421 freq
naught (2) - 27 freq
gight (2) - 3 freq
sight (2) - 134 freq
nightie (2) - 1 freq
dight (2) - 3 freq
nigh (2) - 18 freq
eight (2) - 69 freq
night' (2) - 3 freq
nights (2) - 68 freq
nicht (2) - 2275 freq
right (2) - 1436 freq
ight (2) - 1 freq
tight (2) - 75 freq
light (2) - 300 freq
knight (2) - 10 freq
SoundEx code - N230
nixt - 473 freq
nicht - 2275 freq
next - 955 freq
night - 955 freq
nest - 81 freq
nocht - 211 freq
naisty - 10 freq
neist - 561 freq
nght - 2 freq
nmght - 1 freq
nosewhit - 1 freq
nakit - 30 freq
'next - 1 freq
naked - 22 freq
newest - 15 freq
nyackit - 1 freq
naysaid - 1 freq
nasty - 27 freq
nesty - 38 freq
nassed - 1 freq
nickt - 2 freq
nicked - 14 freq
nyakit - 5 freq
necht - 5 freq
neckit - 2 freq
nochty - 1 freq
neukt - 1 freq
noust - 2 freq
nicht' - 2 freq
nocket - 1 freq
nokket - 2 freq
nyakkit - 2 freq
naughty - 9 freq
necked - 2 freq
nought - 32 freq
'nought - 1 freq
nikita - 1 freq
'nixt - 1 freq
nosed - 6 freq
nekit - 2 freq
neist' - 1 freq
naukit - 6 freq
night' - 3 freq
neest - 88 freq
nickit - 4 freq
nigged - 1 freq
ncht - 5 freq
newsed - 6 freq
necd - 2 freq
naggit - 1 freq
nightah - 1 freq
ïnside - 2 freq
nashed - 4 freq
nagged - 3 freq
nyaakit - 4 freq
na-said - 1 freq
naught - 27 freq
nest - 1 freq
nacht - 2 freq
'night' - 1 freq
nestie - 7 freq
noost - 4 freq
'nesty - 1 freq
neisty - 1 freq
nichtie - 6 freq
nighty - 1 freq
noughte - 1 freq
nichty - 3 freq
nyght - 3 freq
noght - 1 freq
nugget - 8 freq
nuzzied - 1 freq
naist - 7 freq
noecht - 1 freq
niest - 2 freq
nockt - 1 freq
'nocht - 1 freq
naakit - 2 freq
negate - 2 freq
nicety - 1 freq
noaked - 1 freq
nycht - 4 freq
nochtie - 3 freq
nicht - 35 freq
nikket - 1 freq
nichit - 1 freq
neukit - 1 freq
neixt - 3 freq
neshed - 1 freq
night - 1 freq
neext - 1 freq
neast - 1 freq
ngt - 1 freq
nightie - 1 freq
necktie - 1 freq
next - 1 freq
neist - 1 freq
next - 1 freq
noked - 1 freq
nocht - 4 freq
naucht - 3 freq
naistie - 1 freq
na-sayed - 1 freq
nekst - 1 freq
noucht - 1 freq
nekid - 1 freq
nxt - 38 freq
nkt - 1 freq
naakid - 1 freq
n’est - 2 freq
nickety - 1 freq
nackety - 1 freq
nkst - 1 freq
next” - 1 freq
nest- - 1 freq
nkzd - 1 freq
njdy - 1 freq
nact - 1 freq
nzt - 1 freq
njoyed - 4 freq
nocked - 1 freq
nasuwt - 1 freq
night’ - 1 freq
neekid - 2 freq
MetaPhone code - NFT
night - 955 freq
nght - 2 freq
nifty - 3 freq
niftie - 1 freq
knight - 10 freq
naughty - 9 freq
nought - 32 freq
'nought - 1 freq
night' - 3 freq
nightah - 1 freq
nevet - 1 freq
ïnvite - 3 freq
naught - 27 freq
'night' - 1 freq
nighty - 1 freq
noughte - 1 freq
nyght - 3 freq
noght - 1 freq
n'avait - 1 freq
night - 1 freq
nightie - 1 freq
knifed - 1 freq
knight' - 1 freq
night’ - 1 freq
nvt - 1 freq
NIGHT
nicht - 2275 freq
night - 955 freq
nights - 68 freq
Time to execute Levenshtein function - 0.207775 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.393719 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030743 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042369 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001108 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.