A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to nith in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
nith (0) - 12 freq
noth (1) - 5 freq
neth (1) - 5 freq
dith (1) - 1 freq
nits (1) - 21 freq
lith (1) - 1 freq
pith (1) - 9 freq
kith (1) - 13 freq
nih (1) - 1 freq
eith (1) - 26 freq
sith (1) - 6 freq
nitb (1) - 1 freq
with (1) - 856 freq
ninth (1) - 10 freq
nit (1) - 10 freq
ith (1) - 2 freq
aith (1) - 23 freq
nite (1) - 87 freq
nigh (1) - 18 freq
fith (1) - 1 freq
mith (1) - 63 freq
kits (2) - 3 freq
inwith (2) - 11 freq
tilth (2) - 4 freq
'with (2) - 1 freq
nith (0) - 12 freq
neth (1) - 5 freq
noth (1) - 5 freq
fith (2) - 1 freq
nigh (2) - 18 freq
ith (2) - 2 freq
mith (2) - 63 freq
aith (2) - 23 freq
aneith (2) - 8 freq
neath (2) - 4 freq
anaith (2) - 6 freq
aneth (2) - 177 freq
nit (2) - 10 freq
nite (2) - 87 freq
ninth (2) - 10 freq
kith (2) - 13 freq
lith (2) - 1 freq
nits (2) - 21 freq
dith (2) - 1 freq
nih (2) - 1 freq
pith (2) - 9 freq
eith (2) - 26 freq
sith (2) - 6 freq
with (2) - 856 freq
nitb (2) - 1 freq
SoundEx code - N300
nut - 127 freq
nod - 107 freq
need - 1780 freq
not - 712 freq
neat - 42 freq
nutt - 2 freq
nott - 55 freq
note - 238 freq
nd - 88 freq
nowt - 116 freq
net - 87 freq
needy - 7 freq
'ned' - 1 freq
'not - 8 freq
'need - 2 freq
ned - 42 freq
needty - 1 freq
nate - 9 freq
nouat - 1 freq
'nowt - 2 freq
neth - 5 freq
needae - 1 freq
nith - 12 freq
notie - 8 freq
nout - 2 freq
neid - 1 freq
noat - 6 freq
noad - 11 freq
nutty - 6 freq
natty - 2 freq
nato - 5 freq
-nut - 1 freq
now-at - 1 freq
nite - 87 freq
naet - 3 freq
neddie - 1 freq
neath - 4 freq
neddy - 1 freq
nit - 10 freq
nudie - 2 freq
ïntae - 93 freq
neyt - 3 freq
needie - 3 freq
nat - 21 freq
nied - 3 freq
nd - 1 freq
notae - 1 freq
'ned - 1 freq
'nd' - 2 freq
næmt - 2 freq
'need' - 1 freq
nooat - 1 freq
nt - 2 freq
nd - 1 freq
nt - 34 freq
n't - 2 freq
noth - 5 freq
nuid - 1 freq
'neath - 2 freq
net - 1 freq
nuit' - 1 freq
'neth - 1 freq
nait - 1 freq
not - 8 freq
'nut' - 1 freq
nede - 3 freq
nude - 4 freq
not - 3 freq
nuyt - 1 freq
netta - 6 freq
need - 2 freq
nato- - 1 freq
noida - 1 freq
node - 4 freq
needa - 1 freq
nitty - 1 freq
neth - 1 freq
nyte - 1 freq
‘not’ - 1 freq
nou oot - 1 freq
newt - 1 freq
nowt' - 5 freq
“netty - 1 freq
nud - 1 freq
noddy - 1 freq
MetaPhone code - N0
neth - 5 freq
nith - 12 freq
neath - 4 freq
noth - 5 freq
'neath - 2 freq
'neth - 1 freq
neth - 1 freq
NITH
Time to execute Levenshtein function - 0.210379 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337120 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029083 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036746 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000841 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.