A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to neath in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
neath (0) - 4 freq
heath (1) - 2 freq
neat (1) - 42 freq
death (1) - 168 freq
neamh (1) - 1 freq
neth (1) - 5 freq
teath (1) - 1 freq
aneath (1) - 105 freq
beath (1) - 3 freq
'neath (1) - 2 freq
dath (2) - 1 freq
yeats (2) - 1 freq
noat (2) - 6 freq
deith (2) - 102 freq
seats (2) - 76 freq
peats (2) - 49 freq
yeat (2) - 2 freq
sneith (2) - 2 freq
eth (2) - 8 freq
leathr (2) - 1 freq
meyth (2) - 1 freq
nean (2) - 1 freq
noah (2) - 22 freq
north (2) - 391 freq
neave (2) - 3 freq
neath (0) - 4 freq
neth (1) - 5 freq
aneath (1) - 105 freq
nith (2) - 12 freq
aneth (2) - 177 freq
noth (2) - 5 freq
'neath (2) - 2 freq
aneith (2) - 8 freq
beath (2) - 3 freq
heath (2) - 2 freq
death (2) - 168 freq
neamh (2) - 1 freq
neat (2) - 42 freq
teath (2) - 1 freq
earth (3) - 251 freq
niamh (3) - 16 freq
tenth (3) - 14 freq
beith (3) - 5 freq
nah (3) - 86 freq
math (3) - 6 freq
loath (3) - 4 freq
nets (3) - 46 freq
ninth (3) - 10 freq
neyt (3) - 3 freq
vath (3) - 1 freq
SoundEx code - N300
nut - 127 freq
nod - 107 freq
need - 1780 freq
not - 712 freq
neat - 42 freq
nutt - 2 freq
nott - 55 freq
note - 238 freq
nd - 88 freq
nowt - 116 freq
net - 87 freq
needy - 7 freq
'ned' - 1 freq
'not - 8 freq
'need - 2 freq
ned - 42 freq
needty - 1 freq
nate - 9 freq
nouat - 1 freq
'nowt - 2 freq
neth - 5 freq
needae - 1 freq
nith - 12 freq
notie - 8 freq
nout - 2 freq
neid - 1 freq
noat - 6 freq
noad - 11 freq
nutty - 6 freq
natty - 2 freq
nato - 5 freq
-nut - 1 freq
now-at - 1 freq
nite - 87 freq
naet - 3 freq
neddie - 1 freq
neath - 4 freq
neddy - 1 freq
nit - 10 freq
nudie - 2 freq
ïntae - 93 freq
neyt - 3 freq
needie - 3 freq
nat - 21 freq
nied - 3 freq
nd - 1 freq
notae - 1 freq
'ned - 1 freq
'nd' - 2 freq
næmt - 2 freq
'need' - 1 freq
nooat - 1 freq
nt - 2 freq
nd - 1 freq
nt - 34 freq
n't - 2 freq
noth - 5 freq
nuid - 1 freq
'neath - 2 freq
net - 1 freq
nuit' - 1 freq
'neth - 1 freq
nait - 1 freq
not - 8 freq
'nut' - 1 freq
nede - 3 freq
nude - 4 freq
not - 3 freq
nuyt - 1 freq
netta - 6 freq
need - 2 freq
nato- - 1 freq
noida - 1 freq
node - 4 freq
needa - 1 freq
nitty - 1 freq
neth - 1 freq
nyte - 1 freq
‘not’ - 1 freq
nou oot - 1 freq
newt - 1 freq
nowt' - 5 freq
“netty - 1 freq
nud - 1 freq
noddy - 1 freq
MetaPhone code - N0
neth - 5 freq
nith - 12 freq
neath - 4 freq
noth - 5 freq
'neath - 2 freq
'neth - 1 freq
neth - 1 freq
NEATH
Time to execute Levenshtein function - 0.213233 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373454 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028976 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040489 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001083 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.