A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to naehin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
naehin (0) - 3 freq
nashin (1) - 3 freq
nathin (1) - 6 freq
naemin (1) - 1 freq
naein (1) - 1 freq
baehin (1) - 11 freq
naethin (1) - 800 freq
naeyin (1) - 2 freq
na'hin (1) - 2 freq
saein (2) - 3 freq
machin (2) - 1 freq
haevin (2) - 2 freq
naithin (2) - 3 freq
fashin (2) - 28 freq
neebin (2) - 2 freq
daein (2) - 882 freq
nein (2) - 4 freq
naeithin (2) - 1 freq
awhin (2) - 1 freq
nuthin (2) - 66 freq
saemin (2) - 1 freq
gathin (2) - 2 freq
washin (2) - 85 freq
gaein (2) - 147 freq
nithin (2) - 71 freq
Double Levenshtein
naehin (0) - 3 freq
nohin (2) - 1 freq
nuhin (2) - 48 freq
oniehin (2) - 3 freq
nihin (2) - 5 freq
naeyin (2) - 2 freq
na'hin (2) - 2 freq
naethin (2) - 800 freq
naemin (2) - 1 freq
nathin (2) - 6 freq
nashin (2) - 3 freq
baehin (2) - 11 freq
naein (2) - 1 freq
naen (3) - 10 freq
lehin (3) - 4 freq
nevin (3) - 1 freq
behin (3) - 41 freq
newin (3) - 1 freq
nedin (3) - 1 freq
aathin (3) - 203 freq
ahin (3) - 262 freq
aa'hin (3) - 1 freq
namin (3) - 7 freq
nasin (3) - 1 freq
naewan (3) - 22 freq
SoundEx code - N500
nane - 547 freq
nae'n - 1 freq
name - 1220 freq
neen - 131 freq
nem - 53 freq
newin - 1 freq
nuhin - 48 freq
nine - 234 freq
none - 57 freq
naewan - 22 freq
'name - 2 freq
noon - 29 freq
non - 39 freq
'nemm - 2 freq
na-na - 1 freq
n'en - 6 freq
nuin - 14 freq
name' - 3 freq
nan - 14 freq
'nane - 5 freq
neem - 22 freq
neon - 7 freq
nanny - 7 freq
naim - 5 freq
neyn - 1 freq
'nuhin - 1 freq
nuhin- - 1 freq
nim - 9 freq
nun - 10 freq
nono - 1 freq
nino - 1 freq
naein - 1 freq
naen - 10 freq
naomh - 1 freq
'nine - 2 freq
nein - 4 freq
'nein - 1 freq
nimmo - 7 freq
niné - 1 freq
nen - 6 freq
nain - 5 freq
nahum - 3 freq
'no-no' - 1 freq
numm - 1 freq
naomi - 14 freq
nannie - 4 freq
nemm - 6 freq
nyne - 4 freq
naem - 48 freq
nano- - 1 freq
nemme - 9 freq
non- - 1 freq
nana - 11 freq
nina - 1 freq
naime - 7 freq
nehm - 1 freq
nummi - 3 freq
nohin - 1 freq
nemea - 9 freq
noun - 48 freq
neamh - 1 freq
nayme - 2 freq
'none - 1 freq
nönin - 1 freq
‘nannie - 1 freq
naeyin - 2 freq
“nane - 3 freq
nin - 1 freq
naehin - 3 freq
na'hin - 2 freq
nom - 2 freq
‘name - 1 freq
nehemiah - 1 freq
nu'hin' - 5 freq
neon- - 1 freq
“neen - 2 freq
naun - 1 freq
nuhin' - 1 freq
niamh - 16 freq
‘niamh - 1 freq
nien - 5 freq
neean - 1 freq
naamah - 2 freq
no-one - 3 freq
’nine - 1 freq
nihin - 5 freq
neeenaaw - 1 freq
niemi - 1 freq
nean - 1 freq
nani - 1 freq
nyum - 2 freq
nuhin” - 1 freq
'naan' - 1 freq
naan - 2 freq
nanna - 1 freq
nyhn - 1 freq
nymy - 1 freq
nym - 1 freq
nyummy - 1 freq
MetaPhone code - NHN
nuhin - 48 freq
'nuhin - 1 freq
nuhin- - 1 freq
nohin - 1 freq
naehin - 3 freq
na'hin - 2 freq
nu'hin' - 5 freq
nuhin' - 1 freq
nihin - 5 freq
nuhin” - 1 freq
Manually curated
NAEHIN
Time to execute Levenshtein function - 0.246877 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
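As a rough illustration of that definition, here is a minimal Python sketch of the usual dynamic-programming calculation. The function name `levenshtein` is our own choice; the code behind this page is not shown and may well differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # prev[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]  # turning i characters into an empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(
                prev[j] + 1,         # delete ca
                cur[j - 1] + 1,      # insert cb
                prev[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        prev = cur
    return prev[-1]


print(levenshtein("naehin", "naethin"))  # 1: insert "t"
print(levenshtein("naehin", "daein"))    # 2: replace "n" with "d", delete "h"
```

Both results agree with the distances shown in brackets in the Levenshtein list above.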
Time to execute Double Levenshtein function - 0.402087 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
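The idea can be sketched in Python as follows. This is a guess at the mechanics, not the site's code: the names `strip_vowels` and `double_levenshtein` are ours, and we assume that "vowels" means the letters a, e, i, o, u and that they are stripped from both words.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost))
        prev = cur
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop a, e, i, o, u (either case); keep everything else."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("naehin", "naethin"))  # 2: 1 (full) + 1 ("nhn" vs "nthn")
print(double_levenshtein("naehin", "nohin"))    # 2: 2 (full) + 0 ("nhn" vs "nhn")
print(double_levenshtein("naehin", "naen"))     # 3: 2 (full) + 1 ("nhn" vs "nn")
```

Under these assumptions the sketch reproduces the bracketed scores in the Double Levenshtein list: a vowel-only variant such as nohin is penalised once, while a changed consonant is counted in both passes.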
Time to execute SoundEx function - 0.028873 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
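To show how a word ends up with a code such as N500, here is a minimal Python sketch of the common American Soundex rules. It is an assumption that the page uses this variant; the function name `soundex` and the handling of non-ASCII letters (they are simply skipped) are our own choices.

```python
# Consonant groups that share a digit; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus up to three digits."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit  # a vowel resets prev, so a repeated digit is kept
    return (code + "000")[:4]  # pad with zeros, truncate to four characters


print(soundex("naehin"))  # N500
print(soundex("nane"))    # N500
print(soundex("Robert"))  # R163
```

Because only the first letter and the consonant classes survive, very different words (nane, name, noon) collapse to the same code, which is why the SoundEx list above is so much longer than the others.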
Time to execute MetaPhone function - 0.037598 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000916 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
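In code, a lookup of this kind amounts to little more than a dictionary access. The Python sketch below is purely illustrative: the `LEXICON` entries and the function name `lookup_lemma` are hypothetical and are not taken from the real hand-built table.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real lexicon is not shown here.
LEXICON = {
    "nathin": "naethin",
    "nuhin": "naethin",
    "nithin": "naethin",
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("Nuhin"))   # naethin
print(lookup_lemma("sonsie"))  # None (not in this toy table)
```

A single hash lookup involves no string comparison against the rest of the corpus, which is consistent with this method having the shortest execution time reported above.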