A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to na-said in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
na-said (0) - 1 freq
naysaid (1) - 1 freq
-said (2) - 2 freq
na-say (2) - 8 freq
na-sayed (2) - 1 freq
na-sayin (2) - 4 freq
na-says (2) - 3 freq
on-said (2) - 1 freq
a-sayin (3) - 1 freq
unsaid (3) - 6 freq
barmaid (3) - 21 freq
naggai (3) - 3 freq
ba-heid (3) - 1 freq
aaid (3) - 2 freq
hansard (3) - 3 freq
'said (3) - 5 freq
naakid (3) - 1 freq
nasal (3) - 9 freq
nasin (3) - 1 freq
nae-baud (3) - 1 freq
non-paid (3) - 1 freq
nasa (3) - 17 freq
naard (3) - 1 freq
narraed (3) - 1 freq
abraid (3) - 11 freq
na-said (0) - 1 freq
on-said (2) - 1 freq
na-sayed (2) - 1 freq
naysaid (2) - 1 freq
na-say (3) - 8 freq
na-says (3) - 3 freq
na-sayin (3) - 4 freq
-said (3) - 2 freq
nae-say (4) - 3 freq
nae-baud (4) - 1 freq
nassed (4) - 1 freq
unsaid (4) - 6 freq
ane-saet (5) - 1 freq
unafraid (5) - 1 freq
naesay (5) - 2 freq
na-na (5) - 1 freq
naistie (5) - 1 freq
nae-tail (5) - 1 freq
oanside (5) - 1 freq
newsed (5) - 6 freq
norside (5) - 1 freq
ansed (5) - 1 freq
said (5) - 11590 freq
nosed (5) - 6 freq
nursed (5) - 5 freq
SoundEx code - N230
nixt - 473 freq
nicht - 2275 freq
next - 955 freq
night - 955 freq
nest - 81 freq
nocht - 211 freq
naisty - 10 freq
neist - 561 freq
nght - 2 freq
nmght - 1 freq
nosewhit - 1 freq
nakit - 30 freq
'next - 1 freq
naked - 22 freq
newest - 15 freq
nyackit - 1 freq
naysaid - 1 freq
nasty - 27 freq
nesty - 38 freq
nassed - 1 freq
nickt - 2 freq
nicked - 14 freq
nyakit - 5 freq
necht - 5 freq
neckit - 2 freq
nochty - 1 freq
neukt - 1 freq
noust - 2 freq
nicht' - 2 freq
nocket - 1 freq
nokket - 2 freq
nyakkit - 2 freq
naughty - 9 freq
necked - 2 freq
nought - 32 freq
'nought - 1 freq
nikita - 1 freq
'nixt - 1 freq
nosed - 6 freq
nekit - 2 freq
neist' - 1 freq
naukit - 6 freq
night' - 3 freq
neest - 88 freq
nickit - 4 freq
nigged - 1 freq
ncht - 5 freq
newsed - 6 freq
necd - 2 freq
naggit - 1 freq
nightah - 1 freq
ïnside - 2 freq
nashed - 4 freq
nagged - 3 freq
nyaakit - 4 freq
na-said - 1 freq
naught - 27 freq
nest - 1 freq
nacht - 2 freq
'night' - 1 freq
nestie - 7 freq
noost - 4 freq
'nesty - 1 freq
neisty - 1 freq
nichtie - 6 freq
nighty - 1 freq
noughte - 1 freq
nichty - 3 freq
nyght - 3 freq
noght - 1 freq
nugget - 8 freq
nuzzied - 1 freq
naist - 7 freq
noecht - 1 freq
niest - 2 freq
nockt - 1 freq
'nocht - 1 freq
naakit - 2 freq
negate - 2 freq
nicety - 1 freq
noaked - 1 freq
nycht - 4 freq
nochtie - 3 freq
nicht - 35 freq
nikket - 1 freq
nichit - 1 freq
neukit - 1 freq
neixt - 3 freq
neshed - 1 freq
night - 1 freq
neext - 1 freq
neast - 1 freq
ngt - 1 freq
nightie - 1 freq
necktie - 1 freq
next - 1 freq
neist - 1 freq
next - 1 freq
noked - 1 freq
nocht - 4 freq
naucht - 3 freq
naistie - 1 freq
na-sayed - 1 freq
nekst - 1 freq
noucht - 1 freq
nekid - 1 freq
nxt - 38 freq
nkt - 1 freq
naakid - 1 freq
n’est - 2 freq
nickety - 1 freq
nackety - 1 freq
nkst - 1 freq
next” - 1 freq
nest- - 1 freq
nkzd - 1 freq
njdy - 1 freq
nact - 1 freq
nzt - 1 freq
njoyed - 4 freq
nocked - 1 freq
nasuwt - 1 freq
night’ - 1 freq
neekid - 2 freq
MetaPhone code - NST
nest - 81 freq
naisty - 10 freq
neist - 561 freq
naysaid - 1 freq
nasty - 27 freq
nesty - 38 freq
nassed - 1 freq
noust - 2 freq
wneist - 1 freq
nosed - 6 freq
neist' - 1 freq
neest - 88 freq
newsed - 6 freq
ïnside - 2 freq
na-said - 1 freq
nest - 1 freq
nestie - 7 freq
noost - 4 freq
'nesty - 1 freq
neisty - 1 freq
nuzzied - 1 freq
naist - 7 freq
niest - 2 freq
nicety - 1 freq
neast - 1 freq
neist - 1 freq
naistie - 1 freq
n’est - 2 freq
nest- - 1 freq
nzt - 1 freq
nasuwt - 1 freq
NA-SAID
Time to execute Levenshtein function - 0.243529 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.416171 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028750 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038669 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000942 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.