A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to nigh in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
nigh (0) - 18 freq
nih (1) - 1 freq
sigh (1) - 96 freq
righ (1) - 1 freq
high (1) - 619 freq
neigh (1) - 4 freq
nith (1) - 12 freq
night (1) - 955 freq
nights (2) - 68 freq
nip (2) - 119 freq
niche (2) - 7 freq
nxgu (2) - 1 freq
sig (2) - 3 freq
niki (2) - 1 freq
agh (2) - 11 freq
aige (2) - 11 freq
bigg (2) - 110 freq
migo (2) - 1 freq
gogh (2) - 1 freq
nag (2) - 6 freq
rig (2) - 45 freq
nghs (2) - 1 freq
pigs (2) - 97 freq
liga (2) - 1 freq
hig (2) - 4 freq
Double Levenshtein (distance in brackets)
nigh (0) - 18 freq
neigh (1) - 4 freq
nih (2) - 1 freq
anagh (2) - 1 freq
enogh (2) - 20 freq
ngho (2) - 1 freq
enugh (2) - 2 freq
night (2) - 955 freq
righ (2) - 1 freq
sigh (2) - 96 freq
nith (2) - 12 freq
high (2) - 619 freq
neig (3) - 1 freq
ng (3) - 6 freq
naah (3) - 3 freq
hygh (3) - 1 freq
enough (3) - 911 freq
unich (3) - 1 freq
nigel (3) - 49 freq
eneugh (3) - 49 freq
nsh (3) - 1 freq
heigh (3) - 7 freq
yogh (3) - 3 freq
righe (3) - 1 freq
ngs (3) - 1 freq
SoundEx code - N200
neck - 328 freq
nice - 425 freq
noisy - 13 freq
nock - 14 freq
neuk - 183 freq
noise - 152 freq
nae-say - 3 freq
necks - 25 freq
niece - 11 freq
nex - 22 freq
nosy - 6 freq
nose - 216 freq
nix - 7 freq
neck- - 1 freq
neuks - 30 freq
naigs - 3 freq
news - 471 freq
'nice - 6 freq
nick's - 16 freq
newse - 3 freq
nike - 6 freq
nosey - 13 freq
nis - 2 freq
nash - 2 freq
na-say - 8 freq
nessie - 11 freq
noose - 15 freq
naig - 21 freq
noo's - 11 freq
nesh - 4 freq
ness - 41 freq
noos - 8 freq
nigh - 18 freq
ngs - 1 freq
newhouse - 1 freq
niche - 7 freq
nus - 5 freq
nee's - 1 freq
noak - 3 freq
nick - 133 freq
nausea - 2 freq
nook - 8 freq
nyuck - 1 freq
nookie - 2 freq
nazi - 9 freq
neig - 1 freq
nog - 3 freq
nhs - 38 freq
nac - 4 freq
nk - 3 freq
neck's - 2 freq
ngo - 2 freq
nch - 2 freq
nooks - 2 freq
noah's - 4 freq
'nick' - 1 freq
nicey - 3 freq
niko - 3 freq
nc - 5 freq
nic - 3 freq
-nsi- - 1 freq
neigh - 4 freq
ns - 15 freq
naggai - 3 freq
newess - 1 freq
naise - 2 freq
'nike' - 1 freq
neck' - 1 freq
nou's - 5 freq
noyes' - 1 freq
nags - 4 freq
nyook - 3 freq
noas - 2 freq
noyse - 1 freq
'now's - 1 freq
nous - 9 freq
njoo - 3 freq
nex' - 10 freq
nyse - 2 freq
naesay - 2 freq
nag - 6 freq
naeweys - 2 freq
niys - 1 freq
ng - 6 freq
nisi - 5 freq
no-wyce - 2 freq
-nis - 1 freq
nosh - 1 freq
neek - 1 freq
naigie - 1 freq
nms - 2 freq
noch - 2 freq
nasa - 17 freq
«nius - 1 freq
nicky - 8 freq
neuk's - 1 freq
nsh - 1 freq
nj - 4 freq
ncs - 2 freq
nyows - 3 freq
neice - 3 freq
nok - 4 freq
nece - 1 freq
nek - 2 freq
noks - 1 freq
“nazi - 1 freq
neyse - 1 freq
“neuk - 1 freq
nyc - 1 freq
naewise - 1 freq
naggie - 2 freq
'nice' - 1 freq
nes - 1 freq
neows - 1 freq
‘nhs - 2 freq
naws - 1 freq
‘nice - 2 freq
nokia - 2 freq
no-show - 1 freq
neige - 2 freq
nocks - 1 freq
niz - 3 freq
niki - 1 freq
’ness - 1 freq
’nous - 1 freq
“news - 1 freq
'noo's - 1 freq
nox - 1 freq
nq - 6 freq
nwk - 1 freq
nuxh - 1 freq
njzo - 1 freq
nz - 5 freq
nwkj - 1 freq
newzh - 1 freq
najq - 1 freq
nisx - 1 freq
newsy - 1 freq
neis - 1 freq
nows - 1 freq
nmckay - 1 freq
nosie - 1 freq
nuce - 1 freq
njc - 1 freq
nck - 1 freq
nxz - 1 freq
ni's - 1 freq
nsxo - 1 freq
nx - 3 freq
‘noise’ - 1 freq
neuky - 1 freq
noize - 1 freq
nic's - 1 freq
nocky - 1 freq
njk - 1 freq
neg - 1 freq
n’es - 1 freq
nhsggc - 1 freq
'noak' - 1 freq
noiz - 1 freq
nsa - 1 freq
naze - 1 freq
nise - 1 freq
nxu - 1 freq
ngho - 1 freq
nxgu - 1 freq
neggy - 1 freq
nec - 2 freq
ns's - 1 freq
neeq - 1 freq
nxua - 1 freq
niuc - 1 freq
nkea - 1 freq
nsz - 1 freq
nwz - 1 freq
nqq - 1 freq
nahz - 1 freq
nhx - 1 freq
njoy - 8 freq
nikki - 10 freq
nahuyako - 1 freq
nhsaaa - 1 freq
nak - 1 freq
newky - 1 freq
MetaPhone code - NF
knife - 103 freq
nieve - 29 freq
neive - 21 freq
knave - 48 freq
navy - 45 freq
'nuff - 1 freq
nephew - 28 freq
nfu - 1 freq
nigh - 18 freq
gnef - 1 freq
nov - 4 freq
naive - 11 freq
nova - 7 freq
nevahhh - 1 freq
'knife - 1 freq
nove - 1 freq
naïve - 5 freq
gnough - 1 freq
neave - 3 freq
no've - 2 freq
neigh - 4 freq
knif - 1 freq
naff - 2 freq
nev - 6 freq
nav - 3 freq
naïf - 1 freq
niff - 1 freq
niv - 5 freq
nevoy - 7 freq
‘knifey - 1 freq
hyne-aff - 1 freq
nava - 30 freq
’nyf - 1 freq
nouveau - 3 freq
nave - 1 freq
naffu - 1 freq
navvy - 1 freq
new-faa - 1 freq
nv - 5 freq
nf - 2 freq
nvw - 1 freq
nva - 2 freq
nuf - 1 freq
gnev - 1 freq
wnvh - 1 freq
noyfea - 1 freq
nhv - 1 freq
ngho - 1 freq
pnfw - 1 freq
naafi - 1 freq
Manually curated - NIGH
Time to execute Levenshtein function - 0.239268 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
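The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not necessarily the code behind this page; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # (before the current character) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a with char_b, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("nigh", "night"))   # one insertion
print(levenshtein("nigh", "nights"))  # two insertions
```

The two calls reproduce the distances listed above for night (1) and nights (2).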
Time to execute Double Levenshtein function - 0.337960 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
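A rough sketch of that idea in Python follows. The exact vowel set used by this page is not stated, so the sketch assumes a, e, i, o and u; the names `strip_vowels` and `double_levenshtein` are illustrative only.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (standard dynamic-programming version)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("nigh", "neigh"))   # 1 + 0: same consonants
print(double_levenshtein("nigh", "night"))   # 1 + 1: the extra t counts twice
print(double_levenshtein("nigh", "enough"))  # 3 + 0: only vowels differ
```

Under that assumption the sketch gives the scores shown above for neigh (1), night (2) and enough (3): a consonant change is counted in both passes, while a vowel change is counted only once.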
Time to execute SoundEx function - 0.028979 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
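As an illustration, here is a minimal Python sketch of the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters that share a digit, and pad to four characters. Whether this page uses exactly this variant is not stated, so treat the sketch as an assumption.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no digit.
CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in group:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code such as N200."""
    # Ignore anything that is not a plain a-z letter (hyphens, apostrophes, ...).
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated digit can be coded again
    return (result + "000")[:4]  # pad with zeros, then keep four characters


for word in ("nigh", "neck", "news", "nhs"):
    print(word, soundex(word))
```

All four words in the loop map to N200, which is why they appear together in the SoundEx list above.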
Time to execute MetaPhone function - 0.042284 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001255 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
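In code, a lookup of this kind can be as simple as a dictionary. The sketch below is hypothetical: the table name `LEXICON`, the function `curated_lemma` and the lower-casing of the query are all assumptions, and the single entry is taken from the nigh result shown on this page.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping a word form to its lemma.
# Only the nigh -> NIGH pairing comes from this page; a real table
# would hold many more hand-entered forms, typos and variants.
LEXICON = {
    "nigh": "NIGH",
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())  # assumption: lookups ignore case


print(curated_lemma("nigh"))   # covered entry
print(curated_lemma("ablow"))  # not in this tiny table, so None
```

Returning None for missing words mirrors the note that not all words are covered; a caller can then fall back to one of the algorithmic methods.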