A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to nae-whaur in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
nae-whaur (0) - 1 freq
naewhaur (1) - 57 freq
onie-whaur (2) - 1 freq
naewhar (2) - 3 freq
naewhaurs (2) - 1 freq
naewhair (2) - 1 freq
naewhur (2) - 4 freq
naewhaar (2) - 3 freq
naewherr (3) - 1 freq
nae-baud (3) - 1 freq
awhaur (3) - 2 freq
naewher (3) - 4 freq
anywhaur (3) - 2 freq
ontewhaur (3) - 1 freq
naewahr (3) - 1 freq
aawhaur (3) - 5 freq
nowhaur (3) - 1 freq
tae-haud (3) - 1 freq
oniewhaur (3) - 16 freq
a'whaur (3) - 1 freq
naewur (3) - 2 freq
elsewhaur (4) - 13 freq
near-haun (4) - 5 freq
awhaurs (4) - 1 freq
anywhur (4) - 2 freq
nae-whaur (0) - 1 freq
naewhaur (2) - 57 freq
onie-whaur (2) - 1 freq
naewhaar (3) - 3 freq
naewhur (3) - 4 freq
naewhair (3) - 1 freq
naewhar (3) - 3 freq
ontewhaur (4) - 1 freq
naewahr (4) - 1 freq
nowhaur (4) - 1 freq
anywhaur (4) - 2 freq
any-whar (4) - 1 freq
naewhaurs (4) - 1 freq
naewher (4) - 4 freq
oniewhaur (4) - 16 freq
oniewhar (5) - 2 freq
naewhere (5) - 49 freq
oniewhur (5) - 1 freq
onywhaur (5) - 58 freq
anywhur (5) - 2 freq
anywhar (5) - 2 freq
innywhaur (5) - 1 freq
nowhar (5) - 4 freq
a'whaur (5) - 1 freq
awhaur (5) - 2 freq
SoundEx code - N600
nor - 1978 freq
near - 1175 freq
naewhaur - 57 freq
narra - 38 freq
naar - 7 freq
norrie - 20 freq
nar - 15 freq
naewhere - 49 freq
nir - 49 freq
narrae - 15 freq
naur - 10 freq
neer - 51 freq
norie - 47 freq
ne'er - 173 freq
nairrae - 4 freq
nerra - 33 freq
nary - 2 freq
nerrae - 2 freq
nowhere - 9 freq
nairae - 8 freq
niwer - 28 freq
norroway - 7 freq
noor - 44 freq
norrowa - 26 freq
norrawa - 1 freq
ne're - 1 freq
nor' - 1 freq
narraa - 1 freq
nur - 61 freq
naewherr - 1 freq
narrow - 15 freq
naewhur - 4 freq
nero - 1 freq
newer - 7 freq
narrie - 1 freq
'ne'er - 1 freq
nora - 6 freq
ner - 53 freq
naewhar - 3 freq
neri - 3 freq
nahor - 3 freq
'nor - 2 freq
nairra - 9 freq
ni'er - 1 freq
nairoo - 2 freq
naewhaar - 3 freq
norway - 15 freq
'naewhere - 1 freq
nerr - 9 freq
nerrow - 1 freq
nairroo - 2 freq
nowhaur - 1 freq
nowhar - 4 freq
neir - 24 freq
'near - 1 freq
norwa - 3 freq
naerrae - 6 freq
naerrie - 1 freq
narie - 1 freq
norrowey - 2 freq
nuir - 1 freq
nuwara - 2 freq
nairrie - 1 freq
€”nor - 1 freq
naewur - 2 freq
nr - 8 freq
noir - 3 freq
naywhere - 1 freq
€˜narrow - 1 freq
nae-whaur - 1 freq
€œnir - 1 freq
norr - 3 freq
ne'r - 1 freq
nory - 4 freq
naewhair - 1 freq
naewahr - 1 freq
nair - 2 freq
nyir - 1 freq
€˜nor - 1 freq
nahr - 1 freq
naewher - 4 freq
nairrow - 1 freq
nare - 1 freq
niro - 1 freq
nhr - 1 freq
nweir - 6 freq
norrow - 1 freq
neÂ’er - 1 freq
MetaPhone code - NHR
naewhaur - 57 freq
naewhere - 49 freq
nowhere - 9 freq
naewherr - 1 freq
naewhur - 4 freq
naewhar - 3 freq
nahor - 3 freq
naewhaar - 3 freq
'naewhere - 1 freq
nowhaur - 1 freq
nowhar - 4 freq
naywhere - 1 freq
nae-whaur - 1 freq
naewhair - 1 freq
naewher - 4 freq
NAE-WHAUR
Time to execute Levenshtein function - 0.244770 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.387914 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031922 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037998 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000866 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.