A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to nightclubs in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
nightclubs (0) - 1 freq
nightclub (1) - 2 freq
nichtclub (2) - 2 freq
nightmares (4) - 6 freq
high-class (4) - 3 freq
nighbours (4) - 3 freq
nightclubfails (4) - 2 freq
nightly (4) - 3 freq
nichtcaps (4) - 1 freq
nightdress (4) - 1 freq
night's (4) - 7 freq
nightcap (4) - 3 freq
righteous (4) - 8 freq
nights (4) - 68 freq
lightbulbs (4) - 1 freq
dights (5) - 5 freq
highlans (5) - 1 freq
nichtmares (5) - 10 freq
nicus (5) - 1 freq
nighty (5) - 1 freq
digitalis (5) - 1 freq
rightful (5) - 1 freq
fighters (5) - 4 freq
lightly (5) - 7 freq
nightie (5) - 1 freq
nightclubs (0) - 1 freq
nightclub (2) - 2 freq
nichtclub (4) - 2 freq
nightclubfails (6) - 2 freq
lightbulbs (6) - 1 freq
night's (7) - 7 freq
nightdress (7) - 1 freq
nights (7) - 68 freq
nightcap (7) - 3 freq
nichtcaps (7) - 1 freq
high-class (7) - 3 freq
nightmares (7) - 6 freq
nightly (7) - 3 freq
righteous (8) - 8 freq
weightless (8) - 1 freq
nochtless (8) - 1 freq
nyght's (8) - 1 freq
nighbours (8) - 3 freq
nichts (9) - 159 freq
sights (9) - 7 freq
bathtubs (9) - 1 freq
tightens (9) - 2 freq
rights (9) - 32 freq
night' (9) - 3 freq
eighties (9) - 11 freq
SoundEx code - N232
nicht's - 38 freq
nichts - 159 freq
nights - 68 freq
nightcap - 3 freq
night's - 7 freq
nuggets - 6 freq
neckties - 1 freq
nestis - 1 freq
nichtshirt - 1 freq
nests - 27 freq
nae-kids - 1 freq
neistweys - 1 freq
nichts' - 3 freq
nct's - 1 freq
nightclub - 2 freq
noughties - 1 freq
nichtclub - 2 freq
nichtcaps - 1 freq
nasties - 1 freq
nyght's - 1 freq
nights' - 1 freq
nichtgoon - 1 freq
nicht-wach - 1 freq
negates - 1 freq
noosts - 2 freq
naegaits - 5 freq
'noctes - 1 freq
nachtgeräusche - 2 freq
nest-egg - 1 freq
nicht-sky - 1 freq
€™nichts - 5 freq
niceties - 1 freq
nightshift - 3 freq
nochts - 1 freq
€”noctes - 1 freq
nightclubs - 1 freq
nztgo - 1 freq
njudj - 1 freq
nightclubfails - 2 freq
njsdkvt - 1 freq
neukatyke - 1 freq
nsaiedxs - 1 freq
nichtÂ’s - 1 freq
nÂ’est-ce - 1 freq
nztcz - 1 freq
nichts'll - 1 freq
nikkithegreen - 1 freq
MetaPhone code - NFTKLBS
nightclubs - 1 freq
NIGHTCLUBS
Time to execute Levenshtein function - 0.321588 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.584016 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030368 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.080172 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000878 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.