A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to needles in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
needles (0) - 17 freq
needler (1) - 2 freq
needle's (1) - 4 freq
needless (1) - 16 freq
needle (1) - 39 freq
neddies (2) - 2 freq
meedle (2) - 1 freq
needed (2) - 150 freq
noodles (2) - 7 freq
needae (2) - 1 freq
fiedles (2) - 2 freq
feefles (2) - 1 freq
weesles (2) - 1 freq
needs (2) - 340 freq
deedless (2) - 1 freq
nettles (2) - 23 freq
needie (2) - 3 freq
nurdles (2) - 1 freq
teetles (2) - 1 freq
nedes (2) - 2 freq
beetles (2) - 2 freq
deedle (2) - 2 freq
seddles (2) - 1 freq
needet (2) - 2 freq
deedled (2) - 1 freq

Double Levenshtein
needles (0) - 17 freq
noodles (2) - 7 freq
needless (2) - 16 freq
needle (2) - 39 freq
needler (2) - 2 freq
needle's (2) - 4 freq
needs (3) - 340 freq
nurdles (3) - 1 freq
fiedles (3) - 2 freq
needly (3) - 4 freq
nedes (3) - 2 freq
neddies (3) - 2 freq
seedlies (3) - 7 freq
landles (4) - 2 freq
candles (4) - 9 freq
nudges (4) - 4 freq
naples (4) - 1 freq
nobles (4) - 7 freq
idles (4) - 1 freq
nables (4) - 1 freq
nicoles (4) - 1 freq
handles (4) - 4 freq
ladles (4) - 2 freq
bedlaes (4) - 1 freq
noodle (4) - 7 freq
SoundEx code - N342
needles - 17 freq
needless - 16 freq
nettles - 23 freq
noodles - 7 freq
needle's - 4 freq
natalia's - 1 freq
natheless - 2 freq
nate-lik - 1 freq
nettils - 1 freq
nae-dialectial - 1 freq
naetheless - 3 freq
nataliejsteele - 1 freq
nae-the-like-o-us - 1 freq
MetaPhone code - NTLS
needles - 17 freq
needless - 16 freq
wyndless - 1 freq
nettles - 23 freq
noodles - 7 freq
needle's - 4 freq
natalia's - 1 freq
knotless - 3 freq
nettils - 1 freq

Manually curated
NEEDLES
Time to execute Levenshtein function - 0.346679 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
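The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this page actually runs; the function names `levenshtein` and `nearest` and the `max_distance` parameter are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character inserts, deletes and replacements
    needed to turn word a into word b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace (or keep) the character
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, max_distance=2):
    """Return (distance, candidate) pairs within max_distance, closest first.
    The vocabulary is a hypothetical word list standing in for the corpus."""
    scored = [(levenshtein(word, candidate), candidate) for candidate in vocabulary]
    return sorted(pair for pair in scored if pair[0] <= max_distance)
```

Called as `nearest("needles", ["needle", "noodles", "candles"])`, this keeps `needle` and `noodles` and drops `candles`, which is three replacements away.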
Time to execute Double Levenshtein function - 0.549280 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
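As a rough sketch of that idea in Python: the vowel set (a, e, i, o, u) and the names `strip_vowels` and `double_levenshtein` are assumptions, since the page does not say exactly which characters count as vowels. With these assumptions the sketch reproduces the scores listed for noodles (2), needs (3) and candles (4).

```python
VOWELS = set("aeiou")  # assumption: y and accented letters are kept as consonants


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance: inserts, deletes and replacements."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words,
    so a changed consonant is counted twice and a changed vowel once."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))
```

For needles and noodles the full words differ by two vowels and the consonant skeletons (`ndls`) are identical, so the combined score stays at 2, while candles picks up one extra point for its additional consonant.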
Time to execute SoundEx function - 0.075747 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
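A minimal Python sketch of the usual American Soundex rules follows, to show how several spellings collapse to one code. It is an illustration only: the function name `soundex` is an assumption, and the implementation behind this page may treat edge cases (hyphens, apostrophes, the letters h and w) differently.

```python
# Letters that share a digit are treated as sounding alike.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, e.g. 'N342'."""
    # Keep only the letters a-z, so apostrophes and hyphens are ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    previous = SOUNDEX_CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != previous:
            digits.append(code)
        previous = code  # a vowel resets this, so a repeat after a vowel counts
    # Pad with zeros and cut to one letter plus three digits.
    return (first.upper() + "".join(digits) + "000")[:4]
```

Under these rules needles, nettles and noodles all map to N342, the code shown in the results above.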
Time to execute MetaPhone function - 0.081062 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000930 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
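A lookup of this kind can be sketched as a plain dictionary in Python. Only the needles to NEEDLES pair comes from the results above; the misspelt keys, the `LEXICON` name and the `lemma_for` function are invented for illustration and are not taken from the real lexicon.

```python
from typing import Optional

# Hypothetical hand-made entries: each surface form points at its lemma.
LEXICON = {
    "needles": "NEEDLES",   # pair shown in the results above
    "needels": "NEEDLES",   # invented typo, for illustration only
    "neeedles": "NEEDLES",  # invented typo, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())
```

Because the table is built by hand, a miss such as `lemma_for("sonsie")` returning `None` means only that the word has not been curated yet, not that it has no lemma. A dictionary lookup also needs no distance computation, which fits this method being the fastest of the five timings.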