A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to diamonds in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
diamonds (0) - 22 freq
diamond (1) - 12 freq
diamonts (1) - 3 freq
diamont's (2) - 1 freq
diamants (2) - 1 freq
diamon (2) - 2 freq
dymons (3) - 1 freq
amends (3) - 3 freq
damorns (3) - 1 freq
monds (3) - 8 freq
viands (3) - 3 freq
demands (3) - 21 freq
diamante (3) - 2 freq
desmond (3) - 3 freq
demons (3) - 35 freq
dehmond (3) - 1 freq
almonds (3) - 2 freq
towmonds (3) - 1 freq
dragon's (3) - 1 freq
lamond (3) - 1 freq
simon's (3) - 2 freq
eamonns (3) - 1 freq
diamant (3) - 2 freq
demon's (3) - 1 freq
dillon's (3) - 1 freq
diamonds (0) - 22 freq
diamond (2) - 12 freq
diamonts (2) - 3 freq
demands (3) - 21 freq
diamants (3) - 1 freq
demaunds (4) - 1 freq
demons (4) - 35 freq
edmunds (4) - 1 freq
damns (4) - 2 freq
demaands (4) - 3 freq
demon's (4) - 1 freq
almonds (4) - 2 freq
amends (4) - 3 freq
monds (4) - 8 freq
diamon (4) - 2 freq
dymons (4) - 1 freq
diamont's (4) - 1 freq
diomedes (5) - 10 freq
domines (5) - 1 freq
depeinds (5) - 1 freq
mynds (5) - 14 freq
demonise (5) - 1 freq
damndest (5) - 1 freq
defends (5) - 1 freq
demans (5) - 1 freq
SoundEx code - D553
dementit - 22 freq
doon-in-the-mooth - 1 freq
diamonds - 22 freq
diamont's - 1 freq
diamants - 1 freq
demands - 21 freq
demand - 51 freq
diamant - 2 freq
diamond - 12 freq
demanded - 9 freq
demandit - 45 freq
diamante - 2 freq
demented - 8 freq
dominated - 3 freq
demandin - 11 freq
dynamite - 7 freq
dementia - 9 freq
dominate - 3 freq
deminted - 1 freq
doon-an-oots - 1 freq
'dementit - 1 freq
diamond-studdit - 1 freq
demaands - 3 freq
demandan - 1 freq
doonwind - 1 freq
dehmond - 1 freq
demanding - 4 freq
diminutive - 5 freq
demaunds - 1 freq
dominatin - 6 freq
diamonts - 3 freq
doon-haundelt - 7 freq
demeaned - 1 freq
diminutives - 3 freq
demained - 1 freq
dominatit - 1 freq
dominates - 2 freq
diminted - 1 freq
demaundit - 2 freq
demaundin - 1 freq
demaund - 2 freq
diamondsteel - 1 freq
diamondsteelcomics - 1 freq
domination - 1 freq
downmanned - 1 freq
denend - 1 freq
dannyhandling - 1 freq
dominating - 1 freq
dementedbonxie - 2 freq
MetaPhone code - TMNTS
diamonds - 22 freq
diamont's - 1 freq
diamants - 1 freq
demands - 21 freq
demaands - 3 freq
towmonds - 1 freq
demaunds - 1 freq
diamonts - 3 freq
dominates - 2 freq
DIAMONDS
Time to execute Levenshtein function - 0.336967 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.599570 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029609 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070990 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000812 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.