A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to diamond in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in parentheses)
diamond (0) - 12 freq
diamonds (1) - 22 freq
diamon (1) - 2 freq
lamond (2) - 1 freq
desmond (2) - 3 freq
diamant (2) - 2 freq
diamonts (2) - 3 freq
dehmond (2) - 1 freq
dispone (3) - 2 freq
redmond (3) - 2 freq
dixon (3) - 1 freq
eamonn (3) - 9 freq
twalmond (3) - 5 freq
amend (3) - 3 freq
dramin (3) - 1 freq
dtammcd (3) - 1 freq
dimmock (3) - 1 freq
kiemon (3) - 1 freq
raymond (3) - 2 freq
lamont (3) - 8 freq
salmond (3) - 24 freq
damson (3) - 1 freq
dimmed (3) - 5 freq
damorn (3) - 6 freq
damoa (3) - 7 freq
Double Levenshtein (distance in parentheses)
diamond (0) - 12 freq
diamon (2) - 2 freq
diamonds (2) - 22 freq
damned (3) - 39 freq
dehmond (3) - 1 freq
demand (3) - 51 freq
desmond (3) - 3 freq
lamond (3) - 1 freq
diamant (3) - 2 freq
niemand (4) - 1 freq
demaund (4) - 2 freq
diamonts (4) - 3 freq
damns (4) - 2 freq
lomond (4) - 16 freq
damed (4) - 1 freq
damn (4) - 73 freq
demons (4) - 35 freq
droond (4) - 5 freq
diamante (4) - 2 freq
damnt (4) - 15 freq
dymons (4) - 1 freq
dimmed (4) - 5 freq
emond (4) - 1 freq
amend (4) - 3 freq
almond (4) - 4 freq
SoundEx code - D553
dementit - 22 freq
doon-in-the-mooth - 1 freq
diamonds - 22 freq
diamont's - 1 freq
diamants - 1 freq
demands - 21 freq
demand - 51 freq
diamant - 2 freq
diamond - 12 freq
demanded - 9 freq
demandit - 45 freq
diamante - 2 freq
demented - 8 freq
dominated - 3 freq
demandin - 11 freq
dynamite - 7 freq
dementia - 9 freq
dominate - 3 freq
deminted - 1 freq
doon-an-oots - 1 freq
'dementit - 1 freq
diamond-studdit - 1 freq
demaands - 3 freq
demandan - 1 freq
doonwind - 1 freq
dehmond - 1 freq
demanding - 4 freq
diminutive - 5 freq
demaunds - 1 freq
dominatin - 6 freq
diamonts - 3 freq
doon-haundelt - 7 freq
demeaned - 1 freq
diminutives - 3 freq
demained - 1 freq
dominatit - 1 freq
dominates - 2 freq
diminted - 1 freq
demaundit - 2 freq
demaundin - 1 freq
demaund - 2 freq
diamondsteel - 1 freq
diamondsteelcomics - 1 freq
domination - 1 freq
downmanned - 1 freq
denend - 1 freq
dannyhandling - 1 freq
dominating - 1 freq
dementedbonxie - 2 freq
MetaPhone code - TMNT
damned - 39 freq
demand - 51 freq
diamant - 2 freq
diamond - 12 freq
towmond - 12 freq
diamante - 2 freq
damn't - 5 freq
'damned - 1 freq
damnit - 5 freq
damnt - 15 freq
dominate - 3 freq
dehmond - 1 freq
damndee - 1 freq
demeaned - 1 freq
demained - 1 freq
demaund - 2 freq
Manually curated - DIAMOND
Time to execute Levenshtein function - 0.270335 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
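The definition above can be sketched in Python as a standard dynamic-programming routine. This is an illustrative implementation, not the code the site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the number of single-character edits that turn a into b."""
    # previous[j] is the distance between the prefix of a seen so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (char_a != char_b)
            insertion = current[j - 1] + 1
            deletion = previous[j] + 1
            current.append(min(substitution, insertion, deletion))
        previous = current
    return previous[-1]


# Compare the query word with a few neighbours from the listing.
for word in ("diamonds", "diamon", "desmond", "lamond"):
    print(word, levenshtein("diamond", word))
```

Only two rows of the distance table are kept at a time, which is enough because each cell depends only on the row above and the cell to its left.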
Time to execute Double Levenshtein function - 0.522067 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
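A minimal Python sketch of that idea follows. It assumes the vowel set is `aeiou`; the page does not say which letters the site actually strips, so `vowels` is left as a parameter. The names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: replacements, insertions and deletions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            current.append(min(
                previous[j - 1] + (char_a != char_b),  # replace (or match)
                current[j - 1] + 1,                    # insert
                previous[j] + 1,                       # delete
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = "aeiou") -> str:
    """Drop vowels, leaving the consonant skeleton (assumed vowel set)."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str, vowels: str = "aeiou") -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    skeleton = levenshtein(strip_vowels(a, vowels), strip_vowels(b, vowels))
    return full + skeleton


for word in ("diamon", "diamonds", "demand", "desmond"):
    print(word, double_levenshtein("diamond", word))
```

Because "demand" and "diamond" share the skeleton "dmnd", the second pass adds nothing and the word ranks alongside near-misses such as "desmond", which differs in fewer letters overall but has one extra consonant.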
Time to execute SoundEx function - 0.063799 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
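As a rough illustration, the sketch below implements the common American Soundex rules in Python: keep the first letter, encode the remaining consonants as digits, collapse adjacent repeats, and pad to four characters. Whether the site uses exactly these rules is an assumption; the function name `soundex` is chosen for the example.

```python
import string

# Digit codes for consonants; vowels, y, h and w have no code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no letters."""
    letters = [c for c in word.lower() if c in string.ascii_lowercase]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


for word in ("diamond", "demand", "dynamite", "dehmond"):
    print(word, soundex(word))
```

All four words reduce to D553, which is why they appear together in the SoundEx listing even though their edit distances from "diamond" differ.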
Time to execute MetaPhone function - 0.037201 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000872 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
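A lookup of this kind might be sketched in Python as a plain dictionary. The entries below are hypothetical: only the pairing of "diamond" with DIAMOND is suggested by the results above, and the names `LEXICON` and `lemma_for` are invented for the example.

```python
from typing import Optional

# Hypothetical hand-made entries mapping spellings to a lemma.
LEXICON = {
    "diamond": "DIAMOND",
    "diamonds": "DIAMOND",  # illustrative only
    "diamon": "DIAMOND",    # illustrative only
    "diamant": "DIAMOND",   # illustrative only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Diamonds"))
print(lemma_for("salmond"))
```

A dictionary lookup is a single hash probe, which is consistent with this method being the fastest of the five timings, and returning None makes the gaps in coverage explicit.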