A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dimension in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dimension (0) - 11 freq
dimensions (1) - 2 freq
dimensional (2) - 1 freq
diveision (2) - 1 freq
dissension (2) - 2 freq
diversion (2) - 4 freq
detention (3) - 10 freq
deceision (3) - 5 freq
division (3) - 17 freq
pension (3) - 54 freq
omeesion (3) - 2 freq
mention (3) - 147 freq
deceesion (3) - 13 freq
deveesion (3) - 1 freq
diveesions (3) - 1 freq
devensian (3) - 1 freq
mansion (3) - 25 freq
diversioun (3) - 1 freq
direction (3) - 123 freq
immersion (3) - 5 freq
'mention (3) - 1 freq
dispensin (3) - 2 freq
licensin (3) - 1 freq
demission (3) - 2 freq
dickensian (3) - 1 freq
dimension (0) - 11 freq
dimensions (2) - 2 freq
dimensional (3) - 1 freq
mansion (4) - 25 freq
devensian (4) - 1 freq
demission (4) - 2 freq
dominion (4) - 4 freq
diversion (4) - 4 freq
diveision (4) - 1 freq
dissension (4) - 2 freq
mansin (5) - 5 freq
dansin (5) - 14 freq
damnin (5) - 2 freq
pinsion (5) - 1 freq
meesion (5) - 1 freq
demissioun (5) - 1 freq
admeission (5) - 1 freq
monsoon (5) - 8 freq
dennison (5) - 10 freq
diminish (5) - 3 freq
danson (5) - 1 freq
damnation (5) - 3 freq
manson (5) - 14 freq
admission (5) - 4 freq
damson (5) - 1 freq
SoundEx code - D552
demonstratin - 5 freq
diminisht - 1 freq
demonstrate - 12 freq
dominie's - 3 freq
dominies - 23 freq
denying - 3 freq
diminish - 3 freq
demans - 1 freq
demonstrating - 1 freq
demonic - 6 freq
diminished - 5 freq
demonstration - 7 freq
dining - 6 freq
domino's - 2 freq
dimension - 11 freq
dinnings - 1 freq
demons - 36 freq
doo-manes - 1 freq
dynamic - 12 freq
diminishes - 4 freq
'dunoon's - 2 freq
dinin-chair - 1 freq
dominos - 5 freq
dinin's - 1 freq
diminishin - 5 freq
demonstrations - 3 freq
dominique - 1 freq
denims - 5 freq
domains - 6 freq
demonstrative - 8 freq
dining' - 1 freq
dominoes - 2 freq
demonstraetion - 1 freq
dining-room - 1 freq
demauns - 1 freq
demons' - 1 freq
dimensional - 1 freq
demonised - 3 freq
dominus - 2 freq
demonstratives - 3 freq
dimensions - 2 freq
dunning - 1 freq
'demons - 1 freq
demonstratit - 1 freq
demon's - 1 freq
domines - 1 freq
dawning - 1 freq
denunce - 1 freq
dominik - 1 freq
demonstrably - 1 freq
daimen-icker - 1 freq
daoming - 1 freq
diminishing - 1 freq
dymons - 1 freq
dynamic' - 1 freq
demonstrated - 2 freq
demonise - 1 freq
dynamics - 3 freq
denooncin - 1 freq
dehumanises - 1 freq
denounced - 2 freq
dinning - 1 freq
dominicmhinde - 3 freq
dominiquetaegon - 1 freq
donnanicol - 2 freq
dominic - 3 freq
downing - 2 freq
dehumanize - 1 freq
demonstrates - 1 freq
downingstreet - 1 freq
dominiccummimgs - 1 freq
damien’s - 1 freq
MetaPhone code - TMNXN
dimension - 11 freq
damnation - 3 freq
diminishin - 5 freq
dimínishin - 1 freq
domination - 1 freq
DIMENSION
Time to execute Levenshtein function - 0.207594 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.386472 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027729 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038384 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000869 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.