A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to closer in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
closer (0) - 134 freq
closet (1) - 9 freq
clooer (1) - 2 freq
cooser (1) - 3 freq
closes (1) - 27 freq
loser (1) - 4 freq
close (1) - 492 freq
closser (1) - 13 freq
closey (1) - 1 freq
clover (1) - 17 freq
cloaser (1) - 5 freq
closed (1) - 126 freq
closest (2) - 20 freq
cloor (2) - 3 freq
clopper (2) - 1 freq
closses (2) - 1 freq
clocker (2) - 3 freq
cloned (2) - 1 freq
loses (2) - 6 freq
'close' (2) - 1 freq
clooers (2) - 1 freq
closeen (2) - 2 freq
closure (2) - 3 freq
coose (2) - 8 freq
lower (2) - 57 freq
closer (0) - 134 freq
cloaser (1) - 5 freq
closey (2) - 1 freq
clover (2) - 17 freq
closure (2) - 3 freq
closser (2) - 13 freq
closed (2) - 126 freq
clooer (2) - 2 freq
closet (2) - 9 freq
close (2) - 492 freq
cooser (2) - 3 freq
loser (2) - 4 freq
closes (2) - 27 freq
couser (3) - 2 freq
clese (3) - 1 freq
clased (3) - 2 freq
closely (3) - 21 freq
clever (3) - 66 freq
closie (3) - 5 freq
clos (3) - 14 freq
chaser (3) - 9 freq
clase (3) - 7 freq
closse (3) - 1 freq
clouse (3) - 1 freq
clyse (3) - 10 freq
SoundEx code - C426
closer - 134 freq
clessroom - 29 freq
claggier - 1 freq
cleesher - 1 freq
clessruim - 2 freq
classroom - 43 freq
clockers - 6 freq
classrooms - 2 freq
cleisher - 1 freq
cloaser - 5 freq
closure - 3 freq
clockwork - 11 freq
claggers - 1 freq
closures - 1 freq
'clockers - 1 freq
clocker - 3 freq
clockwark - 4 freq
closser - 13 freq
cloakroom - 2 freq
classreum - 1 freq
clocherin - 1 freq
click-cracklan - 1 freq
clausura - 1 freq
clessruims - 1 freq
'clugger' - 1 freq
clugger - 4 freq
clugger's - 2 freq
clessrooms - 1 freq
claggordie - 1 freq
clockwirk - 1 freq
clessreum - 2 freq
cless-room - 2 freq
cless-rooms - 2 freq
calligrapher - 1 freq
calzer - 1 freq
culchierules - 2 freq
chelzreese - 17 freq
clqhrfndaa - 1 freq
closeronline - 2 freq
colgravesound - 1 freq
clyhagruho - 1 freq
classroom' - 1 freq
caulker - 1 freq
claesirgenderneutral - 1 freq
MetaPhone code - KLSR
closer - 134 freq
cloaser - 5 freq
closure - 3 freq
closser - 13 freq
clausura - 1 freq
glossary - 15 freq
glacier - 2 freq
glossary' - 1 freq
calzer - 1 freq
glazier - 1 freq
CLOSER
close - 492 freq
closer - 134 freq
closest - 20 freq
closer - 134 freq
closing - 8 freq
closin - 25 freq
closes - 27 freq
closed - 126 freq
closet - 9 freq
Time to execute Levenshtein function - 0.385270 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.681423 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028304 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.073426 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001386 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.