A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cyberdug in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cyberdug (0) - 4 freq
cyberdugs (1) - 2 freq
cyberpaw (3) - 1 freq
underdug (3) - 1 freq
cybermen (3) - 1 freq
cyber (3) - 4 freq
cyber-ee (3) - 1 freq
cyberee (3) - 1 freq
cyberpunk (3) - 2 freq
overdue (4) - 9 freq
cooerd's (4) - 1 freq
cerd (4) - 7 freq
brug (4) - 1 freq
cooerdly (4) - 1 freq
rydercup (4) - 1 freq
iceberg (4) - 6 freq
cyber-een (4) - 1 freq
tuberous (4) - 1 freq
covering (4) - 6 freq
caber' (4) - 2 freq
cyaard's (4) - 2 freq
berg (4) - 1 freq
verdur (4) - 1 freq
cyaards' (4) - 1 freq
underdog (4) - 1 freq
cyberdug (0) - 4 freq
cyberdugs (2) - 2 freq
cyberee (5) - 1 freq
iceberg (5) - 6 freq
cyber (5) - 4 freq
cyber-ee (5) - 1 freq
cyberpaw (5) - 1 freq
underdug (5) - 1 freq
cybermen (5) - 1 freq
aiberdour (6) - 5 freq
cuboard (6) - 2 freq
crug (6) - 1 freq
caerds (6) - 1 freq
coleridge (6) - 1 freq
cubberd (6) - 1 freq
caberray (6) - 1 freq
cooerds (6) - 1 freq
cyaards (6) - 9 freq
cybertail (6) - 3 freq
aberdour (6) - 9 freq
berde (6) - 2 freq
capering (6) - 1 freq
caberay (6) - 1 freq
cabers (6) - 4 freq
cerds (6) - 1 freq
SoundEx code - C163
covert - 39 freq
covered - 124 freq
cuivert - 2 freq
cover't - 3 freq
cupboards - 19 freq
cavorting - 1 freq
cupboard - 53 freq
chauffeured - 1 freq
'cupboard' - 2 freq
coverts - 1 freq
capered - 4 freq
cuver'd - 2 freq
cubberd - 1 freq
cuvert - 1 freq
cabaret - 2 freq
coverit - 6 freq
cubbords - 2 freq
co-operation - 5 freq
capaured - 1 freq
cupboord's - 1 freq
cupboords - 2 freq
cupboord - 2 freq
cyberdug - 4 freq
cybertail - 3 freq
cyberheid - 1 freq
cubbart - 1 freq
coiffured - 1 freq
cupboard's - 1 freq
cavortan - 1 freq
co-operative - 3 freq
chipperdingan - 1 freq
cypriot - 1 freq
covereth - 1 freq
copywriter - 1 freq
cuboard - 2 freq
coveret - 2 freq
chipboord - 1 freq
cooperation - 3 freq
chipboard - 2 freq
cyberdugs - 2 freq
copper-tapped - 2 freq
cooperate - 2 freq
€˜covert - 3 freq
“covered - 1 freq
coveritup - 1 freq
cobradabest - 3 freq
cooperative - 1 freq
chiefbrody - 1 freq
MetaPhone code - SBRTK
cyberdug - 4 freq
CYBERDUG
dug - 576 freq
doggie - 3 freq
dugs - 231 freq
dogs - 45 freq
dog - 157 freq
cyberdug - 4 freq
Time to execute Levenshtein function - 0.481103 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.810391 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.080153 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037178 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000918 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.