A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to comfits in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
comfits (0) - 6 freq
comfies (1) - 2 freq
comfirts (1) - 1 freq
commits (1) - 7 freq
comins (2) - 4 freq
comfirt (2) - 2 freq
coits (2) - 1 freq
comets (2) - 12 freq
coffins (2) - 6 freq
comforts (2) - 3 freq
comics (2) - 20 freq
ootfits (2) - 4 freq
commita (2) - 1 freq
comfiest (2) - 1 freq
compis (2) - 1 freq
commit (2) - 17 freq
comlie (3) - 1 freq
coins (3) - 52 freq
chafts (3) - 9 freq
comete (3) - 1 freq
coupit (3) - 16 freq
coggins (3) - 1 freq
cooarts (3) - 1 freq
consi's (3) - 1 freq
come's (3) - 1 freq
comfits (0) - 6 freq
commits (2) - 7 freq
comfirts (2) - 1 freq
comfies (2) - 2 freq
comforts (3) - 3 freq
comets (3) - 12 freq
crafts (4) - 5 freq
clifts (4) - 5 freq
commit (4) - 17 freq
clefts (4) - 1 freq
competes (4) - 1 freq
crufts (4) - 1 freq
crofts (4) - 10 freq
compis (4) - 1 freq
chafts (4) - 9 freq
comfiest (4) - 1 freq
ootfits (4) - 4 freq
coffins (4) - 6 freq
coits (4) - 1 freq
comins (4) - 4 freq
comfirt (4) - 2 freq
commita (4) - 1 freq
comics (4) - 20 freq
comes (5) - 962 freq
commuity (5) - 1 freq
SoundEx code - C513
cambodia's - 1 freq
campit - 2 freq
champit - 11 freq
confetti - 5 freq
computer - 109 freq
comfits - 6 freq
confident - 45 freq
competitive - 9 freq
confidence - 114 freq
compete - 11 freq
camped - 5 freq
competeition - 8 freq
confidit - 1 freq
combatants - 1 freq
competeeshun - 1 freq
computers - 26 freq
convoyt - 1 freq
competeetions - 4 freq
combat - 5 freq
competition - 63 freq
confided - 3 freq
compiete - 1 freq
confide - 4 freq
conveyed - 2 freq
convoyit - 1 freq
compute - 2 freq
competin - 3 freq
confidintly - 1 freq
competent - 3 freq
competence - 5 freq
can-but - 4 freq
confidently - 10 freq
computit - 2 freq
confederations - 2 freq
competitions - 7 freq
computin - 3 freq
combed - 3 freq
champed - 2 freq
campt - 2 freq
computer's - 1 freq
competeetion - 13 freq
competes - 1 freq
compt - 1 freq
confidenss - 1 freq
confoodan - 1 freq
confidence' - 2 freq
compadres - 1 freq
confederation - 1 freq
compatriots - 3 freq
canopied - 1 freq
confederacy - 2 freq
confideracy - 1 freq
computer-generatit - 2 freq
compeetitors - 1 freq
compuiters - 2 freq
compatible - 2 freq
competeeshin - 5 freq
confeedent - 2 freq
competitors - 4 freq
compeetin - 1 freq
competeeshins - 1 freq
competitor - 1 freq
competeition's - 1 freq
competeitions - 3 freq
caimbed - 3 freq
convoyed - 2 freq
computeir - 4 freq
compuiteir - 1 freq
confettae - 1 freq
confidential - 1 freq
confuddlin - 1 freq
competeen - 1 freq
competan - 2 freq
competitiveness - 1 freq
champit' - 1 freq
confidante - 1 freq
competit - 1 freq
confidences - 1 freq
compatriot - 2 freq
confidentially - 1 freq
€œconfidence - 1 freq
computatiouns - 1 freq
competing - 2 freq
compitition - 1 freq
cmbpdufuwc - 1 freq
competently - 1 freq
combatcoach - 1 freq
MetaPhone code - KMFTS
comfits - 6 freq
COMFITS
Time to execute Levenshtein function - 0.266080 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.467214 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029684 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042779 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001002 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.