A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to academic in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
academic (0) - 69 freq
aicademic (1) - 2 freq
academik (1) - 2 freq
academics (1) - 22 freq
academie (1) - 24 freq
academia (1) - 9 freq
€˜academic (2) - 1 freq
academe (2) - 1 freq
acadeemics (2) - 1 freq
academies (2) - 1 freq
academicus (2) - 1 freq
academy (2) - 57 freq
'academic' (2) - 1 freq
anaemic (2) - 1 freq
acamedics (3) - 1 freq
carmic (3) - 2 freq
académie (3) - 3 freq
endemic (3) - 3 freq
anaemia (3) - 1 freq
acidic (3) - 1 freq
academe's (3) - 1 freq
akademy (3) - 3 freq
pendemic (3) - 1 freq
epidemic (3) - 8 freq
pandemic (3) - 40 freq
academic (0) - 69 freq
aicademic (1) - 2 freq
academia (2) - 9 freq
academie (2) - 24 freq
academics (2) - 22 freq
academik (2) - 2 freq
academicus (3) - 1 freq
academy (3) - 57 freq
academies (3) - 1 freq
academe (3) - 1 freq
acadeemics (3) - 1 freq
endemic (4) - 3 freq
epidemic (4) - 8 freq
acidic (4) - 1 freq
carmic (4) - 2 freq
anaemic (4) - 1 freq
'academic' (4) - 1 freq
€˜academic (4) - 1 freq
chymic (5) - 1 freq
cadence (5) - 2 freq
andymc (5) - 1 freq
cosmic (5) - 12 freq
coamic (5) - 5 freq
cadmium (5) - 2 freq
calmac (5) - 1 freq
SoundEx code - A235
'accident' - 2 freq
academie - 24 freq
accident - 76 freq
ashton - 4 freq
achteen - 2 freq
achten - 1 freq
action - 138 freq
actions - 49 freq
actin - 58 freq
acting - 12 freq
asthma - 4 freq
accidentally - 13 freq
academicus - 1 freq
academe's - 1 freq
academy - 57 freq
academic - 69 freq
aisedom - 6 freq
academies - 1 freq
agitants - 1 freq
aichteen - 13 freq
acettain - 1 freq
astonishment - 6 freq
aichteen'' - 1 freq
academyfolk - 1 freq
academically - 4 freq
auctioneer - 3 freq
austin - 5 freq
auctioneers - 2 freq
auction - 3 freq
asthmas - 1 freq
action's - 1 freq
astonishes - 1 freq
astonisht - 4 freq
austen - 2 freq
astonishmint - 1 freq
academia - 9 freq
academics - 22 freq
accidental - 7 freq
austenty - 1 freq
actioun - 15 freq
auchteen - 5 freq
aston - 4 freq
actin' - 1 freq
acktin - 1 freq
auctioneer's - 1 freq
¬‚agstones - 1 freq
aisedom's - 1 freq
asudden - 1 freq
'actin - 1 freq
actin's - 1 freq
actan - 3 freq
akademy - 3 freq
aicademic - 2 freq
astonishin - 2 freq
accidence - 1 freq
aichteenth - 1 freq
astonished - 3 freq
achaedeemics - 1 freq
astonisher - 1 freq
academik - 2 freq
astonist - 5 freq
astonishing - 2 freq
académie - 3 freq
astoondin - 3 freq
acuteness - 1 freq
accidently - 1 freq
accidentprone - 1 freq
€˜academic - 1 freq
accidents - 1 freq
'astana - 1 freq
action-packed - 1 freq
awjtn - 1 freq
aztumgy - 1 freq
achtung - 2 freq
austinarmacost - 13 freq
ashtenrd - 1 freq
actiontiff - 1 freq
actionresearch - 2 freq
astounded - 1 freq
astounding - 1 freq
'academic' - 1 freq
azxcdnzki - 1 freq
acadeemics - 1 freq
academe - 1 freq
MetaPhone code - AKTMK
academic - 69 freq
aicademic - 2 freq
academik - 2 freq
€˜academic - 1 freq
'academic' - 1 freq
ACADEMIC
Time to execute Levenshtein function - 0.190027 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337247 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027849 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038146 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.