A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to correct in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
correct (0) - 40 freq
correck (1) - 3 freq
corrects (1) - 1 freq
coreect (1) - 1 freq
'correct (1) - 1 freq
connect (2) - 15 freq
corbett (2) - 7 freq
cornet (2) - 4 freq
incorrect (2) - 1 freq
€˜correct (2) - 1 freq
correctin (2) - 6 freq
carret (2) - 3 freq
collect (2) - 29 freq
correctly (2) - 6 freq
correctet (2) - 2 freq
corekt (2) - 1 freq
corrected (2) - 11 freq
current (2) - 91 freq
forrest (2) - 5 freq
corset (2) - 2 freq
correctit (2) - 4 freq
cornert (2) - 1 freq
correckit (2) - 2 freq
conract (2) - 1 freq
forret (2) - 32 freq
correct (0) - 40 freq
'correct (2) - 1 freq
coreect (2) - 1 freq
corrects (2) - 1 freq
correck (2) - 3 freq
correctit (3) - 4 freq
corrected (3) - 11 freq
current (3) - 91 freq
corrupt (3) - 13 freq
correctet (3) - 2 freq
conract (3) - 1 freq
correckit (3) - 2 freq
correctly (3) - 6 freq
incorrect (3) - 1 freq
carret (3) - 3 freq
correctin (3) - 6 freq
carrant (4) - 2 freq
corrugat (4) - 1 freq
charact (4) - 1 freq
carriet (4) - 15 freq
correction (4) - 4 freq
currant (4) - 4 freq
careert (4) - 1 freq
curriest (4) - 1 freq
courrant (4) - 1 freq
SoundEx code - C623
crocodiles - 1 freq
crackt - 7 freq
christ - 198 freq
crackit - 23 freq
cursed - 43 freq
caressed - 2 freq
charactereisticallie - 1 freq
character - 90 freq
croasst - 3 freq
croquet - 47 freq
christmas - 324 freq
crocodile - 20 freq
crouched - 9 freq
croakit - 2 freq
croquet-grun - 8 freq
croqueted - 3 freq
croqueting - 1 freq
crashed - 28 freq
corset - 2 freq
crookit - 17 freq
curst - 2 freq
chorister - 1 freq
crossit - 7 freq
craw-step - 1 freq
crosst - 12 freq
correctly - 6 freq
correct - 40 freq
characters - 84 freq
curiosity - 34 freq
crust - 18 freq
crest - 7 freq
christ's - 5 freq
crusty - 5 freq
crossed - 100 freq
chairged - 35 freq
crystal - 27 freq
crusade - 4 freq
charged - 24 freq
crusader - 3 freq
characteristics - 7 freq
creshed - 1 freq
croqueit - 1 freq
crossd - 1 freq
cracked - 41 freq
christopher - 174 freq
christian - 99 freq
crests - 2 freq
chairacters - 9 freq
chairacter - 12 freq
christian-lyke - 1 freq
'christ - 2 freq
crack-heid - 1 freq
corrected - 11 freq
crusts - 12 freq
crystalise - 1 freq
curset - 1 freq
carsethorn - 5 freq
crested - 2 freq
christie - 15 freq
cherished - 9 freq
christened - 11 freq
cruikedbank - 37 freq
cruiked - 1 freq
christan - 1 freq
craiked - 4 freq
'cruikedbank - 1 freq
cruikit - 9 freq
crakt - 1 freq
curiositie - 1 freq
crystallie - 1 freq
christmastide - 1 freq
crushed - 21 freq
chairgit - 4 freq
cruist - 1 freq
christmases - 1 freq
carket - 1 freq
christendie - 1 freq
crecaitet - 1 freq
crakket - 2 freq
curyositee - 1 freq
christmiss - 1 freq
corrects - 1 freq
christmastime - 2 freq
crooked - 6 freq
cressida - 1 freq
crusadin - 1 freq
curiousity - 2 freq
'character' - 1 freq
christe - 1 freq
corsets - 2 freq
correctin - 6 freq
crystalite - 1 freq
cristo - 1 freq
christie's - 2 freq
croassed - 7 freq
correction - 4 freq
chrisitna - 1 freq
christina - 13 freq
christ' - 2 freq
crooched - 1 freq
crack't - 1 freq
chreestmas - 1 freq
croquet-gruund - 3 freq
correctet - 2 freq
croquetet - 1 freq
croquetin - 1 freq
crashad - 1 freq
character' - 1 freq
characteristic - 5 freq
crusht - 2 freq
christians - 10 freq
cricd - 1 freq
circuit - 6 freq
christophers - 3 freq
christopher's - 2 freq
christine - 16 freq
characterisation - 1 freq
creesht - 1 freq
cross-eed - 1 freq
correctit - 4 freq
crockett - 4 freq
cricket - 13 freq
cruised - 1 freq
christenin - 3 freq
curse'd - 2 freq
chargeth - 1 freq
crasht - 2 freq
crockt - 1 freq
creasht - 1 freq
crystals - 7 freq
croagit - 1 freq
croquet-grund - 2 freq
crichton - 14 freq
christianity - 6 freq
'crocodile' - 2 freq
croaked - 6 freq
courgette - 1 freq
christen - 5 freq
cairry-cot - 1 freq
corstorphine - 5 freq
'christopher - 9 freq
'correct - 1 freq
creished - 1 freq
charact - 1 freq
chreistiane - 5 freq
chreistianes - 1 freq
creaked - 4 freq
christmas-tree - 1 freq
chorused - 1 freq
christchurch - 2 freq
circuitry - 1 freq
crestan - 1 freq
crusted - 3 freq
croass-atlantic - 1 freq
cherged - 4 freq
characterised - 1 freq
crash't - 1 freq
cursit - 4 freq
cresseid - 2 freq
crossgates - 3 freq
craigdulleart - 2 freq
christenan - 1 freq
christentie - 1 freq
coorsed - 1 freq
croquet-groond - 2 freq
croquetground - 1 freq
croquetan - 1 freq
curriest - 1 freq
correckit - 2 freq
crawsteppit - 1 freq
correcting - 1 freq
crashit - 1 freq
chargit - 2 freq
crouchit - 1 freq
corrucciata - 1 freq
coruscating - 1 freq
christologie - 1 freq
christenins - 1 freq
creosote - 3 freq
cheeriest - 1 freq
crack-addicts - 1 freq
cruggit - 2 freq
crooged - 1 freq
christiann - 1 freq
crickets - 1 freq
corrugated - 3 freq
€˜christopher - 2 freq
€˜cricket - 1 freq
christenin' - 1 freq
christmassy - 2 freq
charcuterie - 1 freq
cross't - 1 freq
character's - 4 freq
crochet - 6 freq
corrections - 1 freq
cairrage-dryve - 1 freq
correkkit - 1 freq
corkit - 2 freq
crooshied - 1 freq
crocodile's - 1 freq
caracts - 1 freq
corssed - 2 freq
corrugat - 1 freq
christies - 1 freq
charactereestics - 1 freq
€œchrist - 4 freq
cross-toun - 1 freq
crusting - 1 freq
correctness - 4 freq
creeked - 1 freq
crystaline - 1 freq
craacked - 1 freq
cruisaders - 1 freq
creakit - 1 freq
christsake - 1 freq
christ-sake - 2 freq
€˜correct - 1 freq
-chairacter - 1 freq
crazed - 3 freq
€˜christ - 1 freq
cross-stitch - 2 freq
christa - 1 freq
crochets - 1 freq
characters' - 1 freq
crocket - 1 freq
christi - 2 freq
chairecters - 1 freq
corrugatit - 4 freq
curcuddoch - 3 freq
€œcrossed - 1 freq
crackit-open - 2 freq
curcuddochly - 1 freq
creased - 2 freq
crackdoon - 1 freq
cairacters - 1 freq
crooked-lik - 1 freq
crestfallen - 1 freq
cross-eyed - 1 freq
crachtless - 1 freq
croogit - 1 freq
corekt - 1 freq
coursed - 1 freq
chrisdeerin - 1 freq
coreect - 1 freq
christopherharv - 7 freq
cristiano - 1 freq
christinedonne - 2 freq
christmasparty - 1 freq
circuit-break - 1 freq
cristofoli - 1 freq
chrystal - 1 freq
craigdons - 6 freq
cruciate - 1 freq
christianilbury - 1 freq
christingle - 1 freq
christinamclar - 4 freq
christyscottmus - 2 freq
chrisodonnell - 3 freq
christinabrigg - 2 freq
charachter - 1 freq
christinasnp - 4 freq
chrisdarroch - 1 freq
chrisstirk - 1 freq
crossgatecentre - 4 freq
christinehoyÂ’s - 1 freq
christinehoy - 2 freq
christtocs - 8 freq
chrissyteigen - 1 freq
christinepert - 1 freq
christinef - 1 freq
carstairs - 1 freq
christmasaurus - 1 freq
christinecouser - 1 freq
curiosities - 1 freq
cursethesestreetlamps - 1 freq
chrisstephens - 1 freq
charscotswoman - 2 freq
christinah - 2 freq
christof - 1 freq
christineweth - 1 freq
christy - 1 freq
christmas' - 2 freq
christapeterso - 2 freq
craigstevenson - 2 freq
craigsutherland - 8 freq
curriestarfc - 1 freq
crookithame - 1 freq
MetaPhone code - KRKT
crackt - 7 freq
crackit - 23 freq
croquet - 47 freq
croakit - 2 freq
crookit - 17 freq
correct - 40 freq
croqueit - 1 freq
cracked - 41 freq
cruiked - 1 freq
craiked - 4 freq
cruikit - 9 freq
crakt - 1 freq
carket - 1 freq
crakket - 2 freq
crooked - 6 freq
crack't - 1 freq
cricd - 1 freq
crockett - 4 freq
cricket - 13 freq
crockt - 1 freq
croaked - 6 freq
cairry-cot - 1 freq
'correct - 1 freq
creaked - 4 freq
grogged - 1 freq
correckit - 2 freq
cruggit - 2 freq
€˜cricket - 1 freq
correkkit - 1 freq
corkit - 2 freq
corrugat - 1 freq
gourgaud - 1 freq
creeked - 1 freq
craacked - 1 freq
creakit - 1 freq
€˜correct - 1 freq
crocket - 1 freq
corekt - 1 freq
coreect - 1 freq
quarecuttie - 1 freq
CORRECT
Time to execute Levenshtein function - 0.442771 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.597115 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.062194 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041657 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000928 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.