A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to coleraine in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein - distance in brackets
coleraine (0) - 4 freq
cowlraine (2) - 2 freq
tolerance (3) - 7 freq
lorraine (3) - 6 freq
coherin (3) - 1 freq
coulrain (3) - 1 freq
cowerin (3) - 5 freq
clerkin (3) - 2 freq
coverin' (3) - 1 freq
cochrane (3) - 4 freq
coverage (3) - 15 freq
cocaine (3) - 6 freq
souerane (3) - 1 freq
columbine (3) - 2 freq
coveran (3) - 2 freq
chlorine (3) - 1 freq
covering (3) - 6 freq
couerin (3) - 1 freq
coulrain' (3) - 2 freq
colline (3) - 1 freq
coverin (3) - 29 freq
coleridge (3) - 1 freq
codeine (3) - 1 freq
tolerable (3) - 1 freq
cleane (3) - 1 freq
Double Levenshtein - distance in brackets
coleraine (0) - 4 freq
coulrain (3) - 1 freq
cowlraine (3) - 2 freq
colline (4) - 1 freq
coulrain' (4) - 2 freq
chlorine (4) - 1 freq
coverin (4) - 29 freq
colourin (4) - 3 freq
cleane (4) - 1 freq
clearan (4) - 2 freq
coveran (4) - 2 freq
clearin (4) - 27 freq
couerin (4) - 1 freq
cowerin (4) - 5 freq
clerkin (4) - 2 freq
coherin (4) - 1 freq
cleedin (5) - 1 freq
cheerin (5) - 25 freq
caperin (5) - 3 freq
clearing (5) - 3 freq
clane (5) - 4 freq
clartin (5) - 4 freq
cloorin (5) - 1 freq
cleanin (5) - 48 freq
clean (5) - 356 freq
SoundEx code - C465
clourin - 1 freq
clarinet - 3 freq
clearances - 12 freq
clearin - 27 freq
cowlraine - 2 freq
coulrain' - 2 freq
coulrain - 1 freq
cailry-on's - 1 freq
clarence - 2 freq
clearance - 16 freq
clearan - 2 freq
cullourin - 1 freq
clairance - 1 freq
chlorine - 1 freq
clarinda - 1 freq
callerin - 1 freq
clarence's - 1 freq
clairin - 1 freq
cloorin - 1 freq
clearing - 3 freq
clarion - 1 freq
colourin - 3 freq
clarendon - 1 freq
colourins - 1 freq
coleraine - 4 freq
clewran - 1 freq
chlorinated - 1 freq
clairehanm - 1 freq
colouring - 2 freq
clear-minded - 1 freq
colourin-in - 3 freq
clairehammond - 1 freq
MetaPhone code - KLRN
glarin - 7 freq
clourin - 1 freq
gollerin - 6 freq
glowrin - 6 freq
clearin - 27 freq
cowlraine - 2 freq
gleerin - 1 freq
glaurin - 1 freq
glourin - 1 freq
coulrain' - 2 freq
coulrain - 1 freq
clearan - 2 freq
cullourin - 1 freq
glaran - 2 freq
kilrenny - 1 freq
callerin - 1 freq
kloorin - 1 freq
clairin - 1 freq
cloorin - 1 freq
clarion - 1 freq
colourin - 3 freq
coleraine - 4 freq
clewran - 1 freq
Manually curated - COLERAINE
Time to execute Levenshtein function - 0.470944 milliseconds
The Levenshtein distance is the number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
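As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the prefix of a handled so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("coleraine", "cowlraine"))  # 2, as in the list above
print(levenshtein("coleraine", "cocaine"))    # 3, as in the list above
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.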
Time to execute Double Levenshtein function - 0.870331 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
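The idea can be sketched in Python as follows. This is a guess at the mechanics rather than the site's own code: it assumes the vowels are exactly a, e, i, o and u, and the names `strip_vowels` and `double_levenshtein` are made up for this example. Under those assumptions it reproduces the distances shown in the Double Levenshtein list.

```python
VOWELS = set("aeiou")  # assumption: y and w are treated as consonants


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, keeping consonants and any other characters."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("coleraine", "coulrain"))  # 3 + 0 = 3
print(double_levenshtein("coleraine", "clean"))     # 4 + 1 = 5
```

A word such as coulrain, which differs from coleraine only in its vowels, keeps its plain distance of 3, while words that change a consonant are pushed further down the list.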
Time to execute SoundEx function - 0.030549 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
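For readers who want to see how a code such as C465 comes about, here is a minimal Python sketch of the classic four-character Soundex encoding. It is an illustration, not the function this site calls, and it assumes non-letters such as apostrophes and hyphens are simply ignored.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the first letter plus up to three digits, padded with zeros."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]  # ignore non-letters
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of identical codes
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("coleraine"))  # C465
print(soundex("clarinet"))   # C465
```

Because only the first letter and the next three consonant codes are kept, long words such as clarinet and short ones such as clairin fall into the same bucket.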
Time to execute MetaPhone function - 0.085828 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000850 milliseconds
Manual curation uses a lookup table (lexicon), created by hand, which links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
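A lookup of this kind can be sketched in Python as a plain dictionary. The table below is hypothetical: only the link from coleraine to COLERAINE is suggested by the result on this page, the other two entries are invented to show how spelling variants might share a lemma, and the names `LEMMA_TABLE` and `curated_lemma` are not taken from the site.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
LEMMA_TABLE = {
    "coleraine": "COLERAINE",
    "cowlraine": "COLERAINE",  # invented entry, for illustration only
    "coulrain": "COLERAINE",   # invented entry, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMA_TABLE.get(word.lower())


print(curated_lemma("Coleraine"))  # COLERAINE
print(curated_lemma("sonsie"))     # None, since it is not in this toy table
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having by far the shortest execution time reported above.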