A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to collieshange in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
collieshange (0) - 1 freq
collieshangie (1) - 14 freq
collieshangies (2) - 2 freq
callishang (3) - 1 freq
collieston (4) - 1 freq
callyshang (4) - 1 freq
collapsan (5) - 1 freq
collegiate (5) - 1 freq
collecting (5) - 4 freq
colline (5) - 1 freq
polishing (5) - 3 freq
cellophane (5) - 1 freq
collieback (5) - 1 freq
colliebuckie (5) - 1 freq
collies (5) - 1 freq
yellaestane (5) - 3 freq
collapsing (5) - 1 freq
collarbane (5) - 1 freq
college (5) - 113 freq
coal-shade (5) - 1 freq
collegian (5) - 1 freq
coalie-bag (5) - 1 freq
collectan (5) - 2 freq
collision (5) - 5 freq
colledge (5) - 1 freq
collieshange (0) - 1 freq
collieshangie (1) - 14 freq
callishang (3) - 1 freq
collieshangies (3) - 2 freq
callyshang (4) - 1 freq
collieston (6) - 1 freq
cellophane (7) - 1 freq
polishing (7) - 3 freq
collision (7) - 5 freq
collapsing (7) - 1 freq
collecting (7) - 4 freq
pollisman (8) - 2 freq
culling (8) - 2 freq
callaghan (8) - 6 freq
claething (8) - 1 freq
cherishing (8) - 1 freq
colleague (8) - 15 freq
calling (8) - 15 freq
collusion (8) - 1 freq
mellishon (8) - 2 freq
publishing (8) - 10 freq
cullisien (8) - 1 freq
collegianer (8) - 37 freq
cullykhan (8) - 1 freq
cleeshin (8) - 1 freq
SoundEx code - C425
cleckin - 13 freq
clishmaclaivers - 2 freq
clachan - 101 freq
clekkin - 11 freq
clasin - 1 freq
claichin - 1 freq
collision - 5 freq
clashin - 9 freq
callishang - 1 freq
clicking - 3 freq
classmates - 2 freq
colluseum - 1 freq
clashmaclaver - 1 freq
clockin - 10 freq
collieshangie - 14 freq
claikin - 11 freq
cologne - 3 freq
collegianer - 37 freq
closin - 25 freq
cloakin - 1 freq
clickin - 13 freq
clauchan - 1 freq
close-mooths - 1 freq
colquhonnie - 1 freq
clachans - 30 freq
cluckin - 2 freq
clishmaclavers - 2 freq
closing - 8 freq
clocking - 1 freq
cleukin - 2 freq
cullisien - 1 freq
claiken - 1 freq
close-eein - 1 freq
clackin - 15 freq
callcentre - 1 freq
colosseum - 2 freq
coliseum - 1 freq
cloggin - 2 freq
colloguin - 7 freq
colleckin - 1 freq
clackan - 1 freq
cleekin - 11 freq
colossians - 1 freq
coalescence - 1 freq
cluckan - 1 freq
closeen - 2 freq
clickan - 2 freq
collegianers - 19 freq
clekkins - 1 freq
clossan - 2 freq
clashan - 2 freq
claikan - 1 freq
cluckeens - 1 freq
clickeen - 1 freq
clagging - 1 freq
clish-ma-claver - 2 freq
clagginess - 1 freq
clossmid - 1 freq
callyshang - 1 freq
cleikin - 4 freq
cuailgne - 1 freq
clish-maclaver - 1 freq
cleikins - 1 freq
collegian - 1 freq
'clishmaclaivers' - 1 freq
collieshangies - 2 freq
claagin - 2 freq
closin' - 1 freq
closemouths - 1 freq
clishmaclaverin - 1 freq
colloguein - 1 freq
clokkin - 1 freq
closeness - 2 freq
clishmaclash - 1 freq
chalkin - 1 freq
callaghan - 6 freq
collieshange - 1 freq
clessmates - 1 freq
clockens - 1 freq
cleeshin - 1 freq
collusion - 1 freq
cullykhan - 1 freq
clickimin - 1 freq
cwlcymro - 1 freq
clishnaclaver - 4 freq
caljamieson - 6 freq
clakkin - 1 freq
coulson - 2 freq
culzean - 1 freq
cowlickin - 1 freq
clecshin - 1 freq
clishmaclavers' - 1 freq
chloeejcampbell - 1 freq
'clochan' - 1 freq
MetaPhone code - KLXNJ
collieshangie - 14 freq
collieshange - 1 freq
COLLIESHANGE
Time to execute Levenshtein function - 0.341565 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.706811 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033738 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.090698 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000846 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.