A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to clashan in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
clashan (0) - 2 freq
crashan (1) - 3 freq
clashin (1) - 9 freq
clachan (1) - 101 freq
lashan (1) - 1 freq
flashan (1) - 2 freq
clachans (2) - 30 freq
clash (2) - 41 freq
clossan (2) - 2 freq
slashin (2) - 3 freq
clasht (2) - 4 freq
clashbag (2) - 1 freq
clauchan (2) - 1 freq
rashan (2) - 2 freq
crashin (2) - 18 freq
cashin (2) - 1 freq
claikan (2) - 1 freq
loshan (2) - 1 freq
clankan (2) - 1 freq
washan (2) - 3 freq
clangan (2) - 1 freq
castan (2) - 1 freq
clappan (2) - 2 freq
splashan (2) - 2 freq
dashan (2) - 1 freq
Double Levenshtein (distance in brackets)
clashan (0) - 2 freq
clashin (1) - 9 freq
lashan (2) - 1 freq
flashan (2) - 2 freq
clachan (2) - 101 freq
crashan (2) - 3 freq
clashes (3) - 2 freq
plashin (3) - 3 freq
claspin (3) - 7 freq
blashin (3) - 1 freq
flashin (3) - 28 freq
blushan (3) - 1 freq
clathin (3) - 1 freq
cleeshin (3) - 1 freq
lashin (3) - 8 freq
clashed (3) - 4 freq
clash' (3) - 1 freq
flashen (3) - 2 freq
clasin (3) - 1 freq
clauchan (3) - 1 freq
cashin (3) - 1 freq
clasht (3) - 4 freq
slashin (3) - 3 freq
clash (3) - 41 freq
clossan (3) - 2 freq
SoundEx code - C425
cleckin - 13 freq
clishmaclaivers - 2 freq
clachan - 101 freq
clekkin - 11 freq
clasin - 1 freq
claichin - 1 freq
collision - 5 freq
clashin - 9 freq
callishang - 1 freq
clicking - 3 freq
classmates - 2 freq
colluseum - 1 freq
clashmaclaver - 1 freq
clockin - 10 freq
collieshangie - 15 freq
claikin - 11 freq
cologne - 3 freq
collegianer - 37 freq
closin - 27 freq
cloakin - 1 freq
clickin - 13 freq
clauchan - 1 freq
close-mooths - 1 freq
colquhonnie - 1 freq
clachans - 30 freq
cluckin - 2 freq
clishmaclavers - 2 freq
closing - 8 freq
clocking - 1 freq
cleukin - 2 freq
cullisien - 1 freq
claiken - 1 freq
coalescence - 2 freq
close-eein - 1 freq
clackin - 15 freq
callcentre - 1 freq
colosseum - 2 freq
coliseum - 1 freq
cloggin - 2 freq
colloguin - 7 freq
colleckin - 1 freq
clackan - 1 freq
cleekin - 11 freq
colossians - 1 freq
cluckan - 1 freq
closeen - 2 freq
clickan - 2 freq
collegianers - 19 freq
clekkins - 1 freq
clossan - 2 freq
clashan - 2 freq
claikan - 1 freq
cluckeens - 1 freq
clickeen - 1 freq
clagging - 1 freq
clish-ma-claver - 2 freq
clagginess - 1 freq
clossmid - 1 freq
callyshang - 1 freq
cleikin - 4 freq
cuailgne - 1 freq
clish-maclaver - 1 freq
cleikins - 1 freq
collegian - 1 freq
'clishmaclaivers' - 1 freq
collieshangies - 2 freq
claagin - 2 freq
closin' - 1 freq
closemouths - 1 freq
clishmaclaverin - 1 freq
colloguein - 1 freq
clokkin - 1 freq
closeness - 2 freq
clishmaclash - 1 freq
chalkin - 1 freq
callaghan - 6 freq
collieshange - 1 freq
clessmates - 1 freq
clockens - 1 freq
cleeshin - 1 freq
collusion - 1 freq
cullykhan - 1 freq
clickimin - 1 freq
cwlcymro - 1 freq
clishnaclaver - 4 freq
caljamieson - 6 freq
clakkin - 1 freq
coulson - 2 freq
culzean - 1 freq
cowlickin - 1 freq
clecshin - 1 freq
clishmaclavers' - 1 freq
chloeejcampbell - 1 freq
'clochan' - 1 freq
MetaPhone code - KLXN
clachan - 101 freq
claichin - 1 freq
collision - 5 freq
clashin - 9 freq
clootchin - 1 freq
clauchan - 1 freq
clatchin - 1 freq
clutchin - 12 freq
clashan - 2 freq
clutchan - 1 freq
galician - 2 freq
'galician' - 1 freq
gollachin - 1 freq
cleeshin - 1 freq
collusion - 1 freq
coalition - 8 freq
'clochan' - 1 freq
Manually curated - CLASHAN
Time to execute Levenshtein function - 0.262028 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
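The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not necessarily the implementation this page runs; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character replacements,
    insertions or deletions needed to turn `a` into `b`."""
    # previous[j] holds the distance between the processed prefix of `a`
    # and the first j characters of `b`.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters of `a` into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a (free if equal)
            ))
        previous = current
    return previous[-1]


# Distances for a few words from the results above
for word in ("clashan", "clachan", "lashan", "clash", "splashan"):
    print(word, levenshtein("clashan", word))
```

The distances printed for these words agree with the bracketed values in the Levenshtein results (0, 1, 1, 2 and 2).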
Time to execute Double Levenshtein function - 0.387681 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
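A rough sketch of that idea in Python follows. It assumes the vowels are exactly a, e, i, o and u (the page does not say how y or accented letters are treated), and the names `levenshtein`, `strip_vowels` and `double_levenshtein` are illustrative rather than taken from the site.

```python
VOWELS = set("aeiou")  # assumption: y and accented vowels are not stripped


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: replacements, insertions and deletions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("clashin", "clachan", "lashan", "clashes", "clash"):
    print(word, double_levenshtein("clashan", word))
```

Under these assumptions the sketch reproduces the Double Levenshtein values listed above: a vowel-only change such as clashin costs 1, while a consonant change such as clachan is counted in both passes and costs 2.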
Time to execute SoundEx function - 0.029789 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
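As an illustration, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, skip repeated codes, and pad or cut to four characters. Whether the page uses exactly this variant is an assumption; the function name `soundex` and the decision to ignore non-letters such as hyphens are choices made for this sketch.

```python
# Consonant groups that share a Soundex digit; vowels, h, w and y have no code.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if `word` has no letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last_code = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != last_code:
            result += code
            if len(result) == 4:
                break
        last_code = code  # a vowel resets the code, so a repeat is kept
    return result.ljust(4, "0")


print(soundex("clashan"))    # C425
print(soundex("collision"))  # C425
print(soundex("clachan"))    # C425
```

All three words collapse to C425, which is why they appear together in the SoundEx results despite their different spellings.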
Time to execute MetaPhone function - 0.039828 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000994 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
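A lookup of this kind might look like the following Python sketch. The table holds only the single pairing shown in the results above (clashan to CLASHAN); the names `LEXICON` and `lookup_lemma`, and the choice to lowercase the input, are hypothetical and not taken from the site.

```python
from typing import Optional

# Hand-built table from word form to lemma. A real lexicon would hold many
# more entries; only the pairing visible on this page is included here.
LEXICON = {
    "clashan": "CLASHAN",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for `word`, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("clashan"))  # CLASHAN
print(lookup_lemma("sonsie"))   # None, because this sketch has no entry for it
```

Returning None for missing words reflects the note that not all words are covered, and a single dictionary access fits with this being the fastest of the five timings.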