A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bliadhnaichean in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bliadhnaichean (0) - 1 freq
flinrichen (6) - 1 freq
bliadhna (6) - 1 freq
blichan (7) - 1 freq
bbcnaidheachdan (7) - 1 freq
crannachan (7) - 4 freq
bird-watchers (7) - 1 freq
bladnoch (7) - 3 freq
bluid-matchin (7) - 1 freq
dunnichen (7) - 1 freq
seòmraichean (7) - 1 freq
laancheen (7) - 1 freq
blanche (7) - 14 freq
bladenoch (7) - 1 freq
laachan (7) - 2 freq
bradhainey (7) - 3 freq
baudelairean (7) - 1 freq
clinician (7) - 1 freq
bird-watchin (7) - 2 freq
cranachan (7) - 3 freq
blanched (7) - 2 freq
elsamaishman (7) - 1 freq
laanchan (7) - 1 freq
barbican (8) - 1 freq
cflanaghan (8) - 3 freq
bliadhnaichean (0) - 1 freq
bladenoch (9) - 1 freq
bladnoch (9) - 3 freq
bliadhna (9) - 1 freq
laanchan (10) - 1 freq
laancheen (10) - 1 freq
blanche (10) - 14 freq
dunnichen (10) - 1 freq
flinrichen (10) - 1 freq
bluid-matchin (10) - 1 freq
blanched (10) - 2 freq
blichan (10) - 1 freq
blednoch (10) - 2 freq
blenched (11) - 2 freq
branchin (11) - 3 freq
fleadhcheoil (11) - 1 freq
bleachin (11) - 4 freq
blorachin (11) - 1 freq
chainchan (11) - 1 freq
bhrochan (11) - 1 freq
lenchan (11) - 1 freq
launchin (11) - 4 freq
cranachan (11) - 3 freq
crannachan (11) - 4 freq
bradhainey (11) - 3 freq
SoundEx code - B435
buildin - 108 freq
blythness - 1 freq
blaudin - 2 freq
beltin - 22 freq
ballet-dauncers - 1 freq
boldness - 3 freq
building - 38 freq
bluidin - 3 freq
blodwin - 1 freq
bluidwin - 1 freq
buildings - 11 freq
bleedin - 23 freq
blytheness - 3 freq
buildins - 30 freq
blatant - 11 freq
bladnoch - 3 freq
beholden - 2 freq
blateness - 5 freq
bieldin - 7 freq
bleedin' - 1 freq
belten - 1 freq
baldin - 1 freq
bolotana - 1 freq
bliddin - 2 freq
bltime - 1 freq
blatantly - 3 freq
bulletin - 6 freq
buildin's - 3 freq
boltin - 3 freq
bleatin - 7 freq
bleatins - 1 freq
bluidan - 1 freq
bell-time - 3 freq
buildin' - 1 freq
built-in - 1 freq
blue-tinged - 1 freq
bolton - 2 freq
baldoon - 1 freq
beltane - 3 freq
beildin - 7 freq
bladin - 1 freq
buildan - 2 freq
baldan - 1 freq
bloodhound - 1 freq
bulletins - 2 freq
bulletin's - 1 freq
blaetness - 1 freq
beeldin - 2 freq
behowlden - 1 freq
beeldins - 2 freq
blydeness - 1 freq
buildeen - 1 freq
bleateen - 1 freq
blottan - 1 freq
bieldins - 2 freq
bloatin - 1 freq
bleeding - 1 freq
boldint - 1 freq
blitheness - 3 freq
belting - 2 freq
baldwin - 1 freq
€˜building - 1 freq
bluid-mither - 1 freq
bluid-matchin - 1 freq
€œbuildin - 1 freq
blythman - 1 freq
bluidyin - 1 freq
beilding - 2 freq
blednoch - 2 freq
bladenoch - 1 freq
bliadhna - 1 freq
bliadhnaichean - 1 freq
boulton - 1 freq
boldin - 1 freq
bleedinÂ’ - 1 freq
bleathing - 1 freq
MetaPhone code - BLTNXN
bliadhnaichean - 1 freq
BLIADHNAICHEAN
Time to execute Levenshtein function - 0.337086 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.524655 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033921 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040960 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000955 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.