A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to buildings in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
buildings (0) - 11 freq
buildins (1) - 30 freq
building (1) - 38 freq
buildin's (1) - 3 freq
buildin (2) - 108 freq
beilding (2) - 2 freq
buildin' (2) - 1 freq
biggings (3) - 3 freq
bidding (3) - 4 freq
bulking (3) - 1 freq
guidins (3) - 1 freq
biding (3) - 9 freq
ceilings (3) - 1 freq
blinding (3) - 1 freq
rebuilding (3) - 1 freq
buildan (3) - 2 freq
binding (3) - 1 freq
bindins (3) - 1 freq
sidings (3) - 1 freq
buidin (3) - 1 freq
briefings (3) - 1 freq
builder's (3) - 5 freq
bulging (3) - 1 freq
failings (3) - 1 freq
fillings (3) - 2 freq
buildings (0) - 11 freq
building (2) - 38 freq
buildin's (2) - 3 freq
buildins (2) - 30 freq
beilding (3) - 2 freq
bilangs (4) - 6 freq
beeldins (4) - 2 freq
bieldins (4) - 2 freq
buildin' (4) - 1 freq
buildin (4) - 108 freq
killings (5) - 2 freq
boiledeggs (5) - 1 freq
bullying (5) - 4 freq
jillings (5) - 10 freq
findings (5) - 4 freq
biddins (5) - 12 freq
belangs (5) - 79 freq
buntings (5) - 1 freq
builders (5) - 10 freq
bleeding (5) - 1 freq
builds (5) - 3 freq
boiling (5) - 2 freq
beildin (5) - 7 freq
belongs (5) - 14 freq
puddings (5) - 1 freq
SoundEx code - B435
buildin - 108 freq
blythness - 1 freq
blaudin - 2 freq
beltin - 22 freq
ballet-dauncers - 1 freq
boldness - 3 freq
building - 38 freq
bluidin - 3 freq
blodwin - 1 freq
bluidwin - 1 freq
buildings - 11 freq
bleedin - 23 freq
blytheness - 3 freq
buildins - 30 freq
blatant - 11 freq
bladnoch - 3 freq
beholden - 2 freq
blateness - 5 freq
bieldin - 7 freq
bleedin' - 1 freq
belten - 1 freq
baldin - 1 freq
bolotana - 1 freq
bliddin - 2 freq
bltime - 1 freq
blatantly - 3 freq
bulletin - 6 freq
buildin's - 3 freq
boltin - 3 freq
bleatin - 7 freq
bleatins - 1 freq
bluidan - 1 freq
bell-time - 3 freq
buildin' - 1 freq
built-in - 1 freq
blue-tinged - 1 freq
bolton - 2 freq
baldoon - 1 freq
beltane - 3 freq
beildin - 7 freq
bladin - 1 freq
buildan - 2 freq
baldan - 1 freq
bloodhound - 1 freq
bulletins - 2 freq
bulletin's - 1 freq
blaetness - 1 freq
beeldin - 2 freq
behowlden - 1 freq
beeldins - 2 freq
blydeness - 1 freq
buildeen - 1 freq
bleateen - 1 freq
blottan - 1 freq
bieldins - 2 freq
bloatin - 1 freq
bleeding - 1 freq
boldint - 1 freq
blitheness - 3 freq
belting - 2 freq
baldwin - 1 freq
€˜building - 1 freq
bluid-mither - 1 freq
bluid-matchin - 1 freq
€œbuildin - 1 freq
blythman - 1 freq
bluidyin - 1 freq
beilding - 2 freq
blednoch - 2 freq
bladenoch - 1 freq
bliadhna - 1 freq
bliadhnaichean - 1 freq
boulton - 1 freq
boldin - 1 freq
bleedinÂ’ - 1 freq
bleathing - 1 freq
MetaPhone code - BLTNKS
buildings - 11 freq
BUILDINGS
Time to execute Levenshtein function - 0.330838 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.434364 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028183 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040610 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001218 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.