A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to brazil in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
brazil (0) - 11 freq
brazils (1) - 1 freq
bradie (2) - 1 freq
brattil (2) - 1 freq
frail (2) - 18 freq
crail (2) - 6 freq
basil (2) - 2 freq
bravin (2) - 3 freq
branie (2) - 1 freq
grail (2) - 4 freq
braig (2) - 2 freq
blazin (2) - 12 freq
braid (2) - 254 freq
frawil (2) - 1 freq
rail (2) - 29 freq
braik (2) - 6 freq
trail (2) - 57 freq
brakin (2) - 15 freq
brain (2) - 151 freq
bruil (2) - 1 freq
bracin (2) - 2 freq
brazier (2) - 5 freq
brawl (2) - 1 freq
brazen (2) - 3 freq
grazin (2) - 11 freq
brazil (0) - 11 freq
brazils (2) - 1 freq
brazier (3) - 5 freq
bruil (3) - 1 freq
brawl (3) - 1 freq
brazen (3) - 3 freq
crazily (3) - 2 freq
grizel (4) - 15 freq
brutal (4) - 17 freq
brizz (4) - 3 freq
brill (4) - 12 freq
bar-l (4) - 1 freq
bravely (4) - 8 freq
brawlie (4) - 28 freq
bridal (4) - 4 freq
brel (4) - 2 freq
barl (4) - 1 freq
braidly (4) - 8 freq
brasilia (4) - 1 freq
breezin (4) - 1 freq
bryz (4) - 1 freq
braali (4) - 27 freq
brawly (4) - 36 freq
brally (4) - 4 freq
braally (4) - 1 freq
SoundEx code - B624
birsled - 3 freq
birslin - 6 freq
birslet - 1 freq
braikley - 1 freq
bracelet - 8 freq
birselt - 4 freq
breeshle - 3 freq
brussels - 24 freq
brickle - 1 freq
brazils - 1 freq
birsulen - 1 freq
brazil - 11 freq
barclays - 1 freq
burglars - 2 freq
breeshlin - 3 freq
broccoli - 2 freq
brazilian - 1 freq
'brazilian' - 1 freq
barcelona - 10 freq
burgled - 2 freq
burglaries - 2 freq
brussell - 1 freq
beer-swiller - 1 freq
bricklayers - 1 freq
barclay - 3 freq
'barcelona - 1 freq
brussels' - 3 freq
'brussels - 1 freq
breeshled - 10 freq
bruckle - 12 freq
brukkilnes - 1 freq
brukkil - 1 freq
bruckalaetion - 1 freq
brusquely - 2 freq
bresslaw - 1 freq
birsels - 1 freq
brucella - 1 freq
brooklyn - 2 freq
brooklyn's - 1 freq
braesslet - 1 freq
brouselt - 1 freq
bricklayin - 1 freq
birssled - 4 freq
bracelets - 3 freq
berkeley - 3 freq
burglar's - 1 freq
breishilt - 1 freq
breasley - 1 freq
breasley's - 2 freq
barce-lona - 1 freq
barcelona-airtit - 1 freq
bryceland - 1 freq
bracklach's - 1 freq
burglar - 3 freq
bruckleness - 1 freq
breek-legs - 1 freq
birsslin - 4 freq
birsslit - 1 freq
briskly - 3 freq
birsle - 1 freq
brookliners - 1 freq
burglers - 1 freq
burgle - 1 freq
burglary - 1 freq
breslin - 1 freq
birssle - 2 freq
birsilt - 2 freq
breeshlit - 1 freq
brickelly - 1 freq
brocklebankfash - 1 freq
barcelone - 1 freq
brasilia - 1 freq
barclaypeter - 1 freq
bruceleunson - 13 freq
barklandcroft - 2 freq
birseland - 7 freq
brissels - 1 freq
brissills - 1 freq
brocklebank - 1 freq
brazillian - 1 freq
brockelbank - 1 freq
MetaPhone code - BRSL
brazil - 11 freq
brussell - 1 freq
bresslaw - 1 freq
brucella - 1 freq
breasley - 1 freq
birsle - 1 freq
birssle - 2 freq
brasilia - 1 freq
BRAZIL
Time to execute Levenshtein function - 0.180834 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.342708 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027604 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037105 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000856 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.