A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bulldozers in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bulldozers (0) - 1 freq
bulldozer (1) - 2 freq
bulldozed (2) - 1 freq
bulldoze (2) - 1 freq
bullers (3) - 2 freq
bulldozin (3) - 1 freq
builders (3) - 10 freq
bulders (3) - 1 freq
bulldeug (4) - 1 freq
builders' (4) - 1 freq
belltooer (4) - 1 freq
bellower (4) - 1 freq
bullert (4) - 2 freq
blowers (4) - 1 freq
budders (4) - 2 freq
builder (4) - 14 freq
boolders (4) - 8 freq
blooters (4) - 3 freq
pullitzer (4) - 1 freq
gulders (4) - 6 freq
folloers (4) - 1 freq
bullets (4) - 29 freq
wallopers (4) - 2 freq
followers (4) - 35 freq
blazers (4) - 1 freq
bulldozers (0) - 1 freq
bulldozer (2) - 2 freq
bulldoze (4) - 1 freq
bulldozed (4) - 1 freq
bulders (5) - 1 freq
builders (5) - 10 freq
bullers (5) - 2 freq
bulldozin (5) - 1 freq
blazers (6) - 1 freq
boolders (6) - 8 freq
boulders (6) - 14 freq
ill-doers (7) - 1 freq
bulder (7) - 4 freq
bullocks (7) - 2 freq
builder's (7) - 5 freq
cullers (7) - 1 freq
bloomers (7) - 5 freq
bloopers (7) - 1 freq
blinders (7) - 2 freq
beholders (7) - 2 freq
bullies (7) - 6 freq
boozers (7) - 4 freq
bulldog (7) - 3 freq
guelders (7) - 1 freq
bullock's (7) - 1 freq
SoundEx code - B432
blades - 22 freq
bields - 13 freq
bluid's - 2 freq
bullets - 29 freq
blowts - 1 freq
blythsome - 1 freq
baltic - 29 freq
builds - 3 freq
blithesome - 3 freq
ballads - 24 freq
billets - 3 freq
blythesome - 10 freq
bluidshot - 3 freq
bluidstains - 1 freq
bloodshot - 1 freq
bolts - 12 freq
belts - 15 freq
bloats - 1 freq
blitz - 7 freq
blade's - 2 freq
blood-shot - 1 freq
bluid-soaked - 1 freq
bluid-stained - 1 freq
blotches - 3 freq
bull-thick - 1 freq
bleeds - 6 freq
blotch - 6 freq
blythsum - 1 freq
bloatches - 1 freq
baldastard - 1 freq
bloodshoat - 2 freq
bulldog - 3 freq
blatts - 5 freq
bloodsuckin - 1 freq
blotts - 1 freq
blood-curdlin - 2 freq
blytheswid - 1 freq
bulldozer - 2 freq
bulldozers - 1 freq
blythesum - 2 freq
biled-sweetie - 2 freq
bluid-shot - 1 freq
blitzen - 1 freq
blitzt - 1 freq
bleds - 1 freq
blathewick - 3 freq
blotchy - 2 freq
blads - 9 freq
'ballats' - 1 freq
bleudiest - 1 freq
blood-stained - 1 freq
blitzed - 1 freq
beldie's - 2 freq
bluidspring - 1 freq
bledds - 2 freq
bieldside - 2 freq
buhlitts - 2 freq
baults - 3 freq
'ballads' - 2 freq
bluidshed - 2 freq
bleats - 1 freq
balthasar - 1 freq
blöd-spring - 1 freq
bleddick-coloured - 1 freq
blate-kyn - 1 freq
blatk's - 1 freq
blauds - 1 freq
bloodgood - 1 freq
blithesum - 1 freq
bulldeug - 1 freq
bauldest - 1 freq
ballats - 1 freq
by-leids - 2 freq
bield's - 1 freq
bloodaxe - 2 freq
ballads' - 1 freq
bulldozed - 1 freq
byleids - 9 freq
blood-curdling - 1 freq
bluid-curdlin - 2 freq
blu-tack - 1 freq
belt-ish - 1 freq
bulldozin - 1 freq
bludgeonin - 1 freq
bloods - 5 freq
beilds - 1 freq
€™-bluidy-sulliven - 1 freq
blithsome - 2 freq
ballots - 1 freq
bye-leids - 3 freq
bloodstain - 1 freq
€œblythesome - 1 freq
bulldug - 2 freq
belt's - 1 freq
behaltkeeps - 1 freq
bloodstream - 1 freq
''bloodsuckers'' - 1 freq
blitzwalker - 1 freq
bulldoze - 1 freq
byleid's - 1 freq
baltasound - 1 freq
boiledeggs - 1 freq
MetaPhone code - BLTSRS
bulldozers - 1 freq
BULLDOZERS
Time to execute Levenshtein function - 0.232082 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.430545 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029538 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037369 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000855 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.