A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to ballerina in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - word (distance) - frequency
ballerina (0) - 2 freq
callerin (2) - 1 freq
bullerin (2) - 1 freq
balmerino (2) - 1 freq
algeria (3) - 1 freq
hollerin (3) - 2 freq
gollerin (3) - 6 freq
batherin (3) - 5 freq
barberin (3) - 1 freq
sillerin (3) - 4 freq
battering (3) - 2 freq
gallerie (3) - 1 freq
alterin (3) - 2 freq
barterin (3) - 1 freq
bacteria (3) - 4 freq
albertina (3) - 1 freq
ballin (3) - 3 freq
bulderin (3) - 2 freq
barbering (3) - 1 freq
balaena (3) - 1 freq
balerno (3) - 2 freq
ballymena (3) - 6 freq
bulletin (3) - 6 freq
galleries (3) - 8 freq
walterin (3) - 3 freq
Double Levenshtein - word (distance) - frequency
ballerina (0) - 2 freq
bullerin (2) - 1 freq
callerin (3) - 1 freq
balmerino (3) - 1 freq
ballymena (4) - 6 freq
balerno (4) - 2 freq
bulletin (4) - 6 freq
bulderin (4) - 2 freq
ballin (4) - 3 freq
ballamena (4) - 1 freq
gollerin (4) - 6 freq
sillerin (4) - 4 freq
hollerin (4) - 2 freq
bellin (5) - 3 freq
allourin (5) - 1 freq
bullert (5) - 2 freq
bulletins (5) - 2 freq
batterin (5) - 26 freq
billowin (5) - 3 freq
blarin (5) - 13 freq
blurrin (5) - 1 freq
bellamaina (5) - 2 freq
balluderon (5) - 3 freq
ballroom (5) - 5 freq
ballarat (5) - 1 freq
SoundEx code - B465
bell-ringin - 1 freq
blarin - 13 freq
blarin' - 1 freq
ballerina - 2 freq
blairen - 1 freq
ballroom - 5 freq
blue-rinsed - 2 freq
baallroom - 1 freq
bell-ringing - 1 freq
blearie-een't - 1 freq
bullerin - 1 freq
balornock - 1 freq
balerno - 2 freq
blaring - 2 freq
boilermaker - 1 freq
blurring - 1 freq
blurrin - 1 freq
blairmcdougall - 1 freq
balernoathletic - 1 freq
MetaPhone code - BLRN
blarin - 13 freq
blarin' - 1 freq
ballerina - 2 freq
blairen - 1 freq
bullerin - 1 freq
balerno - 2 freq
blurrin - 1 freq
Manually curated
BALLERINA
Time to execute Levenshtein function - 0.237596 milliseconds
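The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings. For example, ballerina becomes callerin in two edits: replace b with c and delete the final a.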
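As an illustration of the definition, here is a minimal Python sketch of the classic dynamic-programming approach. It is not the code behind this page; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn string a into string b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("ballerina", "callerin", "bullerin", "bulletin"):
        print(word, levenshtein("ballerina", word))
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.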
Time to execute Double Levenshtein function - 0.403967 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
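A rough Python sketch of this idea follows. The helper names are hypothetical, and the vowel set is an assumption: the page does not say which letters are stripped. Treating y as a vowel is a guess that happens to agree with the score of 4 listed for ballymena.

```python
VOWELS = set("aeiouy")  # assumption: the real vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with every assumed vowel removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("bullerin", "callerin", "balmerino", "ballymena"):
        print(word, double_levenshtein("ballerina", word))
```

Because consonant changes are counted in both passes, callerin (b to c) scores higher than bullerin (a to u) even though both are two plain edits away from ballerina.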
Time to execute SoundEx function - 0.029071 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
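To make the encoding concrete, here is a small Python sketch of the commonly described American Soundex rules (keep the first letter, map consonants to digits, collapse adjacent duplicates, pad or cut to four characters). It is an illustrative assumption about the rules, not the function this page calls, and it ignores non-letters such as hyphens and apostrophes.

```python
# Consonant groups and their Soundex digits.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, e.g. a letter and three digits."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        if digit:
            if digit != previous:   # skip repeats of the same digit
                code += digit
            previous = digit
        elif ch not in "hw":        # vowels separate repeated digits; h and w do not
            previous = ""
        if len(code) == 4:
            break
    return code.ljust(4, "0")       # pad short codes with zeros


if __name__ == "__main__":
    for word in ("ballerina", "blarin", "ballroom", "bell-ringin"):
        print(word, soundex(word))
```

Under these rules all four sample words reduce to B465, which is why they appear together in the SoundEx list.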
Time to execute MetaPhone function - 0.040477 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000807 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
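A lookup of this kind can be sketched in Python as a plain dictionary. The table below is purely illustrative: only the ballerina entry mirrors the result shown on this page, the other entry and all names are hypothetical, and the real lexicon's format is not documented here.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
LEMMA_TABLE = {
    "ballerina": "BALLERINA",   # matches the result shown on this page
    "ballerinas": "BALLERINA",  # hypothetical variant, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEMMA_TABLE.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("ballerina"))
    print(lookup_lemma("ahint"))  # not in this toy table, so None
```

A dictionary lookup does no string comparison across the corpus, which fits the very small execution time reported for this method.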