A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to embro-basit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
embro-basit (0) - 1 freq
embra-based (3) - 1 freq
embro-born (4) - 1 freq
emboiain (5) - 2 freq
embarrast (5) - 4 freq
embracin (5) - 7 freq
embrowan (5) - 2 freq
embro's (5) - 2 freq
embroil (5) - 1 freq
embroidert (5) - 1 freq
embrasse (5) - 1 freq
brodcast (5) - 1 freq
approbatit (5) - 3 freq
brakit (6) - 1 freq
embrae (6) - 4 freq
emigratit (6) - 1 freq
brakwast (6) - 4 freq
demobit (6) - 1 freq
obayit (6) - 2 freq
grossit (6) - 2 freq
professit (6) - 2 freq
onabasit (6) - 1 freq
embaurassin (6) - 1 freq
roamit (6) - 1 freq
croakit (6) - 2 freq
embro-basit (0) - 1 freq
embra-based (4) - 1 freq
embro-born (6) - 1 freq
embarrast (7) - 4 freq
embroidert (8) - 1 freq
embro's (8) - 2 freq
embrasse (8) - 1 freq
brodcast (8) - 1 freq
more-as (9) - 1 freq
baby-sit (9) - 1 freq
babysit (9) - 1 freq
barrabest (9) - 2 freq
bro-ski (9) - 1 freq
embarrass (9) - 10 freq
bbsit (9) - 1 freq
broadest (9) - 3 freq
nor-wast (9) - 2 freq
brodist (9) - 1 freq
embraces (9) - 1 freq
nor-east (9) - 21 freq
markeast (9) - 1 freq
brissit (9) - 1 freq
ambros (9) - 1 freq
breast (9) - 12 freq
umbrellas (9) - 5 freq
SoundEx code - E516
embra - 96 freq
embrae - 4 freq
embro - 86 freq
embro's - 2 freq
empire - 82 freq
embarrassinly - 1 freq
embarrassin - 13 freq
embarrasment - 1 freq
embarrassed - 37 freq
embarrassment - 23 freq
embarrass - 10 freq
environment - 46 freq
embraced - 7 freq
embers - 7 freq
emporor - 1 freq
eonversation - 2 freq
ember's - 1 freq
embrace - 19 freq
environmental - 14 freq
embroidert - 1 freq
empress - 3 freq
empires - 8 freq
enforcin - 1 freq
embroidered - 7 freq
emperor - 40 freq
environs - 5 freq
embracin - 7 freq
empressed - 1 freq
embroil - 1 freq
enforce - 6 freq
embraked - 1 freq
enforcement - 5 freq
embarked - 1 freq
embarris - 1 freq
embarrassmint - 3 freq
embra's - 4 freq
empires' - 2 freq
emprical - 1 freq
embarrast - 4 freq
empire's - 5 freq
emperors - 2 freq
empowered - 2 freq
emburgh - 1 freq
embark - 2 freq
embarrassing - 10 freq
emperialiss - 1 freq
embroidery - 6 freq
empourement - 1 freq
empooerin - 1 freq
empouer - 3 freq
empourment - 3 freq
empouerment - 2 freq
embraces - 1 freq
empouered - 1 freq
empooer - 2 freq
emperor's - 20 freq
eonfeirance - 1 freq
emperoar - 1 freq
empire' - 1 freq
empirical - 1 freq
emperer - 1 freq
enforcit - 2 freq
embaurassin - 1 freq
embaurassed - 1 freq
environmentally - 4 freq
environments - 3 freq
enforced - 3 freq
embryos - 3 freq
embrowan - 2 freq
envirounit - 1 freq
embrasse - 1 freq
€™empereur - 1 freq
empouerin - 1 freq
empowerin - 1 freq
embroiled - 1 freq
embroider - 1 freq
enfer - 1 freq
enbra - 2 freq
embro-born - 1 freq
embro-basit - 1 freq
enviro-howe - 1 freq
embarressed - 1 freq
embarassed - 1 freq
embra-based - 1 freq
empiricism - 1 freq
environmentalism - 1 freq
embarras - 1 freq
embairrassed - 2 freq
embarassing - 5 freq
ewanporteous - 1 freq
embarrasing - 1 freq
empower - 2 freq
MetaPhone code - EMRBST
embro-basit - 1 freq
embra-based - 1 freq
EMBRO-BASIT
Time to execute Levenshtein function - 0.366173 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.687492 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032658 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.079888 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000997 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.