A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to empire in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
empire (0) - 82 freq
expire (1) - 2 freq
empire' (1) - 1 freq
empie (1) - 25 freq
impire (1) - 2 freq
empires (1) - 8 freq
entire (2) - 32 freq
empire's (2) - 6 freq
vampire (2) - 14 freq
emperer (2) - 1 freq
aspire (2) - 4 freq
expires (2) - 1 freq
empires' (2) - 2 freq
amyire (2) - 1 freq
impyre (2) - 2 freq
kempie (2) - 1 freq
impires (2) - 1 freq
emptie (2) - 1 freq
maire (2) - 146 freq
impure (2) - 1 freq
spire (2) - 5 freq
expired (2) - 3 freq
pire (2) - 1 freq
elvire (2) - 4 freq
empied (2) - 3 freq
Double Levenshtein
empire (0) - 82 freq
impire (1) - 2 freq
expire (2) - 2 freq
impure (2) - 1 freq
impyre (2) - 2 freq
empires (2) - 8 freq
empire' (2) - 1 freq
empie (2) - 25 freq
impires (3) - 1 freq
empied (3) - 3 freq
emptie (3) - 1 freq
empooer (3) - 2 freq
spire (3) - 5 freq
maire (3) - 146 freq
aspire (3) - 4 freq
vampire (3) - 14 freq
mire (3) - 14 freq
emperer (3) - 1 freq
pire (3) - 1 freq
empien (3) - 1 freq
empouer (3) - 3 freq
amyire (3) - 1 freq
pure (4) - 676 freq
impose (4) - 10 freq
emorra (4) - 1 freq
SoundEx code - E516
embra - 96 freq
embrae - 4 freq
embro - 86 freq
embro's - 2 freq
empire - 82 freq
embarrassinly - 1 freq
embarrassin - 15 freq
embarrasment - 1 freq
embarrassed - 39 freq
embarrassment - 24 freq
embarrass - 10 freq
environment - 46 freq
embraced - 7 freq
embers - 8 freq
emporor - 1 freq
eonversation - 2 freq
ember's - 1 freq
embrace - 19 freq
environmental - 15 freq
embroidert - 1 freq
empress - 3 freq
empires - 8 freq
enforcin - 1 freq
embroidered - 7 freq
emperor - 40 freq
environs - 5 freq
embracin - 7 freq
empressed - 1 freq
embroil - 1 freq
empire's - 6 freq
enforce - 6 freq
embraked - 1 freq
enforcement - 5 freq
embarked - 1 freq
embarris - 1 freq
embarrassmint - 3 freq
embra's - 4 freq
empires' - 2 freq
emprical - 1 freq
embarrast - 4 freq
emperors - 2 freq
empowered - 2 freq
emburgh - 1 freq
embark - 2 freq
embarrassing - 10 freq
emperialiss - 1 freq
embroidery - 6 freq
empourement - 1 freq
empooerin - 1 freq
empouer - 3 freq
empourment - 3 freq
empouerment - 2 freq
embraces - 1 freq
empouered - 1 freq
empooer - 2 freq
emperor's - 20 freq
eonfeirance - 1 freq
emperoar - 1 freq
empire' - 1 freq
empirical - 1 freq
emperer - 1 freq
enforcit - 2 freq
embaurassin - 1 freq
embaurassed - 1 freq
environmentally - 4 freq
environments - 3 freq
enforced - 3 freq
embryos - 3 freq
embrowan - 2 freq
envirounit - 1 freq
embrasse - 1 freq
’empereur - 1 freq
empouerin - 1 freq
empowerin - 1 freq
embroiled - 1 freq
embroider - 1 freq
enfer - 1 freq
enbra - 2 freq
embro-born - 1 freq
embro-basit - 1 freq
enviro-howe - 1 freq
embarressed - 1 freq
embarassed - 1 freq
embra-based - 1 freq
empiricism - 1 freq
environmentalism - 1 freq
embarras - 1 freq
embairrassed - 2 freq
embarassing - 5 freq
ewanporteous - 1 freq
embarrasing - 1 freq
empower - 2 freq
MetaPhone code - EMPR
empire - 82 freq
empouer - 3 freq
empooer - 2 freq
empire' - 1 freq
Manually curated - EMPIRE
Time to execute Levenshtein function - 0.177932 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
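As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming Levenshtein distance. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn a into b (standard dynamic-programming approach)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word on the inner loop
    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("empire", "expire"))   # one replacement
print(levenshtein("empire", "vampire"))  # one insertion, one replacement
```

The distances this produces for expire, empie, entire and vampire agree with the bracketed figures in the Levenshtein listing above.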
Time to execute Double Levenshtein function - 0.369472 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
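A rough Python sketch of that idea follows. The vowel set is an assumption: treating y as a vowel is consistent with impyre scoring 2 in the listing above, but the site's actual rule is not stated. The names `double_levenshtein` and `strip_vowels` are invented for the example.

```python
VOWELS = set("aeiouy")  # assumption: y counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any punctuation."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the whole words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("empire", "impire"))  # vowel change only
print(double_levenshtein("empire", "empie"))   # a lost consonant counts twice
```

Under these assumptions, impire stays at 1 because only a vowel differs, while empie rises from 1 to 2 because the missing r is counted in both passes.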
Time to execute SoundEx function - 0.035845 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
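To show how such a code is built, here is a minimal Python sketch of the classic four-character Soundex. It follows the commonly published rules and may differ in edge cases from whatever implementation this site calls; the function name `soundex` is chosen for the example.

```python
# Consonant groups of classic Soundex; vowels, h, w and y carry no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return the first letter plus up to three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
            if len(code) == 4:
                break
        if ch not in "hw":  # h and w do not separate repeated codes
            previous = digit
    return code.ljust(4, "0")


print(soundex("empire"))       # E516
print(soundex("environment"))  # E516
```

Only the first three coded consonants count, which is why words as different as empire, embra and environment land on the same code, E516.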
Time to execute MetaPhone function - 0.037024 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000872 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
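The lookup can be pictured as a plain dictionary, as in the Python sketch below. Only the empire to EMPIRE pairing comes from this page; the other entries, the name `LEXICON` and the function `lookup_lemma` are hypothetical and exist purely for illustration.

```python
from typing import Optional

# Hand-built table mapping surface forms to lemmas.
# "empire" -> "EMPIRE" is taken from the result above;
# the remaining entries are hypothetical examples.
LEXICON = {
    "empire": "EMPIRE",
    "empires": "EMPIRE",  # hypothetical entry
    "impire": "EMPIRE",   # hypothetical entry
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("empire"))  # EMPIRE
print(lookup_lemma("sonsie"))  # None, not in this toy table
```

A dictionary lookup does no string comparison across the corpus, which fits the very short execution time reported for this method.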