A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to emigrated in Corpus

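The result lists below follow in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in parentheses are distances from emigrated.

Levenshtein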
emigrated (0) - 10 freq
emigratet (1) - 1 freq
emigrate (1) - 3 freq
emigratit (2) - 1 freq
migrate (2) - 1 freq
migrates (2) - 1 freq
emanated (3) - 2 freq
embraked (3) - 1 freq
emigrant's (3) - 1 freq
emaciated (3) - 2 freq
vibrated (3) - 1 freq
emitted (3) - 2 freq
migratin (3) - 1 freq
imitated (3) - 2 freq
emigrant (3) - 2 freq
pirated (3) - 1 freq
denigrate (3) - 2 freq
emigrants (3) - 1 freq
engraved (3) - 6 freq
grated (3) - 1 freq
designated (3) - 6 freq
embraced (3) - 7 freq
separated (4) - 8 freq
fixated (4) - 5 freq
weighted (4) - 3 freq

Double Levenshtein
emigrated (0) - 10 freq
emigrate (2) - 3 freq
emigratet (2) - 1 freq
migrates (3) - 1 freq
migrate (3) - 1 freq
emigratit (3) - 1 freq
migratin (4) - 1 freq
grated (4) - 1 freq
engraved (5) - 6 freq
embraced (5) - 7 freq
imported (5) - 4 freq
migration (5) - 10 freq
emigrants (5) - 1 freq
gyrated (5) - 1 freq
moderated (5) - 1 freq
migratory (5) - 1 freq
vibrated (5) - 1 freq
emaciated (5) - 2 freq
emanated (5) - 2 freq
emitted (5) - 2 freq
embraked (5) - 1 freq
pirated (5) - 1 freq
emigrant (5) - 2 freq
imitated (5) - 2 freq
evaporated (6) - 1 freq
SoundEx code - E526
engraved - 6 freq
encouraging - 8 freq
encouraginly - 2 freq
encouraged - 26 freq
encourage - 66 freq
enshrined - 2 freq
engrossed - 5 freq
enquire - 4 freq
enquiry - 7 freq
emigrants - 1 freq
ensure - 19 freq
encircled - 1 freq
encouragin' - 4 freq
enshore - 2 freq
encroaching - 2 freq
enquired - 9 freq
emigrate - 3 freq
encouragin - 18 freq
encore - 5 freq
encouragement - 28 freq
enquires - 5 freq
enchor't - 1 freq
'encouraged' - 1 freq
engravins - 1 freq
ensurin - 3 freq
emigratet - 1 freq
encourages - 4 freq
emigrant - 2 freq
emigrant's - 1 freq
engorged - 1 freq
enquirt - 1 freq
enquiries - 6 freq
encouragemint - 1 freq
encroached - 6 freq
ensures - 3 freq
emissary - 2 freq
encouragan - 2 freq
emigrated - 10 freq
encrusted - 2 freq
ee-winkers - 2 freq
eonger-eel - 1 freq
eimagerie - 1 freq
encouraget - 1 freq
encouragit - 4 freq
enshair - 1 freq
encroachin - 2 freq
encroach - 1 freq
enquirin - 3 freq
emigratit - 1 freq
enshuir - 1 freq
engert - 1 freq
encooraged - 1 freq
ensuring - 2 freq
engerlush - 4 freq
engurlesh - 1 freq
engurlish - 3 freq
engurland - 2 freq
engurlush - 2 freq
emmagraeauthor - 12 freq
enquiring - 1 freq
MetaPhone code - EMKRTT
emigratet - 1 freq
emigrated - 10 freq
emigratit - 1 freq
Manually curated - EMIGRATED
Time to execute Levenshtein function - 0.200752 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
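The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions
    or substitutions needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if char_a == char_b else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("emigratet", "emigrate", "migrate", "emigratit"):
        print(word, levenshtein("emigrated", word))
```

For example, `levenshtein("emigrated", "migrate")` is 2: delete the leading e and the final d.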
Time to execute Double Levenshtein function - 0.384164 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
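A rough sketch of that idea in Python follows. The names `double_levenshtein` and `strip_vowels` are invented for this example, and the vowel set (a, e, i, o, u, y) is an assumption, since the page does not say which letters are stripped.

```python
VOWELS = set("aeiouy")  # assumption: the page does not list its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("emigrate", "migrate", "grated", "engraved"):
        print(word, double_levenshtein("emigrated", word))
```

With this vowel set, "emigrated" against "emigrate" scores 1 on the full words plus 1 on the skeletons "mgrtd" and "mgrt", giving 2.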
Time to execute SoundEx function - 0.027543 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
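To show how quite different words end up sharing one code, here is a minimal sketch of the classic American Soundex rules in Python. It is an illustration only; the function name `soundex` and the handling of non-letters (they are simply ignored) are assumptions, and the site's own implementation may differ in edge cases.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code: first letter plus three digits."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit is kept
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("emigrated", "engraved", "encourage", "ee-winkers"):
        print(word, soundex(word))
```

Only the first letter and the next three coded consonants matter, which is why "emigrated", "engraved" and "encourage" all reduce to E526.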
Time to execute MetaPhone function - 0.039274 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000904 milliseconds
Manual Curation uses a hand-built lookup table / lexicon that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
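A lookup of this kind could be sketched in Python as a plain dictionary. The entries and the names `LEXICON` and `curated_lookup` below are invented for illustration and are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical hand-made lexicon: spelling variant -> head form.
# These entries are made up for this example.
LEXICON = {
    "emigrated": "EMIGRATED",
    "emigratet": "EMIGRATED",
    "emigratit": "EMIGRATED",
}


def curated_lookup(word: str) -> Optional[str]:
    """Return the curated head form for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(curated_lookup("emigratit"))  # a variant present in the table
    print(curated_lookup("sonsie"))     # not in this toy table, so None
```

A dictionary lookup does no computation on the word itself, which fits the very short execution time reported for this method, but it can only answer for words someone has already entered.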