A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to families in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
families (0) - 30 freq
faimilies (1) - 43 freq
familie (1) - 1 freq
emilies (2) - 1 freq
femlies (2) - 2 freq
bailies (2) - 2 freq
faimilie (2) - 21 freq
familiars (2) - 1 freq
faimlies (2) - 47 freq
family's (2) - 3 freq
ramilles (2) - 1 freq
failins (2) - 1 freq
mailies (2) - 1 freq
fairies (2) - 39 freq
familial (2) - 2 freq
familiar (2) - 73 freq
failzies (2) - 1 freq
faimiles (2) - 2 freq
dailies (2) - 1 freq
hamelins (3) - 1 freq
rallies (3) - 7 freq
bailie's (3) - 1 freq
danglies (3) - 2 freq
fallaes (3) - 1 freq
fabils (3) - 2 freq
families (0) - 30 freq
faimilies (1) - 43 freq
faimiles (2) - 2 freq
faimlies (2) - 47 freq
femlies (2) - 2 freq
familie (2) - 1 freq
femlees (3) - 1 freq
familial (3) - 2 freq
familiar (3) - 73 freq
females (3) - 9 freq
mailies (3) - 1 freq
emilies (3) - 1 freq
familiars (3) - 1 freq
faimilie (3) - 21 freq
family's (3) - 3 freq
faills (4) - 1 freq
family (4) - 329 freq
faimlie's (4) - 1 freq
maalies (4) - 6 freq
fameiliar (4) - 11 freq
yamils (4) - 1 freq
family- (4) - 1 freq
ferlies (4) - 61 freq
faimlie (4) - 140 freq
files (4) - 111 freq
SoundEx code - F542
faimilies - 43 freq
faimly's - 3 freq
finlay's - 2 freq
funnels - 4 freq
finalise - 2 freq
funny-like - 2 freq
faimlies - 47 freq
families - 30 freq
faimily's - 8 freq
faimiles - 2 freq
fine-luikin - 1 freq
femlie's - 1 freq
finals - 7 freq
finalised - 2 freq
fine-lookin - 2 freq
finelookin - 1 freq
family's - 3 freq
femlies - 2 freq
females' - 1 freq
funny-lukkan - 1 freq
femmels - 1 freq
final-stage - 1 freq
fenella's - 2 freq
femmelies - 1 freq
fummles - 2 freq
faimlie's - 1 freq
females - 9 freq
finalists - 1 freq
fammils - 1 freq
fionnlaigh - 1 freq
finlayson - 1 freq
femlees - 1 freq
fine-lik - 1 freq
fmwales - 1 freq
fnlz - 1 freq
MetaPhone code - FMLS
faimilies - 43 freq
faimly's - 3 freq
faimlies - 47 freq
families - 30 freq
faimily's - 8 freq
faimiles - 2 freq
femlie's - 1 freq
family's - 3 freq
femlies - 2 freq
females' - 1 freq
femmels - 1 freq
femmelies - 1 freq
fummles - 2 freq
faimlie's - 1 freq
females - 9 freq
fammils - 1 freq
femlees - 1 freq
fumbles - 2 freq
FAMILIES
Time to execute Levenshtein function - 0.181170 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.348972 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027540 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036990 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000906 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.