A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example "ablow".

Words similar to "finding" in the corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
finding (0) - 16 freq
findings (1) - 4 freq
binding (1) - 1 freq
minding (1) - 7 freq
winding (1) - 2 freq
findin (1) - 71 freq
findins (1) - 4 freq
funding (1) - 5 freq
finking (1) - 1 freq
findin' (1) - 3 freq
inning (2) - 1 freq
linking (2) - 2 freq
funning (2) - 1 freq
finkin (2) - 8 freq
ending (2) - 17 freq
lining (2) - 1 freq
vining (2) - 17 freq
folding (2) - 2 freq
mindings (2) - 1 freq
bindin (2) - 6 freq
fitting (2) - 9 freq
flinging (2) - 4 freq
sinking (2) - 5 freq
filing (2) - 1 freq
fundins (2) - 1 freq
Double Levenshtein
finding (0) - 16 freq
funding (1) - 5 freq
findins (2) - 4 freq
finking (2) - 1 freq
findin (2) - 71 freq
findin' (2) - 3 freq
winding (2) - 2 freq
findings (2) - 4 freq
binding (2) - 1 freq
minding (2) - 7 freq
findan (3) - 6 freq
tending (3) - 3 freq
pending (3) - 1 freq
fencing (3) - 1 freq
mending (3) - 1 freq
punding (3) - 1 freq
sending (3) - 13 freq
onding (3) - 9 freq
bonding (3) - 1 freq
handing (3) - 2 freq
fandin (3) - 1 freq
fendin (3) - 5 freq
fandango (3) - 2 freq
bending (3) - 2 freq
feeding (3) - 10 freq
SoundEx code - F535
fundin - 62 freq
fountains - 8 freq
findin - 71 freq
faintin - 4 freq
finndin - 11 freq
fendin - 5 freq
finndna - 1 freq
foontain - 4 freq
fanton - 3 freq
fondness - 6 freq
fundamental - 24 freq
findin' - 3 freq
fountain - 17 freq
fenton - 8 freq
fundamentall - 1 freq
foontains - 7 freq
fintin - 1 freq
findins - 4 freq
funtain - 6 freq
fentin - 2 freq
fantin - 20 freq
fountain' - 1 freq
fountain's - 1 freq
fendiness - 1 freq
foondin - 13 freq
funding - 5 freq
findan - 6 freq
fendan - 1 freq
fundin' - 37 freq
fundemental - 2 freq
fandango - 2 freq
finding - 16 freq
fundamentally - 2 freq
faintan - 1 freq
fontaines - 1 freq
fountainpen - 1 freq
foontainheid - 1 freq
findings - 4 freq
fine-tuin - 1 freq
fontaine - 1 freq
fandin - 1 freq
fontainebleau - 1 freq
fundins - 1 freq
fandom - 1 freq
fundamentalist - 6 freq
foundiment - 1 freq
fundeen - 3 freq
'fundin - 1 freq
fundamentalism - 1 freq
foandness - 1 freq
“findan - 1 freq
fondniss - 1 freq
fenton's - 4 freq
foundin - 1 freq
findinganeish - 1 freq
fandan - 7 freq
fnteamwear - 10 freq
fndungeonmom - 1 freq
MetaPhone code - FNTNK
funding - 5 freq
fandango - 2 freq
finding - 16 freq
Manually curated
FINDING
Time to execute Levenshtein function - 0.185869 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
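The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not necessarily the function this site runs; the name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning the first i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("finding", "findin"))  # one deletion
print(levenshtein("finding", "ending"))  # one deletion plus one replacement
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.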
Time to execute Double Levenshtein function - 0.360785 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
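A rough sketch of that idea in Python follows. It assumes that "vowels" means the letters a, e, i, o and u (so y counts as a consonant), which the description does not state; the helper names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop a, e, i, o, u (either case) and keep everything else."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("finding", "funding"))   # vowel change only
print(double_levenshtein("finding", "findin"))    # dropped consonant counts twice
```

Under these assumptions a vowel-only change such as finding/funding costs 1, while a dropped consonant such as finding/findin costs 2, which is the double weighting the description refers to.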
Time to execute SoundEx function - 0.027481 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
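As an illustration, here is a small Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent repeats, and pad or cut to four characters. It is an approximation written for this example, and the site's own function may differ in edge cases; the names `CODES` and `soundex` are chosen here.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are ignored and do not separate repeated codes
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code is emitted again
    return result.ljust(4, "0")


for w in ("finding", "fountains", "fandango"):
    print(w, soundex(w))
```

Because only the first letter and the first three coded consonants survive, words as different as "finding" and "fandango" can share one code, which is why a Soundex list tends to be long and loose.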
Time to execute MetaPhone function - 0.036871 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000805 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
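A lookup of this kind might look like the Python sketch below. The entries in `LEXICON` are hypothetical examples invented for illustration and are not taken from the site's actual table; the function name `lookup_lemma` is also an assumption.

```python
from typing import Optional

# Hypothetical hand-made entries: each spelling variant points to a lemma.
LEXICON = {
    "finding": "FINDING",
    "findin": "FINDING",
    "findin'": "FINDING",
    "findan": "FINDING",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.strip().lower())


print(lookup_lemma("Findin'"))  # a variant covered by the table
print(lookup_lemma("ablow"))    # not in this toy table, so None
```

A dictionary lookup does no string comparison against the rest of the corpus, so it is very cheap, but it only knows the words someone has entered by hand.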