A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to finding in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
finding (0) - 16 freq
findin (1) - 71 freq
binding (1) - 1 freq
winding (1) - 2 freq
funding (1) - 5 freq
minding (1) - 7 freq
findings (1) - 4 freq
findins (1) - 4 freq
finking (1) - 1 freq
findin' (1) - 3 freq
bidding (2) - 4 freq
handing (2) - 2 freq
fundin (2) - 62 freq
finnins (2) - 1 freq
windin (2) - 28 freq
filming (2) - 2 freq
firming (2) - 1 freq
pinting (2) - 1 freq
grinding (2) - 2 freq
finin (2) - 3 freq
fundin' (2) - 37 freq
inning (2) - 1 freq
mindins (2) - 20 freq
jinking (2) - 1 freq
blinding (2) - 1 freq
finding (0) - 16 freq
funding (1) - 5 freq
findins (2) - 4 freq
findin' (2) - 3 freq
findings (2) - 4 freq
finking (2) - 1 freq
minding (2) - 7 freq
findin (2) - 71 freq
binding (2) - 1 freq
winding (2) - 2 freq
bonding (3) - 1 freq
fandin (3) - 1 freq
onding (3) - 9 freq
landing (3) - 11 freq
punding (3) - 1 freq
ending (3) - 17 freq
sending (3) - 13 freq
funning (3) - 1 freq
pending (3) - 1 freq
bending (3) - 2 freq
tending (3) - 3 freq
feeding (3) - 10 freq
findan (3) - 6 freq
fading (3) - 7 freq
folding (3) - 2 freq
SoundEx code - F535
fundin - 62 freq
fountains - 8 freq
findin - 71 freq
faintin - 4 freq
finndin - 11 freq
fendin - 5 freq
finndna - 1 freq
foontain - 4 freq
fanton - 3 freq
fondness - 7 freq
fundamental - 24 freq
findin' - 3 freq
fountain - 17 freq
finiteness - 1 freq
fenton - 8 freq
fundamentall - 1 freq
foontains - 7 freq
fintin - 1 freq
findins - 4 freq
funtain - 6 freq
fentin - 2 freq
fantin - 20 freq
fountain' - 1 freq
fountain's - 1 freq
fendiness - 1 freq
foondin - 13 freq
funding - 5 freq
findan - 6 freq
fendan - 1 freq
fundin' - 37 freq
fundemental - 2 freq
fandango - 2 freq
finding - 16 freq
fundamentally - 2 freq
faintan - 1 freq
fontaines - 1 freq
fountainpen - 1 freq
foontainheid - 1 freq
findings - 4 freq
fine-tuin - 1 freq
fontaine - 1 freq
fandin - 1 freq
fontainebleau - 1 freq
fundins - 1 freq
fandom - 1 freq
fundamentalist - 6 freq
foundiment - 1 freq
fundeen - 3 freq
'fundin - 1 freq
fundamentalism - 1 freq
foandness - 1 freq
€œfindan - 1 freq
fondniss - 1 freq
fenton's - 4 freq
foundin - 1 freq
findinganeish - 1 freq
fandan - 7 freq
fnteamwear - 10 freq
fndungeonmom - 1 freq
MetaPhone code - FNTNK
funding - 5 freq
fandango - 2 freq
finding - 16 freq
FINDING
Time to execute Levenshtein function - 0.181654 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373828 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027506 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037183 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000910 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.