A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sangobeg in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sangobeg (0) - 1 freq
sanger (3) - 1 freq
hangover (3) - 10 freq
sange (3) - 1 freq
handbeg (3) - 1 freq
tangoed (3) - 1 freq
hangower (3) - 3 freq
sangster (3) - 24 freq
snowey (4) - 1 freq
sannies (4) - 7 freq
sangbeuks (4) - 1 freq
salbe (4) - 7 freq
danglies (4) - 2 freq
langlegs (4) - 3 freq
sansgter (4) - 1 freq
tango (4) - 10 freq
shangie (4) - 1 freq
oanyone (4) - 1 freq
mangrove (4) - 1 freq
rangoon (4) - 1 freq
santered (4) - 2 freq
sancte (4) - 4 freq
yangtze (4) - 1 freq
hanbag (4) - 1 freq
range (4) - 93 freq
sangobeg (0) - 1 freq
svgbg (5) - 1 freq
sangbuik (5) - 1 freq
handbeg (5) - 1 freq
singing (5) - 29 freq
sange (5) - 1 freq
sanger (5) - 1 freq
gangable (6) - 1 freq
snob (6) - 10 freq
languag (6) - 1 freq
banging (6) - 9 freq
ganging (6) - 3 freq
sangria (6) - 2 freq
songbook (6) - 1 freq
slugabed (6) - 1 freq
sunbem (6) - 1 freq
sangs (6) - 256 freq
bangbang (6) - 1 freq
singe (6) - 3 freq
singsong (6) - 1 freq
tangible (6) - 3 freq
sandrag (6) - 1 freq
sang' (6) - 2 freq
hanging (6) - 9 freq
sneg (6) - 1 freq
SoundEx code - S521
sensible - 33 freq
smoke-blackened - 1 freq
singapore - 9 freq
swingpark - 1 freq
songbirds - 2 freq
semi-quavers - 1 freq
shinsplints - 2 freq
sangbuik - 1 freq
songbook - 1 freq
snackbar - 1 freq
sinsible - 1 freq
songbird - 1 freq
sunspots - 1 freq
snakebake - 2 freq
sangbeuks - 1 freq
sinkfae - 1 freq
snaa-covered - 1 freq
sainsbury - 1 freq
sensibly - 2 freq
sainsbury's - 1 freq
scenesofulster - 1 freq
shanksybraniel - 1 freq
snxvusgkj - 1 freq
simchafisher - 2 freq
swansofficial - 1 freq
sneekyboy - 1 freq
sensibeilities - 1 freq
sangobeg - 1 freq
snsbi - 1 freq
smqscjpxza - 2 freq
sonzyb - 2 freq
MetaPhone code - SNKBK
sangbuik - 1 freq
songbook - 1 freq
snakebake - 2 freq
sangobeg - 1 freq
SANGOBEG
Time to execute Levenshtein function - 0.226337 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.419654 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030274 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038876 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000846 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.