A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sweetcorn in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sweetcorn (0) - 3 freq
sweetchan (2) - 2 freq
sweethert (3) - 8 freq
sweetin (3) - 3 freq
sweeter (3) - 7 freq
seed-corn (3) - 1 freq
sweeten (3) - 3 freq
seedcoarn (3) - 1 freq
sweechan (3) - 3 freq
seaton (4) - 3 freq
sweetheart (4) - 11 freq
owerturn (4) - 2 freq
sweetens (4) - 1 freq
weetin (4) - 2 freq
weeackin (4) - 2 freq
sweetness (4) - 17 freq
sweetly (4) - 5 freq
seecond (4) - 1 freq
sweein (4) - 4 freq
benthorn (4) - 1 freq
'weelcome (4) - 1 freq
stretchan (4) - 3 freq
stretchin (4) - 17 freq
sweengan (4) - 2 freq
sectioun (4) - 7 freq
sweetcorn (0) - 3 freq
sweetchan (3) - 2 freq
switchin (5) - 9 freq
swatchin (5) - 1 freq
seedcoarn (5) - 1 freq
swutchin (5) - 1 freq
sweechan (5) - 3 freq
sweetin (5) - 3 freq
sweethert (5) - 8 freq
sweeten (5) - 3 freq
sweeter (5) - 7 freq
seed-corn (5) - 1 freq
swelterin (6) - 3 freq
seatchin (6) - 1 freq
sweetener (6) - 1 freq
scorn (6) - 18 freq
skeitchin (6) - 1 freq
seturn (6) - 12 freq
sketchan (6) - 1 freq
swutherin (6) - 3 freq
sweitin (6) - 2 freq
switherin (6) - 27 freq
sweatin (6) - 15 freq
sweirn (6) - 1 freq
sweeren (6) - 1 freq
SoundEx code - S326
stauchered - 3 freq
sodjers - 23 freq
swedgers - 1 freq
sodjer - 7 freq
sodgers - 125 freq
scotscreive - 3 freq
stoker - 4 freq
sodger - 114 freq
stauchert - 6 freq
staggered - 7 freq
sodger's - 7 freq
swedgers' - 1 freq
staggers - 4 freq
sidewhuskers - 1 freq
switzerland - 12 freq
staggert - 4 freq
stagger - 15 freq
staucher - 5 freq
sodgers' - 2 freq
staigger - 1 freq
staggeren - 1 freq
sidecar - 3 freq
sydecar - 1 freq
steeker - 1 freq
staucherin - 2 freq
stauchran - 1 freq
staaker - 1 freq
stickers - 11 freq
schwytzertütsch - 3 freq
staggern - 1 freq
staggerin - 8 freq
stachered - 4 freq
stacherin - 3 freq
scutcher - 1 freq
sticker - 9 freq
sodgerin - 1 freq
stack-yaird - 2 freq
soadjers - 1 freq
scotch-airish - 5 freq
scots-airish - 1 freq
stacher - 3 freq
sudocrem - 2 freq
'scotscreive' - 1 freq
stacheran - 1 freq
stackyaird - 1 freq
stecheran - 1 freq
seedcoarn - 1 freq
stachert - 4 freq
scotcourts - 1 freq
sudgers - 5 freq
sodgiers - 1 freq
sudjers - 2 freq
sweetcorn - 3 freq
stok-whorne - 1 freq
sodger-palmers - 1 freq
sodger-palmer - 1 freq
shot-the-craw - 1 freq
sweatshirt - 4 freq
stockaree - 1 freq
shit-scared - 2 freq
site-search - 1 freq
staggering - 2 freq
seed-corn - 1 freq
schotserib - 1 freq
staceyr - 1 freq
scotsradiomedia - 31 freq
shithousery - 2 freq
scotsscriever - 45 freq
scotiagrannie - 1 freq
scattyscribbler - 4 freq
scotsrenaissance - 1 freq
stjerne - 1 freq
satyagrahalba - 2 freq
scotsquirrels - 4 freq
scotsradio - 1 freq
stgriswalds - 1 freq
steisher - 2 freq
scotswirds - 1 freq
sidecarsplz - 1 freq
scottacraigie - 3 freq
scotscores - 3 freq
scottishwriters - 2 freq
MetaPhone code - SWTKRN
sweetcorn - 3 freq
SWEETCORN
Time to execute Levenshtein function - 0.209344 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.362118 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027580 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037340 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000813 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.