A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to j-just in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
j-just (0) - 1 freq
jjuist (2) - 1 freq
ajust (2) - 2 freq
‘just (2) - 1 freq
adjust (2) - 10 freq
unjust (2) - 4 freq
just (2) - 1618 freq
jeust (2) - 35 freq
injust (2) - 1 freq
'just (2) - 8 freq
ju-jus (2) - 1 freq
€œjust (2) - 8 freq
€”just (2) - 1 freq
€˜just (2) - 14 freq
jaunt (3) - 15 freq
'jist (3) - 53 freq
crust (3) - 18 freq
jest (3) - 80 freq
'trust (3) - 1 freq
jjs (3) - 52 freq
€˜jist (3) - 3 freq
juit (3) - 1 freq
jeest (3) - 38 freq
unjest (3) - 1 freq
oonjust (3) - 1 freq
j-just (0) - 1 freq
jjuist (3) - 1 freq
ju-jus (3) - 1 freq
'just (4) - 8 freq
€˜just (4) - 14 freq
injust (4) - 1 freq
€”just (4) - 1 freq
€œjust (4) - 8 freq
jeust (4) - 35 freq
ajust (4) - 2 freq
‘just (4) - 1 freq
adjust (4) - 10 freq
unjust (4) - 4 freq
just (4) - 1618 freq
jaist (5) - 53 freq
justa (5) - 1 freq
jiist (5) - 1 freq
€œjist (5) - 12 freq
€”jist (5) - 1 freq
€™jist (5) - 4 freq
'jst (5) - 1 freq
jaloust (5) - 3 freq
jesuit (5) - 1 freq
juste (5) - 3 freq
€œjuist (5) - 17 freq
SoundEx code - J230
jist - 6754 freq
just - 1618 freq
jaiket - 105 freq
jeest - 38 freq
juist - 1764 freq
jock''d - 1 freq
jockd - 1 freq
joked - 11 freq
jouked - 23 freq
joost - 230 freq
'just - 8 freq
'joost - 5 freq
jooked - 8 freq
'jist - 53 freq
'juist - 16 freq
joukit - 16 freq
jakit - 8 freq
justa - 1 freq
jacket - 29 freq
joukt - 6 freq
juikt - 2 freq
jyuist - 1 freq
jest - 80 freq
jaicket - 72 freq
jacked - 2 freq
jesuit - 1 freq
jock'd - 1 freq
jaggit - 7 freq
jaikit - 16 freq
jakedaw - 1 freq
jagged - 9 freq
juikit - 1 freq
jaickit - 16 freq
jast - 4 freq
jaickct - 1 freq
jaist - 53 freq
joogied - 1 freq
jeust - 35 freq
juke't - 1 freq
jookit - 3 freq
jista - 1 freq
jecket - 10 freq
jaiked - 2 freq
jokit - 8 freq
jjuist - 1 freq
jeckit - 3 freq
j-just - 1 freq
jiggit - 1 freq
jigged - 4 freq
jaskit - 1 freq
jeskit - 1 freq
jogged - 2 freq
jaisket - 1 freq
jeegit - 1 freq
€˜jist - 3 freq
€œjuist - 17 freq
jackdaw - 2 freq
jeist - 1 freq
juste - 3 freq
€œjist - 12 freq
€˜just - 14 freq
jyst - 7 freq
€”jist - 1 freq
jayket - 12 freq
€œjust - 8 freq
€”just - 1 freq
jaycket - 1 freq
jeukit - 2 freq
jiist - 1 freq
€œjiist - 2 freq
€™jist - 4 freq
jxixstai - 1 freq
jistÂ… - 1 freq
'just't' - 1 freq
jeggit - 4 freq
jockscot - 3 freq
jaiket' - 1 freq
jkhcgd - 1 freq
jsizt - 1 freq
‘just - 1 freq
MetaPhone code - JJST
j-just - 1 freq
J-JUST
Time to execute Levenshtein function - 0.321206 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.592046 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028384 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.072998 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001314 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.