A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to jxixstai in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
jxixstai (0) - 1 freq
jista (3) - 1 freq
existed (4) - 6 freq
jeists (4) - 3 freq
i'stair (4) - 4 freq
jiist (4) - 1 freq
vistas (4) - 3 freq
exists (4) - 45 freq
juist (4) - 1764 freq
justa (4) - 1 freq
distain (4) - 1 freq
existit (4) - 18 freq
instaw (4) - 1 freq
maistli (4) - 4 freq
jaist (4) - 53 freq
kirstal (4) - 2 freq
mistak (4) - 41 freq
jeist (4) - 1 freq
tristan (4) - 15 freq
krista (4) - 4 freq
fiesta (4) - 3 freq
jist (4) - 6754 freq
alistair (4) - 22 freq
iniesta (4) - 1 freq
exist (4) - 76 freq
jxixstai (0) - 1 freq
jista (5) - 1 freq
jaist (6) - 53 freq
exist (6) - 76 freq
jist (6) - 6754 freq
jeist (6) - 1 freq
juist (6) - 1764 freq
justa (6) - 1 freq
jiist (6) - 1 freq
jöst (7) - 30 freq
sexiest (7) - 4 freq
jeffsto (7) - 1 freq
jeest (7) - 38 freq
jyuist (7) - 1 freq
jyst (7) - 7 freq
jjuist (7) - 1 freq
xst (7) - 1 freq
jast (7) - 4 freq
jüst (7) - 34 freq
joost (7) - 230 freq
jatest (7) - 1 freq
xxsr (7) - 1 freq
just (7) - 1618 freq
j-just (7) - 1 freq
sexist (7) - 6 freq
SoundEx code - J230
jist - 6754 freq
just - 1618 freq
jaiket - 105 freq
jeest - 38 freq
juist - 1764 freq
jock''d - 1 freq
jockd - 1 freq
joked - 11 freq
jouked - 23 freq
joost - 230 freq
'just - 8 freq
'joost - 5 freq
jooked - 8 freq
'jist - 53 freq
'juist - 16 freq
joukit - 16 freq
jakit - 8 freq
justa - 1 freq
jacket - 29 freq
joukt - 6 freq
juikt - 2 freq
jyuist - 1 freq
jest - 80 freq
jaicket - 72 freq
jacked - 2 freq
jesuit - 1 freq
jock'd - 1 freq
jaggit - 7 freq
jaikit - 16 freq
jakedaw - 1 freq
jagged - 9 freq
juikit - 1 freq
jaickit - 16 freq
jast - 4 freq
jaickct - 1 freq
jaist - 53 freq
joogied - 1 freq
jeust - 35 freq
juke't - 1 freq
jookit - 3 freq
jista - 1 freq
jecket - 10 freq
jaiked - 2 freq
jokit - 8 freq
jjuist - 1 freq
jeckit - 3 freq
j-just - 1 freq
jiggit - 1 freq
jigged - 4 freq
jaskit - 1 freq
jeskit - 1 freq
jogged - 2 freq
jaisket - 1 freq
jeegit - 1 freq
€˜jist - 3 freq
€œjuist - 17 freq
jackdaw - 2 freq
jeist - 1 freq
juste - 3 freq
€œjist - 12 freq
€˜just - 14 freq
jyst - 7 freq
€”jist - 1 freq
jayket - 12 freq
€œjust - 8 freq
€”just - 1 freq
jaycket - 1 freq
jeukit - 2 freq
jiist - 1 freq
€œjiist - 2 freq
€™jist - 4 freq
jxixstai - 1 freq
jistÂ… - 1 freq
'just't' - 1 freq
jeggit - 4 freq
jockscot - 3 freq
jaiket' - 1 freq
jkhcgd - 1 freq
jsizt - 1 freq
‘just - 1 freq
MetaPhone code - JKSKSST
jxixstai - 1 freq
JXIXSTAI
Time to execute Levenshtein function - 0.424911 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.075987 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031705 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.092137 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001015 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.