A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to josie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
josie (0) - 18 freq
bosie (1) - 60 freq
jobie (1) - 1 freq
cosie (1) - 22 freq
rosie (1) - 73 freq
josiej (1) - 1 freq
jodie (1) - 1 freq
posie (1) - 1 freq
nosie (1) - 1 freq
oosie (1) - 3 freq
tosie (1) - 4 freq
joss (2) - 2 freq
dotie (2) - 1 freq
dose (2) - 45 freq
jove (2) - 5 freq
aesie (2) - 2 freq
housie (2) - 1 freq
joe (2) - 105 freq
€œjosie (2) - 1 freq
joggie (2) - 2 freq
jassie (2) - 3 freq
casie (2) - 3 freq
foie (2) - 4 freq
fowsie (2) - 2 freq
sowie (2) - 1 freq
josie (0) - 18 freq
oosie (2) - 3 freq
juse (2) - 1 freq
tosie (2) - 4 freq
jise (2) - 1 freq
josu (2) - 1 freq
posie (2) - 1 freq
nosie (2) - 1 freq
jodie (2) - 1 freq
bosie (2) - 60 freq
cosie (2) - 22 freq
jobie (2) - 1 freq
rosie (2) - 73 freq
josiej (2) - 1 freq
ojos (3) - 1 freq
oossie (3) - 1 freq
susie (3) - 38 freq
absie (3) - 1 freq
jovi (3) - 1 freq
assie (3) - 1 freq
julie (3) - 29 freq
'sie (3) - 1 freq
hoosie (3) - 71 freq
poosie (3) - 4 freq
moysie (3) - 1 freq
SoundEx code - J200
jaggy - 21 freq
jouk - 62 freq
jig - 26 freq
jock - 511 freq
jis - 36 freq
jaws - 48 freq
joke - 140 freq
jeez - 10 freq
jug - 26 freq
joys - 32 freq
jakey - 5 freq
jouks - 10 freq
joco - 37 freq
'joco - 1 freq
'joco' - 1 freq
joyce - 138 freq
jackie - 38 freq
josie - 18 freq
jock's - 35 freq
jocks - 5 freq
jack - 224 freq
jigsaw - 5 freq
joog - 19 freq
jook - 24 freq
'jeezo - 1 freq
juice - 68 freq
joug - 8 freq
jeszcze - 1 freq
jaggie - 8 freq
juicy - 15 freq
joggie - 2 freq
jogs - 2 freq
jockie - 28 freq
jessie - 258 freq
jougs - 9 freq
joyous - 10 freq
jeuk - 1 freq
jag - 26 freq
jags - 9 freq
jews - 23 freq
jockie' - 1 freq
jees - 3 freq
jeesy - 2 freq
'jock's - 2 freq
joss - 2 freq
jooks - 14 freq
'jees - 1 freq
jake - 71 freq
jess - 21 freq
jigs - 8 freq
jows - 2 freq
jockey - 3 freq
jazz - 11 freq
joug's - 3 freq
jog - 7 freq
joy's - 1 freq
jeegs - 1 freq
jewish - 11 freq
jew's - 2 freq
joak's - 1 freq
jus - 39 freq
jaas - 15 freq
joes - 4 freq
juggs - 2 freq
joackey - 3 freq
jug's - 1 freq
jugs - 4 freq
jucks - 1 freq
josh' - 1 freq
jegs - 1 freq
jeggie - 1 freq
jassie - 3 freq
joycie - 6 freq
jocks' - 3 freq
jeck - 16 freq
jike - 5 freq
jeezo - 15 freq
jesse - 7 freq
josiah - 4 freq
juk - 2 freq
joshua - 13 freq
joukie - 2 freq
jek' - 1 freq
jeeeez - 1 freq
'jackie - 1 freq
jak - 1 freq
jack's - 6 freq
jeeg - 1 freq
jowes - 5 freq
jacksie - 3 freq
joe's - 4 freq
joey's - 3 freq
jeuce - 1 freq
jacks - 10 freq
jaikee - 1 freq
jaikey - 1 freq
jise - 1 freq
juse - 1 freq
jiggy - 9 freq
jocky - 25 freq
joogs - 3 freq
juwish - 1 freq
'jack' - 1 freq
jeg - 2 freq
jess's - 3 freq
josh - 3 freq
'jock - 2 freq
'jig - 1 freq
jaw's - 1 freq
jeyous - 1 freq
jaikie' - 1 freq
jesssie' - 1 freq
joks - 1 freq
jaks - 1 freq
jauss - 7 freq
jhesu - 4 freq
juke - 7 freq
€˜jeezo - 1 freq
jesu - 2 freq
jacquie - 1 freq
josu - 1 freq
'jeezo' - 1 freq
jagg - 1 freq
€œjosie - 1 freq
jeeso - 1 freq
€œjackie - 1 freq
€œjessie - 2 freq
jaiks - 1 freq
joakey - 1 freq
jazza - 2 freq
jjeezo - 1 freq
€œjock - 1 freq
jeyes - 1 freq
jhx - 1 freq
jysu - 1 freq
joeq - 1 freq
jwahjwah - 1 freq
jqexqo - 1 freq
jizz - 1 freq
jix - 1 freq
jhs - 2 freq
jic - 1 freq
'jag' - 1 freq
jeezoo - 1 freq
jjkyg - 1 freq
joos - 6 freq
jock' - 1 freq
jacqui - 1 freq
jac - 3 freq
jik - 1 freq
jwaca - 1 freq
jyhz - 1 freq
joak - 1 freq
jghugu - 1 freq
jojo - 2 freq
joke' - 1 freq
jas - 1 freq
jjgass - 1 freq
jok - 1 freq
jhg - 1 freq
jek - 1 freq
jhsh - 1 freq
jozx - 1 freq
jaigw - 1 freq
jyz - 1 freq
jacky - 1 freq
jxjes - 1 freq
MetaPhone code - JS
gies - 501 freq
jis - 36 freq
jaws - 48 freq
jeez - 10 freq
gie's - 76 freq
joys - 32 freq
geese - 42 freq
joyce - 138 freq
josie - 18 freq
'jeezo - 1 freq
juice - 68 freq
juicy - 15 freq
gees - 41 freq
'gees - 1 freq
'gie's - 8 freq
jessie - 258 freq
gizz - 19 freq
giez - 1 freq
jews - 23 freq
jees - 3 freq
jeesy - 2 freq
joss - 2 freq
'gies - 2 freq
'jees - 1 freq
gis - 1 freq
jess - 21 freq
jows - 2 freq
jazz - 11 freq
joy's - 1 freq
geis - 8 freq
jew's - 2 freq
jus - 39 freq
jaas - 15 freq
joes - 4 freq
geez - 20 freq
giza - 1 freq
jassie - 3 freq
joycie - 6 freq
giess - 4 freq
js - 5 freq
giz - 1 freq
jeezo - 15 freq
jesse - 7 freq
jeeeez - 1 freq
joe's - 4 freq
joey's - 3 freq
jeuce - 1 freq
jise - 1 freq
juse - 1 freq
gess - 1 freq
giy's - 1 freq
jaw's - 1 freq
hgis - 1 freq
jesssie' - 1 freq
geise - 1 freq
ges - 1 freq
jauss - 7 freq
geyse - 1 freq
€œgies - 2 freq
€˜jeezo - 1 freq
jesu - 2 freq
€˜gies - 3 freq
josu - 1 freq
'jeezo' - 1 freq
€œjosie - 1 freq
jeeso - 1 freq
€œjessie - 2 freq
jazza - 2 freq
jjeezo - 1 freq
€™gies - 1 freq
geos - 2 freq
jysu - 1 freq
wjz - 2 freq
jjs - 52 freq
jizz - 1 freq
jhs - 2 freq
yjsu - 1 freq
jeezoo - 1 freq
joos - 6 freq
j's - 2 freq
“geez - 1 freq
jyhz - 1 freq
gieÂ’s - 1 freq
gyz - 2 freq
jas - 1 freq
giese - 2 freq
jz - 1 freq
jyz - 1 freq
yjeizaoa - 1 freq
JOSIE
Time to execute Levenshtein function - 0.190371 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.341787 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028326 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037375 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000903 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.