A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to versity in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
versity (0) - 11 freq
versify (1) - 1 freq
varsity (1) - 9 freq
verity (1) - 2 freq
veesits (2) - 15 freq
density (2) - 4 freq
diversity (2) - 73 freq
version (2) - 143 freq
versie (2) - 1 freq
'varsity (2) - 1 freq
veisits (2) - 8 freq
verify (2) - 2 freq
adversity (2) - 7 freq
veesit (2) - 93 freq
verily (2) - 2 freq
veisit (2) - 25 freq
veracity (2) - 2 freq
versy (2) - 1 freq
reosity (2) - 1 freq
versions (3) - 36 freq
verdict (3) - 30 freq
heisit (3) - 1 freq
obesity (3) - 8 freq
vert (3) - 1 freq
fessit (3) - 2 freq
versity (0) - 11 freq
varsity (1) - 9 freq
versify (2) - 1 freq
verity (2) - 2 freq
veisit (3) - 25 freq
veracity (3) - 2 freq
adversity (3) - 7 freq
reosity (3) - 1 freq
versy (3) - 1 freq
veesit (3) - 93 freq
version (3) - 143 freq
diversity (3) - 73 freq
versie (3) - 1 freq
varsitie (3) - 3 freq
'varsity (3) - 1 freq
varrit (4) - 1 freq
verse' (4) - 1 freq
veritie (4) - 3 freq
vertie (4) - 1 freq
vesta (4) - 2 freq
tursit (4) - 1 freq
overbite (4) - 1 freq
worsit (4) - 11 freq
persuit (4) - 1 freq
kirsty (4) - 159 freq
SoundEx code - V623
varsity - 9 freq
variegated - 1 freq
virused - 1 freq
vrocht - 35 freq
vrockt - 1 freq
versities - 4 freq
versity - 11 freq
varsity's - 2 freq
versatile - 4 freq
versed - 4 freq
vricht - 6 freq
verstummen - 1 freq
varsitie - 3 freq
vrichts - 1 freq
'varsity - 1 freq
varstled - 1 freq
versatility - 2 freq
vrochtin - 2 freq
voyeuristic - 1 freq
veracity - 2 freq
varged - 1 freq
vrsdin - 1 freq
MetaPhone code - FRST
first - 2456 freq
forced - 72 freq
frost - 139 freq
fireside - 21 freq
fraised - 3 freq
foarest - 1 freq
furst - 357 freq
ferst - 29 freq
'first - 4 freq
forest - 117 freq
fraized - 4 freq
varsity - 9 freq
frst - 3 freq
fuirsday - 5 freq
forstaw - 3 freq
virused - 1 freq
frost' - 1 freq
forcit - 2 freq
frostie - 1 freq
feirst - 8 freq
frosty - 43 freq
fairest - 7 freq
froast - 1 freq
'furst - 1 freq
firrst - 49 freq
frast - 1 freq
forestaa - 3 freq
wfirst - 2 freq
versity - 11 freq
'first' - 1 freq
forssit - 1 freq
freistie - 1 freq
freist - 3 freq
forrsit - 1 freq
frizzied - 1 freq
freest - 1 freq
versed - 4 freq
faarest - 1 freq
forcet - 1 freq
feerst - 1 freq
forest' - 1 freq
varsitie - 3 freq
€˜first - 3 freq
forrest - 5 freq
forst - 1 freq
fiurst - 1 freq
'varsity - 1 freq
€œfirst - 5 freq
freezed - 1 freq
ferrest - 1 freq
ferest - 1 freq
phrased - 1 freq
froasty - 1 freq
fierce-eed - 1 freq
veracity - 2 freq
fursday - 1 freq
VERSITY
Time to execute Levenshtein function - 0.214798 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.354623 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034111 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042621 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001149 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.