A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to galston in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
galston (0) - 2 freq
galston' (1) - 1 freq
gawston (1) - 25 freq
gakstoun (2) - 1 freq
dalton (2) - 1 freq
gascon (2) - 2 freq
gawstonl (2) - 1 freq
aston (2) - 4 freq
bilston (2) - 14 freq
gallon (2) - 12 freq
garson (2) - 2 freq
gawstoun (2) - 3 freq
calton (2) - 10 freq
alson (2) - 1 freq
easton (2) - 3 freq
maisson (3) - 4 freq
pallion (3) - 1 freq
gallta (3) - 1 freq
wilton (3) - 1 freq
castin (3) - 48 freq
manson (3) - 14 freq
haldon (3) - 1 freq
grafton (3) - 1 freq
hallon (3) - 1 freq
fulton (3) - 5 freq
galston (0) - 2 freq
gawston (2) - 25 freq
galston' (2) - 1 freq
gawstoun (3) - 3 freq
bilston (3) - 14 freq
gakstoun (3) - 1 freq
lastin (4) - 9 freq
giltin (4) - 2 freq
glasto (4) - 1 freq
gemstone (4) - 1 freq
gladstone (4) - 2 freq
ogston (4) - 3 freq
gustin (4) - 1 freq
kliston (4) - 5 freq
hailstone (4) - 1 freq
glastos (4) - 1 freq
gluten (4) - 1 freq
glutton (4) - 5 freq
calton (4) - 10 freq
aston (4) - 4 freq
garson (4) - 2 freq
gallon (4) - 12 freq
alson (4) - 1 freq
gawstonl (4) - 1 freq
dalton (4) - 1 freq
SoundEx code - G423
glaikit - 144 freq
gleg-witted - 1 freq
glisked - 10 freq
glazed - 15 freq
glister - 6 freq
glistenin' - 1 freq
gqlaikit - 1 freq
glaiket - 9 freq
glisteran - 3 freq
glossed - 2 freq
glaikitness - 5 freq
glakit - 4 freq
glaister - 2 freq
galston - 2 freq
galston' - 1 freq
gleekt - 2 freq
glisterin - 29 freq
glistenin - 5 freq
gliskit - 7 freq
glowstick - 1 freq
glistered - 3 freq
glousterin - 1 freq
glist - 1 freq
glogged - 3 freq
glistened - 3 freq
glugged - 2 freq
glazkit - 2 freq
glaikitly - 1 freq
gleckit - 1 freq
glaikid - 1 freq
galactic - 6 freq
glaissie-eed - 1 freq
gleckit-leukin - 1 freq
glisters - 4 freq
glastos - 1 freq
glekkid - 1 freq
gleg-wittit - 1 freq
glaikit-like - 1 freq
glackte - 1 freq
glaikit-' - 1 freq
gallowgate - 6 freq
glesgied - 1 freq
'glesgied' - 1 freq
glosst - 1 freq
glekit - 4 freq
glistens - 2 freq
glaikit's - 1 freq
gollached - 2 freq
glekkit - 2 freq
gluggit - 1 freq
glaiket-leukin - 1 freq
glaickit - 2 freq
glig-eed - 1 freq
glessheids - 8 freq
glessheid - 1 freq
glaistigs - 1 freq
glistening - 1 freq
€œglaikit - 1 freq
glistery - 1 freq
glastonbury - 4 freq
glaikit-lukkin - 1 freq
glasto - 1 freq
gaylestephen - 2 freq
glecket - 1 freq
glisters' - 2 freq
MetaPhone code - KLSTN
clouston - 4 freq
galston - 2 freq
galston' - 1 freq
collieston - 1 freq
kliston - 5 freq
GALSTON
Time to execute Levenshtein function - 0.383214 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.789377 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027440 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.084235 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000781 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.