A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to gaylestephen in Corpus

Results by method, in order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
gaylestephen (0) - 2 freq
gravesteen (5) - 1 freq
yestereen (5) - 1 freq
hailsteen (5) - 1 freq
stephen (5) - 37 freq
gillespie (6) - 5 freq
gatekeeper (6) - 1 freq
pesteren (6) - 1 freq
ayegreen (6) - 3 freq
greeteen (6) - 1 freq
mullsteen (6) - 1 freq
gaurenteed (6) - 1 freq
palestyne (6) - 1 freq
glistened (6) - 3 freq
dial-steen (6) - 1 freq
galston (6) - 2 freq
gleswegian (6) - 13 freq
guylejeune (6) - 3 freq
naythen (6) - 2 freq
glisteran (6) - 3 freq
gravesteens (6) - 4 freq
milestene (6) - 1 freq
galevitchin (6) - 1 freq
apostophe (6) - 1 freq
gamekeeper (6) - 3 freq
Double Levenshtein (distance in brackets)
gaylestephen (0) - 2 freq
stephen (7) - 37 freq
glisterin (8) - 29 freq
glisteran (8) - 3 freq
galston (8) - 2 freq
glistenin (8) - 5 freq
hailsteen (8) - 1 freq
galevitchin (8) - 1 freq
milestene (9) - 1 freq
palestine (9) - 4 freq
ayelestan (9) - 4 freq
ulstermen (9) - 5 freq
glousterin (9) - 1 freq
gestern (9) - 1 freq
ayelestin (9) - 1 freq
apostophe (9) - 1 freq
gleswegian (9) - 13 freq
gleeterin (9) - 1 freq
stephane (9) - 1 freq
yestereen (9) - 1 freq
gravesteen (9) - 1 freq
valeleithen (9) - 1 freq
glistered (9) - 3 freq
glancethen (9) - 1 freq
glistened (9) - 3 freq
SoundEx code - G423
glaikit - 144 freq
gleg-witted - 1 freq
glisked - 10 freq
glazed - 15 freq
glister - 6 freq
glistenin' - 1 freq
gqlaikit - 1 freq
glaiket - 9 freq
glisteran - 3 freq
glossed - 2 freq
glaikitness - 5 freq
glakit - 4 freq
glaister - 2 freq
galston - 2 freq
galston' - 1 freq
gleekt - 2 freq
glisterin - 29 freq
glistenin - 5 freq
gliskit - 7 freq
glowstick - 1 freq
glistered - 3 freq
glousterin - 1 freq
glist - 1 freq
glogged - 3 freq
glistened - 3 freq
glugged - 2 freq
glazkit - 2 freq
glaikitly - 1 freq
gleckit - 1 freq
glaikid - 1 freq
galactic - 6 freq
glaissie-eed - 1 freq
gleckit-leukin - 1 freq
glisters - 4 freq
glastos - 1 freq
glekkid - 1 freq
gleg-wittit - 1 freq
glaikit-like - 1 freq
glackte - 1 freq
glaikit-' - 1 freq
gallowgate - 6 freq
glesgied - 1 freq
'glesgied' - 1 freq
glosst - 1 freq
glekit - 4 freq
glistens - 2 freq
glaikit's - 1 freq
gollached - 2 freq
glekkit - 2 freq
gluggit - 1 freq
glaiket-leukin - 1 freq
glaickit - 2 freq
glig-eed - 1 freq
glessheids - 8 freq
glessheid - 1 freq
glaistigs - 1 freq
glistening - 1 freq
“glaikit - 1 freq
glistery - 1 freq
glastonbury - 4 freq
glaikit-lukkin - 1 freq
glasto - 1 freq
gaylestephen - 2 freq
glecket - 1 freq
glisters' - 2 freq
MetaPhone code - KLSTFN
gaylestephen - 2 freq
Manually curated - GAYLESTEPHEN
Time to execute Levenshtein function - 0.210414 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
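As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` and the unit cost for every edit are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with a cost of 1 for each insert, delete or replace."""
    # previous[j] is the distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("gaylestephen", "stephen"))  # 5
```

The result agrees with the listing above, where stephen appears at distance 5: deleting the five letters of "gayle" leaves "stephen".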
Time to execute Double Levenshtein function - 0.363674 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
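The idea can be sketched in Python as follows. This is a hedged reconstruction, not the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating y as a vowel is an assumption, chosen because it reproduces the scores in the listing above.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with unit costs."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with every vowel removed."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("gaylestephen", "stephen"))  # 7
```

Here the full-word distance is 5 and the consonant skeletons "glstphn" and "stphn" differ by 2, giving the 7 shown for stephen in the listing.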
Time to execute SoundEx function - 0.027594 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
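A minimal Python sketch of the usual American Soundex rules shows how the code G423 above comes about. It is an illustration only; the site's own implementation may differ in edge cases, and the function name `soundex` is an assumption.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit:
            if digit != last:  # adjacent letters with the same digit count once
                code += digit
            last = digit
        elif ch not in "hw":
            last = ""          # a vowel separates repeats; h and w do not
        if len(code) == 4:
            break
    return code.ljust(4, "0")  # pad short codes with zeros


print(soundex("gaylestephen"))  # G423
print(soundex("glaikit"))       # G423
```

Because only the first letter and the first three coded consonants survive, gaylestephen and glaikit collapse to the same code, which explains why the SoundEx list is dominated by gl- words.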
Time to execute MetaPhone function - 0.037787 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, so it does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000872 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
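In code, a lookup of this kind amounts to a dictionary from surface forms to lemmas. The Python sketch below is purely illustrative: the entries in `LEXICON` and the name `lookup_lemma` are hypothetical and are not taken from the site's actual table.

```python
from typing import Optional

# Hypothetical entries for illustration; the real lexicon is curated by hand.
LEXICON = {
    "glaiket": "GLAIKIT",
    "glakit": "GLAIKIT",
    "glaickit": "GLAIKIT",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Glakit"))  # GLAIKIT
print(lookup_lemma("sonsie"))  # None
```

A dictionary lookup involves no distance computation, which is consistent with this method having the shortest execution time reported above.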