A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to saegates in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein - distance in brackets
saegates (0) - 1 freq
negates (2) - 1 freq
awegates (2) - 1 freq
sumgates (2) - 1 freq
agates (2) - 1 freq
socrates (3) - 3 freq
gates (3) - 102 freq
segas (3) - 1 freq
somegate (3) - 1 freq
saaces (3) - 1 freq
staetes (3) - 1 freq
speats (3) - 1 freq
seated (3) - 11 freq
saets (3) - 29 freq
sweats (3) - 1 freq
senate (3) - 8 freq
saats (3) - 1 freq
midgates (3) - 1 freq
sausages (3) - 25 freq
radiates (3) - 3 freq
sedate (3) - 1 freq
sages (3) - 1 freq
seater (3) - 1 freq
seat's (3) - 3 freq
delegates (3) - 3 freq
Double Levenshtein - summed distance in brackets
saegates (0) - 1 freq
sumgates (3) - 1 freq
agates (3) - 1 freq
awegates (3) - 1 freq
negates (3) - 1 freq
seggies (4) - 1 freq
gaetes (4) - 1 freq
states (4) - 79 freq
onygates (4) - 4 freq
senatus (4) - 2 freq
naegaits (4) - 5 freq
seats (4) - 76 freq
settes (4) - 1 freq
situates (4) - 1 freq
slates (4) - 53 freq
sgots (4) - 1 freq
spates (4) - 2 freq
skates (4) - 6 freq
sagas (4) - 3 freq
sages (4) - 1 freq
oniegates (4) - 15 freq
sates (4) - 9 freq
saats (4) - 1 freq
speats (4) - 1 freq
staetes (4) - 1 freq
SoundEx code - S232
sockets - 8 freq
sichts - 50 freq
saxties - 4 freq
societies - 14 freq
sassidges - 1 freq
sixties - 13 freq
'sax-days - 1 freq
sixty-six - 5 freq
sixtie-sein - 1 freq
sights - 6 freq
sixty-eight - 1 freq
saxty-somethin - 2 freq
sightsman's - 1 freq
sassidge - 6 freq
succeeds - 1 freq
saxty-sax - 1 freq
shghtest - 1 freq
saegates - 1 freq
sects - 2 freq
sichtseers - 1 freq
sightseers - 1 freq
sightsee - 1 freq
siestas - 1 freq
skectches - 1 freq
saasidges - 1 freq
sossidge - 1 freq
MetaPhone code - SKTS
scots - 6761 freq
skates - 6 freq
skites - 15 freq
sockets - 8 freq
scuddies - 1 freq
scouts - 3 freq
skuds - 1 freq
'scots' - 3 freq
scottis - 52 freq
scot's - 2 freq
scott's - 15 freq
scots' - 16 freq
sgots - 1 freq
squaddies - 1 freq
'scots - 15 freq
skate's - 1 freq
skots - 1 freq
scuds - 2 freq
scotts - 11 freq
squid's - 9 freq
skuddies - 1 freq
skoits - 1 freq
scotties - 1 freq
scotus - 1 freq
skytes - 1 freq
scoots - 2 freq
skuts - 1 freq
scots - 11 freq
skauds - 1 freq
cicadas - 1 freq
scots - 16 freq
saegates - 1 freq
sects - 2 freq
scottis - 1 freq
scoats - 1 freq
skids - 3 freq
scotts - 1 freq
scots - 1 freq
zkots - 4 freq
squads - 1 freq
scots - 1 freq
scottis - 1 freq
scatts - 1 freq
scots- - 1 freq
‘scots’ - 1 freq
skooties - 1 freq
skootys - 1 freq
Manually curated - SAEGATES
Time to execute Levenshtein function - 0.186758 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
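As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("saegates", "negates"))
print(levenshtein("saegates", "gates"))
```

Under this definition, saegates to negates takes two edits (replace s with n, delete a) and saegates to gates takes three deletions, which agrees with the distances listed for those words.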
Time to execute Double Levenshtein function - 0.371347 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
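The doubled calculation might be sketched in Python as follows. This is an illustration under assumptions, not the page's own code: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set (a, e, i, o, u, y) is a guess that happens to reproduce the distances listed for sumgates, states and onygates.

```python
VOWELS = set("aeiouy")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("saegates", "sumgates"))
print(double_levenshtein("saegates", "states"))
```

Because a consonant change shows up in both passes while a vowel change shows up only in the first, words that share a consonant skeleton rank closer than plain Levenshtein would place them.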
Time to execute SoundEx function - 0.027525 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
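For readers who want to see how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is an illustration only; the page's own implementation may differ in edge cases, and the function name `soundex` is chosen here.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in group:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")  # digit of the most recent coded letter
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit:
            if digit != last:  # adjacent letters with the same digit count once
                code += digit
            last = digit
        elif ch not in "hw":
            last = ""  # a vowel separates repeats; h and w do not
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(soundex("saegates"))
print(soundex("sichts"))
```

Both calls give S232, the code shown in the SoundEx list, which is why saegates and sichts are grouped together there despite looking quite different.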
Time to execute MetaPhone function - 0.038561 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, so it does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000889 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
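A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical examples made up for illustration, not rows from the corpus's real lexicon, and the names `LEMMA_TABLE` and `curated_lemma` are likewise invented here.

```python
from typing import Optional

# Hypothetical entries: each spelling variant or typo maps to one lemma.
LEMMA_TABLE = {
    "sassidge": "sausage",
    "sassidges": "sausage",
    "saasidges": "sausage",
    "sossidge": "sausage",
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the table has no entry."""
    return LEMMA_TABLE.get(word.lower())


print(curated_lemma("Sassidge"))
print(curated_lemma("saegates"))
```

A dictionary lookup does no distance calculation at all, which fits with this method having the shortest execution time listed above. The trade-off is coverage: a word missing from the table simply returns nothing.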