A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to sograsutherland in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
sograsutherland (0) - 1 freq
juliasutherland (4) - 1 freq
craigsutherland (5) - 8 freq
sutherland (5) - 12 freq
northumberland (6) - 3 freq
southerlaun (6) - 1 freq
southeran (6) - 1 freq
wasteland (7) - 4 freq
gruneland (7) - 2 freq
sunderland (7) - 4 freq
gatheran (7) - 2 freq
frontierland (7) - 1 freq
grassland (7) - 3 freq
gaetheran (7) - 3 freq
gauchalland (7) - 1 freq
doggerland (7) - 2 freq
southerly (7) - 4 freq
sootherlaun (7) - 1 freq
southern (7) - 16 freq
sarasheridan (7) - 11 freq
gaitheran (7) - 2 freq
soothland (7) - 2 freq
switzerland (7) - 12 freq
shetland (8) - 288 freq
gaitherins (8) - 13 freq

Double Levenshtein (summed distance in brackets)
sograsutherland (0) - 1 freq
juliasutherland (7) - 1 freq
sutherland (8) - 12 freq
craigsutherland (8) - 8 freq
southerlaun (10) - 1 freq
grassland (11) - 3 freq
sarasheridan (11) - 11 freq
soothland (11) - 2 freq
switzerland (11) - 12 freq
frontierland (11) - 1 freq
sootherlaun (11) - 1 freq
southeran (11) - 1 freq
northumberland (11) - 3 freq
wasteland (12) - 4 freq
gatheran (12) - 2 freq
angryscotland (12) - 3 freq
shorthaand (12) - 1 freq
gruneland (12) - 2 freq
sooth-laund (12) - 1 freq
gaitheran (12) - 2 freq
gaetheran (12) - 3 freq
sunderland (12) - 4 freq
gauchalland (12) - 1 freq
doggerland (12) - 2 freq
southern (12) - 16 freq
SoundEx code - S262
saucers - 6 freq
sassers - 6 freq
swaggers - 4 freq
seeker's - 2 freq
sacrist - 1 freq
shakers - 2 freq
seekers - 9 freq
secrecy - 4 freq
scissors - 6 freq
sugars - 2 freq
sojers - 54 freq
seizures - 1 freq
sugar-coatit - 1 freq
sex-crazed - 1 freq
skooshers - 1 freq
secures - 1 freq
skagerrak's - 1 freq
segregatit - 3 freq
saasers - 1 freq
soger's - 1 freq
shoogie-horse-flee - 2 freq
shoogie-horse-stang-na - 1 freq
swackeries - 1 freq
saicrecy - 1 freq
sikorsky - 2 freq
sikkars - 1 freq
segregate - 1 freq
segregation - 4 freq
sacrosanct - 1 freq
sugarcandy - 3 freq
soajers - 1 freq
sookers - 2 freq
secrests - 1 freq
sojours - 1 freq
sograsutherland - 1 freq
“segregation” - 1 freq
sugaracre - 1 freq
shugriggie - 1 freq
shaggers - 2 freq
MetaPhone code - SKRS0RLN
SOGRASUTHERLAND
Time to execute Levenshtein function - 0.278685 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
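
As a rough illustration of the distance described above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("sograsutherland", "juliasutherland"))  # 4
print(levenshtein("sograsutherland", "sutherland"))       # 5
```

The two printed values agree with the distances listed for those words in the Levenshtein results above.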
Time to execute Double Levenshtein function - 0.450698 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
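
The idea can be sketched in Python as below. This is an illustrative reconstruction, not the site's implementation: the helper names are invented, and the vowel set (a, e, i, o, u, with y kept as a consonant) is an assumption, since the description does not say which letters count as vowels.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("sograsutherland", "juliasutherland"))  # 7
print(double_levenshtein("sograsutherland", "sutherland"))       # 8
```

Under these assumptions the sketch gives 7 and 8, the same values shown for those two words in the Double Levenshtein results above.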
Time to execute SoundEx function - 0.027881 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
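
For readers who want to see how such a code is built, here is a minimal Python sketch of the common American Soundex rules (keep the first letter, code the following consonants as digits, skip vowels, collapse adjacent duplicate codes, pad to four characters). It is an assumption that the site follows exactly these rules; the function name and the handling of non-letters are choices made for this example.

```python
# Digit assigned to each consonant group; vowels, y, h and w get no digit.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. a letter followed by three digits."""
    # Assumption: anything that is not a-z (hyphens, quotes, apostrophes)
    # is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two identical codes
        code = _CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("sograsutherland"))  # S262
print(soundex("sojers"))           # S262
```

Both words reduce to S262, which is why unrelated-looking words such as sojers appear alongside the search term in the SoundEx results.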
Time to execute MetaPhone function - 0.039873 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001203 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
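
A lookup of this kind can be sketched in Python as a plain dictionary from surface form to lemma. The entries below are invented for illustration and are not taken from the site's actual lexicon; `LEXICON`, `lemma_for` and `curated_variants` are hypothetical names.

```python
from typing import List, Optional

# Hypothetical hand-made entries: surface form -> lemma.
LEXICON = {
    "sojers": "sojer",
    "soajers": "sojer",
    "sojours": "sojer",
    "gaitheran": "gaither",
    "gaetheran": "gaither",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def curated_variants(word: str) -> List[str]:
    """Other surface forms that share the same curated lemma."""
    key = word.lower()
    lemma = lemma_for(key)
    if lemma is None:
        return []  # not every word is in the table
    return sorted(w for w, l in LEXICON.items() if l == lemma and w != key)


print(curated_variants("sojers"))           # ['soajers', 'sojours']
print(curated_variants("sograsutherland"))  # []
```

A dictionary lookup needs no distance computation at all, which fits with this method having the shortest execution time reported above.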