A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to scrat in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
scrat (0) - 31 freq
scrit (1) - 6 freq
scrae (1) - 1 freq
scrap (1) - 33 freq
scram (1) - 1 freq
scran (1) - 83 freq
sara (2) - 15 freq
scrapy (2) - 1 freq
scrim (2) - 1 freq
crakt (2) - 1 freq
shat (2) - 25 freq
scaar (2) - 20 freq
scarab (2) - 1 freq
straet (2) - 1 freq
straw (2) - 27 freq
corat (2) - 1 freq
s'at (2) - 1 freq
crab (2) - 32 freq
coat (2) - 159 freq
crak (2) - 4 freq
strath (2) - 12 freq
sort (2) - 305 freq
scab (2) - 12 freq
craa (2) - 40 freq
scrape (2) - 29 freq
scrat (0) - 31 freq
scrit (1) - 6 freq
sacrit (2) - 3 freq
scirt (2) - 1 freq
scort (2) - 1 freq
secret (2) - 196 freq
scart (2) - 23 freq
scrote (2) - 1 freq
sacret (2) - 1 freq
scrae (2) - 1 freq
scrap (2) - 33 freq
scram (2) - 1 freq
scran (2) - 83 freq
scott (3) - 290 freq
scata (3) - 2 freq
scrabo (3) - 3 freq
script (3) - 30 freq
scrum (3) - 2 freq
scry (3) - 4 freq
scoit (3) - 1 freq
croat (3) - 1 freq
scrift (3) - 2 freq
scant (3) - 34 freq
crate (3) - 6 freq
scrow (3) - 2 freq
SoundEx code - S630
shroud - 10 freq
sweirt - 37 freq
sort - 305 freq
soared - 6 freq
scared - 44 freq
screed - 31 freq
skreid - 18 freq
scairt - 6 freq
sortae - 18 freq
seurrit - 1 freq
seairt - 1 freq
soart - 47 freq
short - 319 freq
sreat - 1 freq
skirt - 56 freq
shared - 86 freq
skairt - 1 freq
skywart - 1 freq
seawart - 4 freq
sheared - 6 freq
soartae - 14 freq
skeert - 1 freq
shirt - 72 freq
scart - 23 freq
scrowed - 2 freq
swuird - 14 freq
skyrit - 1 freq
swaird - 3 freq
shard - 2 freq
scurrit - 2 freq
sert - 2 freq
screwed - 30 freq
scarred - 15 freq
sword - 107 freq
shirty - 1 freq
squared - 6 freq
scirt - 1 freq
scrat - 31 freq
skyward - 4 freq
sweerty - 1 freq
scoured - 5 freq
soured - 1 freq
soored - 1 freq
swarthy - 1 freq
shrood - 5 freq
scaured - 4 freq
scourit - 2 freq
sirit - 1 freq
seawaird - 1 freq
shooert - 2 freq
squarrt - 1 freq
squirt - 4 freq
scoort - 2 freq
soort - 2 freq
swoard - 2 freq
swurd - 5 freq
shoart - 41 freq
shired - 5 freq
scratty - 4 freq
scoored - 3 freq
screwit - 2 freq
shred - 1 freq
sort' - 1 freq
scaredy - 1 freq
sard - 3 freq
scurried - 5 freq
shrewd - 2 freq
scored - 41 freq
scort - 1 freq
shird - 1 freq
serred - 9 freq
saired - 7 freq
shore-heid - 1 freq
sorta - 4 freq
'sort - 3 freq
'shorty' - 1 freq
swoord - 4 freq
sairt - 2 freq
shoured - 1 freq
skrit - 6 freq
skurt - 4 freq
scrit - 6 freq
sweert - 1 freq
swart - 2 freq
soar'd - 1 freq
schort - 5 freq
scar'd - 1 freq
skreed - 1 freq
seared - 2 freq
skord - 1 freq
shaired - 3 freq
swoert - 1 freq
sawrd - 1 freq
skart - 2 freq
scaird - 1 freq
swerd - 1 freq
skurried - 1 freq
seawird - 1 freq
shoard - 4 freq
skrythe - 1 freq
sair't - 1 freq
screid - 8 freq
scaurt - 2 freq
suretie - 2 freq
souerte - 2 freq
souertie - 1 freq
skared - 1 freq
skaired - 6 freq
'sweirtie - 1 freq
'soared - 1 freq
skyward' - 1 freq
seward - 2 freq
sorti - 2 freq
sired - 2 freq
schorit - 1 freq
sorrit - 4 freq
sweirit - 1 freq
sharet - 3 freq
serried - 1 freq
sortie - 2 freq
serd - 1 freq
sweirtie - 2 freq
skewered - 2 freq
scurred - 2 freq
shooered - 3 freq
€˜short - 1 freq
shurt - 1 freq
skyred - 1 freq
shaerd - 2 freq
“saired” - 1 freq
sored - 1 freq
sorto - 1 freq
shired' - 1 freq
saraita - 1 freq
scored- - 1 freq
scrote - 1 freq
sortd - 2 freq
sweared - 1 freq
MetaPhone code - SKRT
secret - 196 freq
scared - 44 freq
screed - 31 freq
skreid - 18 freq
scairt - 6 freq
skirt - 56 freq
skairt - 1 freq
saicret - 37 freq
skeert - 1 freq
cigarette - 21 freq
security - 54 freq
scart - 23 freq
sigurd - 40 freq
skyrit - 1 freq
sacred - 24 freq
scurrit - 2 freq
scarred - 15 freq
sugart - 1 freq
squared - 6 freq
scrat - 31 freq
scoured - 5 freq
saucrit - 6 freq
scaured - 4 freq
scourit - 2 freq
squarrt - 1 freq
sukkert - 1 freq
squirt - 4 freq
scoort - 2 freq
so-cried - 1 freq
scratty - 4 freq
scoored - 3 freq
security' - 1 freq
scaredy - 1 freq
scurried - 5 freq
scored - 41 freq
scort - 1 freq
socried - 1 freq
sacret - 1 freq
secured - 4 freq
saicred - 2 freq
skrit - 6 freq
skurt - 4 freq
scrit - 6 freq
sae-cried - 3 freq
scar'd - 1 freq
skreed - 1 freq
skord - 1 freq
skart - 2 freq
scaird - 1 freq
securit - 2 freq
skurried - 1 freq
screid - 8 freq
seecrit - 1 freq
scaurt - 2 freq
sacrit - 3 freq
securitie - 3 freq
skared - 1 freq
skaired - 6 freq
saecret - 1 freq
ceegarette - 1 freq
sugared - 1 freq
seicret - 3 freq
sikkart - 1 freq
scurred - 2 freq
skyred - 1 freq
scored- - 1 freq
scrote - 1 freq
zqqrd - 1 freq
SCRAT
Time to execute Levenshtein function - 0.163140 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.314606 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030217 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037927 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000804 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.