A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to soart in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
soart (0) - 47 freq
soat (1) - 1 freq
swart (1) - 2 freq
soapt (1) - 1 freq
soary (1) - 3 freq
soort (1) - 2 freq
start (1) - 555 freq
shoart (1) - 41 freq
soar (1) - 10 freq
spoart (1) - 7 freq
roart (1) - 74 freq
soaft (1) - 2 freq
skart (1) - 2 freq
soarts (1) - 12 freq
soars (1) - 5 freq
sort (1) - 305 freq
foart (1) - 1 freq
scart (1) - 23 freq
smart (1) - 36 freq
soaket (2) - 4 freq
board (2) - 150 freq
sport (2) - 153 freq
slaet (2) - 12 freq
fart (2) - 27 freq
swait (2) - 4 freq
soart (0) - 47 freq
soort (1) - 2 freq
sort (1) - 305 freq
smart (2) - 36 freq
scart (2) - 23 freq
foart (2) - 1 freq
sert (2) - 2 freq
soartae (2) - 14 freq
sairt (2) - 2 freq
sorti (2) - 2 freq
sorta (2) - 4 freq
soars (2) - 5 freq
seairt (2) - 1 freq
sorto (2) - 1 freq
start (2) - 555 freq
shoart (2) - 41 freq
soarts (2) - 12 freq
soapt (2) - 1 freq
soat (2) - 1 freq
swart (2) - 2 freq
soar (2) - 10 freq
soary (2) - 3 freq
skart (2) - 2 freq
roart (2) - 74 freq
spoart (2) - 7 freq
SoundEx code - S630
shroud - 10 freq
sweirt - 37 freq
sort - 305 freq
soared - 6 freq
scared - 44 freq
screed - 31 freq
skreid - 18 freq
scairt - 6 freq
sortae - 18 freq
seurrit - 1 freq
seairt - 1 freq
soart - 47 freq
short - 319 freq
sreat - 1 freq
skirt - 56 freq
shared - 86 freq
skairt - 1 freq
skywart - 1 freq
seawart - 4 freq
sheared - 6 freq
soartae - 14 freq
skeert - 1 freq
shirt - 72 freq
scart - 23 freq
scrowed - 2 freq
swuird - 14 freq
skyrit - 1 freq
swaird - 3 freq
shard - 2 freq
scurrit - 2 freq
sert - 2 freq
screwed - 30 freq
scarred - 15 freq
sword - 107 freq
shirty - 1 freq
squared - 6 freq
scirt - 1 freq
scrat - 31 freq
skyward - 4 freq
sweerty - 1 freq
scoured - 5 freq
soured - 1 freq
soored - 1 freq
swarthy - 1 freq
shrood - 5 freq
scaured - 4 freq
scourit - 2 freq
sirit - 1 freq
seawaird - 1 freq
shooert - 2 freq
squarrt - 1 freq
squirt - 4 freq
scoort - 2 freq
soort - 2 freq
swoard - 2 freq
swurd - 5 freq
shoart - 41 freq
shired - 5 freq
scratty - 4 freq
scoored - 3 freq
screwit - 2 freq
shred - 1 freq
sort' - 1 freq
scaredy - 1 freq
sard - 3 freq
scurried - 5 freq
shrewd - 2 freq
scored - 41 freq
scort - 1 freq
shird - 1 freq
serred - 9 freq
saired - 7 freq
shore-heid - 1 freq
sorta - 4 freq
'sort - 3 freq
'shorty' - 1 freq
swoord - 4 freq
sairt - 2 freq
shoured - 1 freq
skrit - 6 freq
skurt - 4 freq
scrit - 6 freq
sweert - 1 freq
swart - 2 freq
soar'd - 1 freq
schort - 5 freq
scar'd - 1 freq
skreed - 1 freq
seared - 2 freq
skord - 1 freq
shaired - 3 freq
swoert - 1 freq
sawrd - 1 freq
skart - 2 freq
scaird - 1 freq
swerd - 1 freq
skurried - 1 freq
seawird - 1 freq
shoard - 4 freq
skrythe - 1 freq
sair't - 1 freq
screid - 8 freq
scaurt - 2 freq
suretie - 2 freq
souerte - 2 freq
souertie - 1 freq
skared - 1 freq
skaired - 6 freq
'sweirtie - 1 freq
'soared - 1 freq
skyward' - 1 freq
seward - 2 freq
sorti - 2 freq
sired - 2 freq
schorit - 1 freq
sorrit - 4 freq
sweirit - 1 freq
sharet - 3 freq
serried - 1 freq
sortie - 2 freq
serd - 1 freq
sweirtie - 2 freq
skewered - 2 freq
scurred - 2 freq
shooered - 3 freq
€˜short - 1 freq
shurt - 1 freq
skyred - 1 freq
shaerd - 2 freq
“saired” - 1 freq
sored - 1 freq
sorto - 1 freq
shired' - 1 freq
saraita - 1 freq
scored- - 1 freq
scrote - 1 freq
sortd - 2 freq
sweared - 1 freq
MetaPhone code - SRT
sort - 305 freq
soared - 6 freq
sortae - 18 freq
seurrit - 1 freq
seairt - 1 freq
soart - 47 freq
sreat - 1 freq
soartae - 14 freq
sert - 2 freq
scirt - 1 freq
soured - 1 freq
soored - 1 freq
sirit - 1 freq
soort - 2 freq
cert - 1 freq
sort' - 1 freq
sard - 3 freq
serred - 9 freq
saired - 7 freq
sorta - 4 freq
'sort - 3 freq
sairt - 2 freq
soar'd - 1 freq
seared - 2 freq
cerried - 3 freq
certy - 3 freq
sawrd - 1 freq
certie - 4 freq
sair't - 1 freq
suretie - 2 freq
souerte - 2 freq
souertie - 1 freq
'soared - 1 freq
sorti - 2 freq
sired - 2 freq
hsard - 1 freq
sorrit - 4 freq
serried - 1 freq
sortie - 2 freq
serd - 1 freq
cerry-oot - 1 freq
cerd - 6 freq
“saired” - 1 freq
sored - 1 freq
sorto - 1 freq
saraita - 1 freq
SOART
Time to execute Levenshtein function - 0.248767 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.531082 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027727 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036892 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000912 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.