A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sharks in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sharks (0) - 9 freq
sarks (1) - 17 freq
starks (1) - 1 freq
shanks (1) - 74 freq
shaks (1) - 25 freq
shards (1) - 11 freq
shargs (1) - 1 freq
sparks (1) - 19 freq
shacks (1) - 1 freq
shark (1) - 21 freq
harks (1) - 1 freq
sherks (1) - 2 freq
shares (1) - 27 freq
hanks (2) - 2 freq
sharny (2) - 7 freq
shak (2) - 99 freq
sharp (2) - 85 freq
stares (2) - 41 freq
shaky (2) - 3 freq
skarts (2) - 3 freq
staaks (2) - 1 freq
scarfs (2) - 3 freq
snorks (2) - 1 freq
scarns (2) - 1 freq
thanks (2) - 451 freq
sharks (0) - 9 freq
sherks (1) - 2 freq
shark (2) - 21 freq
shacks (2) - 1 freq
shares (2) - 27 freq
sharkies (2) - 1 freq
sparks (2) - 19 freq
harks (2) - 1 freq
starks (2) - 1 freq
sarks (2) - 17 freq
shargs (2) - 1 freq
shanks (2) - 74 freq
shards (2) - 11 freq
shaks (2) - 25 freq
shirk (3) - 2 freq
shores (3) - 21 freq
shrieks (3) - 8 freq
shoarts (3) - 3 freq
shucks (3) - 2 freq
serks (3) - 1 freq
shair's (3) - 1 freq
shreeks (3) - 1 freq
stirks (3) - 14 freq
shires (3) - 1 freq
sperks (3) - 5 freq
SoundEx code - S620
serious - 152 freq
skreich - 31 freq
sark - 122 freq
shirras - 1 freq
sarks - 17 freq
source - 56 freq
sairious - 34 freq
shriek - 21 freq
shrieks - 8 freq
series - 86 freq
search - 116 freq
shark - 21 freq
sharks - 9 freq
sorrows - 10 freq
shooers - 17 freq
scaurs - 5 freq
screich - 18 freq
skerries - 10 freq
shores - 21 freq
sierra's - 1 freq
scarce - 48 freq
shrug - 17 freq
squares - 16 freq
swears - 5 freq
serrs - 5 freq
scars - 11 freq
skraichy - 1 freq
skreek - 4 freq
soorce - 17 freq
screech - 5 freq
screws - 22 freq
skraich - 13 freq
shears - 11 freq
scraich - 14 freq
scurries - 4 freq
skraik - 4 freq
sure's - 3 freq
scrogs - 4 freq
serk - 3 freq
scrooge - 8 freq
sers - 11 freq
sourocks - 3 freq
skrogs - 1 freq
'search - 1 freq
scours - 4 freq
screichie - 3 freq
shours - 1 freq
surge - 15 freq
skreigh - 2 freq
skiers - 2 freq
shirk - 2 freq
sweers - 1 freq
scores - 15 freq
surks - 1 freq
seers - 3 freq
skraiks - 1 freq
sweirs - 2 freq
screechie - 1 freq
scairse - 3 freq
skurry's - 1 freq
scairs - 1 freq
scrious - 1 freq
scoor's - 1 freq
shrugs - 12 freq
sorries - 2 freq
sourz - 1 freq
screes - 1 freq
scurry's - 1 freq
screch - 4 freq
scroag - 1 freq
scroaggy - 1 freq
shares - 27 freq
scares - 5 freq
serge - 4 freq
sarky - 10 freq
squeers - 1 freq
scroosh - 1 freq
sheers - 2 freq
scourge - 6 freq
skrek - 4 freq
sorras - 6 freq
sirs - 6 freq
sewers - 5 freq
shrews - 2 freq
scaur's - 1 freq
scoors - 2 freq
skriech - 3 freq
scraggy - 4 freq
serug - 3 freq
sairs - 8 freq
sores - 3 freq
scurj - 1 freq
serius - 1 freq
shreeks - 1 freq
soars - 5 freq
sarx - 1 freq
skraikie - 1 freq
sherrick - 2 freq
shrek - 3 freq
sairch - 7 freq
serious-wye - 4 freq
sharg - 2 freq
scriech - 3 freq
seriouswye - 1 freq
sayers - 17 freq
sorraes - 3 freq
skroos - 3 freq
'scrooge' - 1 freq
sirrah's - 1 freq
scrows - 1 freq
skreks - 2 freq
squarego - 1 freq
sherries - 2 freq
swaars - 1 freq
sarah's - 2 freq
soorik - 2 freq
soorik's - 2 freq
skrog - 1 freq
scaers - 1 freq
scroos - 1 freq
sarious - 1 freq
serks - 1 freq
skaurs - 1 freq
showers - 7 freq
skaars - 1 freq
scairce - 1 freq
seeryis - 2 freq
seereez - 1 freq
sark's - 1 freq
scurrie's - 1 freq
seraich - 1 freq
skraichie - 3 freq
sair's - 2 freq
skerrs - 2 freq
swarees - 1 freq
suir's - 1 freq
scroggs - 1 freq
soirées - 1 freq
shires - 1 freq
scurge - 1 freq
screigh - 2 freq
shair's - 1 freq
sours - 1 freq
sooriks - 2 freq
scraigh - 1 freq
sergey - 1 freq
scries - 1 freq
shears--she - 1 freq
saerious - 1 freq
shoors - 34 freq
soor's - 1 freq
sars - 1 freq
skurries - 2 freq
skairs - 2 freq
serss - 1 freq
squars - 1 freq
skouriss - 1 freq
sherks - 2 freq
sherk - 1 freq
scorchio - 1 freq
sergio - 15 freq
€˜sergio - 1 freq
squarish - 1 freq
scraik - 1 freq
sewerage - 2 freq
scroggie - 1 freq
sayrious - 1 freq
shoures - 1 freq
seirch - 1 freq
shoo'ers - 1 freq
scroggy - 3 freq
scorch - 1 freq
syriza - 12 freq
€œserious - 1 freq
suruchi - 1 freq
shargs - 1 freq
sirch - 1 freq
€˜search - 1 freq
seurs - 1 freq
“sark” - 1 freq
soorage - 1 freq
sarahg - 1 freq
scrake - 1 freq
sruc - 1 freq
shories - 1 freq
shooors - 1 freq
ssrg - 1 freq
srks - 1 freq
sarchy - 1 freq
shouers - 1 freq
sara's - 1 freq
saorse - 2 freq
sharkey - 1 freq
sgrgx - 1 freq
src - 1 freq
sayersy - 1 freq
sweeries - 1 freq
surhoose - 1 freq
MetaPhone code - XRKS
shrieks - 8 freq
sharks - 9 freq
shrugs - 12 freq
shreeks - 1 freq
sharkies - 1 freq
sherks - 2 freq
shargs - 1 freq
SHARKS
Time to execute Levenshtein function - 0.181814 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.397629 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027728 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042466 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001073 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.