A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to scartin in Corpus

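Results are listed by method in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Where a number appears in parentheses it is the distance from the search word; freq is the word's frequency in the corpus.

Levenshtein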
scartin (0) - 18 freq
skartin (1) - 4 freq
scartin' (1) - 1 freq
scartlin (1) - 1 freq
scartins (1) - 2 freq
scarin (1) - 2 freq
scarrin (1) - 1 freq
seartin (1) - 1 freq
startin (1) - 97 freq
sartin (1) - 6 freq
scarpin (1) - 1 freq
cartin (1) - 1 freq
soartin (1) - 5 freq
scartit (1) - 23 freq
scalin (2) - 2 freq
carpin (2) - 2 freq
snarin (2) - 2 freq
seatin (2) - 4 freq
snortin (2) - 13 freq
smarten (2) - 1 freq
scoutin (2) - 1 freq
starten (2) - 3 freq
satin (2) - 12 freq
plartin (2) - 1 freq
scootin (2) - 4 freq
Double Levenshtein
scartin (0) - 18 freq
sartin (2) - 6 freq
startin (2) - 97 freq
scarpin (2) - 1 freq
soartin (2) - 5 freq
scartit (2) - 23 freq
seartin (2) - 1 freq
cartin (2) - 1 freq
skartin (2) - 4 freq
scarrin (2) - 1 freq
scartlin (2) - 1 freq
scartin' (2) - 1 freq
scartins (2) - 2 freq
scarin (2) - 2 freq
sortin (3) - 24 freq
scaurin (3) - 1 freq
cairtin (3) - 7 freq
scorin (3) - 6 freq
scrattin (3) - 10 freq
startan (3) - 6 freq
cartain (3) - 1 freq
spurtin (3) - 2 freq
scarn (3) - 2 freq
certin (3) - 1 freq
snoartin (3) - 1 freq
SoundEx code - S635
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
seartin - 1 freq
scriddans - 2 freq
skyrie-tonguit - 1 freq
serten - 4 freq
sorten - 2 freq
scrooteneer's - 4 freq
scrootenized - 1 freq
scrootinizing - 1 freq
scratten - 1 freq
seratonin - 1 freq
squirtin - 4 freq
sardinia - 3 freq
scrutiny - 5 freq
sortin - 24 freq
sweertness - 1 freq
soartin - 5 freq
sartin - 6 freq
sheridan's - 2 freq
sheridan - 5 freq
sardinian - 3 freq
scrittin - 1 freq
scrutinised - 1 freq
sortan - 1 freq
skartin - 4 freq
sardines - 3 freq
skirteen - 2 freq
shreddan - 1 freq
skirteens - 1 freq
shoartens - 1 freq
scartins - 2 freq
shortened - 2 freq
shorthaand - 1 freq
sardane - 1 freq
shortenin - 2 freq
skrattin - 1 freq
sword-dauncin - 1 freq
scartin' - 1 freq
skirtins - 1 freq
shortening - 1 freq
sardine - 1 freq
scrattins - 1 freq
schrödinger - 2 freq
soartins - 2 freq
shoardin - 1 freq
shortness - 2 freq
sorting - 2 freq
shreadin' - 1 freq
scrutinise - 1 freq
shoartenin - 2 freq
sardonic - 1 freq
sardonicism - 1 freq
“scartin - 1 freq
shorthaund - 1 freq
saortony - 1 freq
schrodinger's - 1 freq
MetaPhone code - SKRTN
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
scratten - 1 freq
squirtin - 4 freq
scrutiny - 5 freq
scrittin - 1 freq
skartin - 4 freq
skirteen - 2 freq
skrattin - 1 freq
scartin' - 1 freq
“scartin - 1 freq
Manually curated
SCARTIN
Time to execute Levenshtein function - 0.401219 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
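
As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name levenshtein is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits that turn a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("scartin", "skartin", "startin", "scalin"):
        print(word, levenshtein("scartin", word))
```

The distances this gives for skartin (1) and scalin (2) agree with the Levenshtein list above.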
Time to execute Double Levenshtein function - 0.822694 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
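
A rough Python sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u; the site's actual vowel set is not stated, and the names strip_vowels and double_levenshtein are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("scartin", "skartin", "scartit", "sortin", "scaurin"):
        print(word, double_levenshtein("scartin", word))
```

Under this assumption the sketch gives 2 for skartin and 3 for sortin, the same values as the Double Levenshtein list above.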
Time to execute SoundEx function - 0.086428 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
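
To show how such a code is built, here is a small Python sketch of the usual American Soundex rules: keep the first letter, map the remaining consonants to digits, skip repeats of the same digit, and pad to four characters. It is an illustration only; the site's own implementation may differ in edge cases, and this version simply ignores anything that is not an ASCII letter.

```python
# Digit assigned to each consonant group; vowels, H, W and Y have no digit.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if word has no ASCII letters."""
    letters = [ch for ch in word.upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    last = CODES.get(letters[0], "")  # the first letter's digit also suppresses repeats
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit after it is kept
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("scartin", "skirtin", "sheridan", "sardinia"):
        print(word, soundex(word))
```

All four sample words come out as S635, the code shown for scartin above, which is why such different-looking words share one list.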
Time to execute MetaPhone function - 0.100873 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.005170 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
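
A lookup of this kind can be sketched in a few lines of Python. The table entries below are hypothetical and only illustrate the shape of the data; they are not taken from the site's lexicon, and LEMMA_TABLE and lookup_lemma are names invented for the example.

```python
from typing import Optional

# Hypothetical entries for illustration: spelling variant -> lemma.
LEMMA_TABLE = {
    "scartin": "SCARTIN",
    "skartin": "SCARTIN",
    "scartin'": "SCARTIN",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None when the word is not covered."""
    return LEMMA_TABLE.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Skartin"))
    print(lookup_lemma("startin"))
```

Because this is a single dictionary lookup with no distance calculation, it is consistent with this method having the shortest execution time of the five.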