A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to scartin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
scartin (0) - 18 freq
cartin (1) - 1 freq
scarrin (1) - 1 freq
scarin (1) - 2 freq
scarpin (1) - 1 freq
scartins (1) - 2 freq
sartin (1) - 6 freq
soartin (1) - 5 freq
startin (1) - 89 freq
scartit (1) - 23 freq
seartin (1) - 1 freq
scartlin (1) - 1 freq
scartin' (1) - 1 freq
skartin (1) - 4 freq
sautin (2) - 1 freq
starten (2) - 3 freq
skirtin (2) - 12 freq
saitin (2) - 3 freq
partin (2) - 5 freq
scoutin (2) - 1 freq
carpin (2) - 2 freq
smartie (2) - 1 freq
scrattin (2) - 10 freq
sharein (2) - 1 freq
scaudin (2) - 4 freq
Double Levenshtein
scartin (0) - 18 freq
scartit (2) - 23 freq
startin (2) - 89 freq
scartlin (2) - 1 freq
scartin' (2) - 1 freq
skartin (2) - 4 freq
soartin (2) - 5 freq
seartin (2) - 1 freq
sartin (2) - 6 freq
cartin (2) - 1 freq
scarrin (2) - 1 freq
scarin (2) - 2 freq
scartins (2) - 2 freq
scarpin (2) - 1 freq
sportin (3) - 16 freq
scarted (3) - 1 freq
stairtin (3) - 102 freq
startan (3) - 6 freq
carton (3) - 7 freq
snortin (3) - 13 freq
smarten (3) - 1 freq
cairtin (3) - 7 freq
scart (3) - 23 freq
scootin (3) - 4 freq
stertin (3) - 143 freq
SoundEx code - S635
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
seartin - 1 freq
scriddans - 2 freq
skyrie-tonguit - 1 freq
serten - 4 freq
sorten - 2 freq
scrooteneer's - 4 freq
scrootenized - 1 freq
scrootinizing - 1 freq
scratten - 1 freq
seratonin - 1 freq
scrutiny - 5 freq
sortin - 24 freq
sweertness - 1 freq
soartin - 5 freq
sartin - 6 freq
squirtin - 3 freq
sheridan's - 2 freq
sheridan - 5 freq
sardinian - 3 freq
scrittin - 1 freq
scrutinised - 1 freq
sortan - 1 freq
skartin - 4 freq
sardines - 3 freq
skirteen - 2 freq
shreddan - 1 freq
skirteens - 1 freq
shoartens - 1 freq
sardinia - 2 freq
scartins - 2 freq
shortened - 2 freq
shorthaand - 1 freq
sardane - 1 freq
shortenin - 2 freq
skrattin - 1 freq
sword-dauncin - 1 freq
scartin' - 1 freq
skirtins - 1 freq
shortening - 1 freq
sardine - 1 freq
scrattins - 1 freq
schrödinger - 2 freq
soartins - 2 freq
shoardin - 1 freq
shortness - 2 freq
sorting - 2 freq
shreadin' - 1 freq
scrutinise - 1 freq
shoartenin - 2 freq
sardonic - 1 freq
sardonicism - 1 freq
“scartin - 1 freq
shorthaund - 1 freq
saortony - 1 freq
schrodinger's - 1 freq
MetaPhone code - SKRTN
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
scratten - 1 freq
scrutiny - 5 freq
squirtin - 3 freq
scrittin - 1 freq
skartin - 4 freq
skirteen - 2 freq
skrattin - 1 freq
scartin' - 1 freq
“scartin - 1 freq
Manually curated
SCARTIN
Time to execute Levenshtein function - 0.197843 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
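The definition above can be illustrated with a minimal Python sketch. This is not the code the site runs; the function name `levenshtein` and the row-by-row dynamic-programming approach are assumptions chosen for illustration, with each replacement, insertion and deletion costing 1.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where replace, insert and delete each cost 1.

    Illustrative sketch only; not the corpus site's implementation.
    """
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace (or keep if equal)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for other in ("scartin", "startin", "skirtin"):
        print(other, levenshtein("scartin", other))
```

With the query word scartin, this gives 1 for startin and 2 for skirtin, matching the distances in the Levenshtein list above.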
Time to execute Double Levenshtein function - 0.349409 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
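A rough Python sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u; the page does not say exactly which letters count as vowels, and the names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: only these five letters are treated as vowels.
VOWELS = set("aeiou")


def strip_vowels(word: str) -> str:
    """Return the word with assumed vowels removed, e.g. scartin -> scrtn."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for other in ("scartit", "startin", "sportin"):
        print(other, double_levenshtein("scartin", other))
```

Under these assumptions, scartit and startin both score 2 against scartin and sportin scores 3, which agrees with the Double Levenshtein list above. A vowel-only change costs 1 while a consonant change costs 2, which is where the extra weight on consonants comes from.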
Time to execute SoundEx function - 0.027584 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
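To show how words end up sharing a code such as S635, here is a small Python sketch of the standard American Soundex rules. It is an illustration, not the function the site calls, and implementations differ in edge cases such as non-ASCII letters, which this sketch simply ignores.

```python
def soundex(word: str) -> str:
    """Four-character American Soundex code, e.g. 'scartin' -> 'S635'.

    Illustrative sketch; real implementations may treat edge cases differently.
    """
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: anything outside a-z (apostrophes, accented letters) is dropped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
            if len(result) == 4:
                break
        previous = digit  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for w in ("scartin", "skirtin", "sheridan", "scrutiny"):
        print(w, soundex(w))
```

All four sample words encode to S635, which is why such different-looking words as sheridan and scrutiny appear in the SoundEx list for scartin.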
Time to execute MetaPhone function - 0.036949 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000855 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
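A lookup of this kind can be sketched in a few lines of Python. The table below is made up for illustration and is not the corpus's actual lexicon: only the pairing of scartin with SCARTIN comes from the result above, and the other entries and the names `LEXICON` and `lemma_for` are hypothetical.

```python
from typing import Optional

# Hypothetical hand-built table: spelling variant -> lemma.
# Only the scartin -> SCARTIN pairing is taken from the page; the rest is invented.
LEXICON = {
    "scartin": "SCARTIN",
    "scartin'": "SCARTIN",
    "skartin": "SCARTIN",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("scartin"))
    print(lemma_for("startin"))  # not in the table, so no lemma is returned
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having the shortest execution time of the five.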