A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to carts in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
carts (0) - 2 freq
parts (1) - 24 freq
charts (1) - 8 freq
farts (1) - 5 freq
cants (1) - 1 freq
caats (1) - 4 freq
marts (1) - 4 freq
car's (1) - 16 freq
darts (1) - 32 freq
cats (1) - 124 freq
cards (1) - 33 freq
sarts (1) - 1 freq
carte (1) - 4 freq
scarts (1) - 9 freq
warts (1) - 5 freq
arts (1) - 34 freq
clarts (1) - 2 freq
casts (1) - 19 freq
cairts (1) - 34 freq
cart (1) - 7 freq
cares (1) - 61 freq
tarts (1) - 40 freq
cars (1) - 100 freq
chits (2) - 1 freq
gar's (2) - 2 freq
carts (0) - 2 freq
cairts (1) - 34 freq
cars (2) - 100 freq
coorts (2) - 9 freq
cart (2) - 7 freq
casts (2) - 19 freq
certes (2) - 23 freq
cares (2) - 61 freq
crits (2) - 1 freq
courts (2) - 14 freq
cooarts (2) - 1 freq
cairtes (2) - 1 freq
curtsy (2) - 1 freq
couarts (2) - 1 freq
clarts (2) - 2 freq
tarts (2) - 40 freq
caats (2) - 4 freq
marts (2) - 4 freq
arts (2) - 34 freq
cants (2) - 1 freq
farts (2) - 5 freq
parts (2) - 24 freq
charts (2) - 8 freq
darts (2) - 32 freq
car's (2) - 16 freq
SoundEx code - C632
carrots - 33 freq
creetics - 5 freq
certes - 23 freq
curtsey - 5 freq
curtseyin - 6 freq
cairds - 88 freq
crates - 6 freq
critical - 19 freq
cardigans - 5 freq
creiticism - 5 freq
cairt's - 1 freq
cairridges - 1 freq
cairts - 34 freq
certies - 17 freq
courtships - 1 freq
cooards - 2 freq
criticeese - 1 freq
cortex - 1 freq
caretaker - 5 freq
critics - 17 freq
crouds - 2 freq
cheritie's - 1 freq
creates - 9 freq
cartesian - 1 freq
cards - 33 freq
cardigan - 18 freq
croods - 30 freq
curates' - 1 freq
curates - 3 freq
courteously - 1 freq
cords - 5 freq
chariots - 1 freq
carret's - 3 freq
chords - 10 freq
courts - 14 freq
courtesy - 9 freq
crowds - 30 freq
criticism - 21 freq
carthaginians - 1 freq
crootched - 3 freq
crootchin - 2 freq
creeds - 1 freq
cowards - 5 freq
critically - 1 freq
crits - 1 freq
cardiac - 2 freq
cowardess - 1 freq
crutches - 3 freq
criticise - 11 freq
carthage - 2 freq
charts - 8 freq
cowardice - 3 freq
carts - 2 freq
curtsy - 1 freq
cairry-oots - 1 freq
coort-case - 1 freq
cyaards - 9 freq
cyaards' - 1 freq
crutch - 4 freq
creauts - 1 freq
coorts - 9 freq
critic - 4 freq
coort-hoose - 1 freq
crowd's - 1 freq
cortege - 1 freq
cairtclaith - 1 freq
creiticall - 1 freq
criticeise - 2 freq
cruds - 2 freq
chairts - 8 freq
coortesy - 1 freq
cairrots - 7 freq
criticisms - 3 freq
crotches - 1 freq
curds - 3 freq
charities - 4 freq
croudged - 1 freq
currots - 1 freq
cooerds - 1 freq
cooerd's - 1 freq
cooardice - 1 freq
'cooards' - 1 freq
crowdieknowe - 1 freq
cortesia - 1 freq
curtsies - 1 freq
curtseyan - 1 freq
'critically - 1 freq
coards - 1 freq
caerds - 1 freq
criticeised - 1 freq
cyaard's - 2 freq
coowards - 1 freq
creitical - 5 freq
creiticised - 1 freq
criticised - 2 freq
critiseesms - 2 freq
creitics - 3 freq
cartographers - 2 freq
creetic - 1 freq
creeticism - 1 freq
cardies - 1 freq
cairtshed - 2 freq
coortship - 1 freq
couarts - 1 freq
cairties - 1 freq
cartowes - 1 freq
curtassie - 1 freq
criticeizin - 1 freq
cairtes - 1 freq
cooarts - 1 freq
cheriots - 2 freq
coortesan - 1 freq
crotchety - 1 freq
€œchariots - 1 freq
courtesan - 2 freq
chairiots - 1 freq
caretakers - 2 freq
courtship - 1 freq
€˜cratic - 1 freq
cairds- - 1 freq
criticising - 3 freq
critique - 2 freq
caird's - 2 freq
crathes - 1 freq
crathescastle - 1 freq
cortese - 1 freq
cerds - 1 freq
curtismccraw - 2 freq
criticalrole - 4 freq
crotch - 1 freq
MetaPhone code - KRTS
carrots - 33 freq
curtsey - 5 freq
cairds - 88 freq
crates - 6 freq
greets - 25 freq
cairt's - 1 freq
cairts - 34 freq
cooards - 2 freq
crouds - 2 freq
greits - 3 freq
creates - 9 freq
quartz - 4 freq
cards - 33 freq
gairds - 16 freq
groats - 5 freq
croods - 30 freq
curates' - 1 freq
curates - 3 freq
grates - 5 freq
cords - 5 freq
carret's - 3 freq
courts - 14 freq
courtesy - 9 freq
crowds - 30 freq
grades - 10 freq
creeds - 1 freq
guairds - 21 freq
crits - 1 freq
guards - 13 freq
carts - 2 freq
curtsy - 1 freq
grate's - 1 freq
cairry-oots - 1 freq
grats - 1 freq
creauts - 1 freq
coorts - 9 freq
greats - 4 freq
kerrots - 1 freq
crowd's - 1 freq
cruds - 2 freq
coortesy - 1 freq
cairrots - 7 freq
groaties - 2 freq
curds - 3 freq
kerds - 1 freq
currots - 1 freq
cooerds - 1 freq
cooerd's - 1 freq
cooardice - 1 freq
'cooards' - 1 freq
coards - 1 freq
caerds - 1 freq
gardies - 1 freq
cardies - 1 freq
greedy's - 1 freq
quarts - 1 freq
groat's - 1 freq
couarts - 1 freq
cairties - 1 freq
curtassie - 1 freq
gratis - 3 freq
gairdies - 1 freq
kurds - 1 freq
cairtes - 1 freq
cooarts - 1 freq
greedius - 1 freq
cairds- - 1 freq
grids - 1 freq
kurtas - 1 freq
caird's - 2 freq
grits - 1 freq
cortese - 1 freq
CARTS
Time to execute Levenshtein function - 0.437674 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.578471 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032531 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.071608 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000944 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.