A Corpus of 21st Century Scots Texts

Levenshtein Distance

Similar words to quartz in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in brackets
quartz (0) - 4 freq
quarts (1) - 1 freq
quarto (1) - 8 freq
quart (1) - 3 freq
quarz (1) - 2 freq
quats (2) - 1 freq
quartir (2) - 1 freq
quartet (2) - 1 freq
quate (2) - 161 freq
quares (2) - 1 freq
quarter (2) - 68 freq
quact (2) - 1 freq
quaert (2) - 1 freq
quare (2) - 245 freq
quarry (2) - 60 freq
quat (2) - 31 freq
queart (2) - 3 freq
quanta (2) - 1 freq
quaet (2) - 26 freq
quait (2) - 131 freq
ruary (3) - 1 freq
hearts (3) - 32 freq
guard (3) - 38 freq
quarely (3) - 5 freq
wurt (3) - 4 freq
Double Levenshtein - combined distance in brackets
quartz (0) - 4 freq
quarz (2) - 2 freq
quart (2) - 3 freq
quarts (2) - 1 freq
quarto (2) - 8 freq
quarter (3) - 68 freq
quaert (3) - 1 freq
quartet (3) - 1 freq
queart (3) - 3 freq
quartir (3) - 1 freq
quaet (4) - 26 freq
quanta (4) - 1 freq
quait (4) - 131 freq
rtz (4) - 2 freq
quaarter (4) - 1 freq
quartier (4) - 1 freq
hertz (4) - 8 freq
quats (4) - 1 freq
quate (4) - 161 freq
quares (4) - 1 freq
quat (4) - 31 freq
quarry (4) - 60 freq
quare (4) - 245 freq
quact (4) - 1 freq
quet (5) - 3 freq
SoundEx code - Q632
quartz - 4 freq
quarts - 1 freq
MetaPhone code - KRTS
carrots - 30 freq
curtsey - 5 freq
cairds - 88 freq
crates - 6 freq
greets - 25 freq
cairt's - 1 freq
cairts - 32 freq
cooards - 2 freq
crouds - 2 freq
greits - 3 freq
creates - 8 freq
quartz - 4 freq
cards - 33 freq
gairds - 16 freq
groats - 5 freq
croods - 30 freq
curates' - 1 freq
curates - 3 freq
grates - 5 freq
cords - 5 freq
carret's - 3 freq
courts - 14 freq
creeds - 1 freq
guairds - 21 freq
crits - 1 freq
courtesy - 8 freq
guards - 13 freq
crowds - 28 freq
carts - 2 freq
curtsy - 1 freq
grades - 9 freq
grate's - 1 freq
cairry-oots - 1 freq
grats - 1 freq
creauts - 1 freq
coorts - 9 freq
greats - 4 freq
kerrots - 1 freq
crowd's - 1 freq
cruds - 2 freq
coortesy - 1 freq
cairrots - 7 freq
groaties - 2 freq
curds - 3 freq
kerds - 1 freq
currots - 1 freq
cooerds - 1 freq
cooerd's - 1 freq
cooardice - 1 freq
'cooards' - 1 freq
coards - 1 freq
caerds - 1 freq
gardies - 1 freq
cardies - 1 freq
greedy's - 1 freq
quarts - 1 freq
groat's - 1 freq
couarts - 1 freq
cairties - 1 freq
curtassie - 1 freq
gratis - 3 freq
gairdies - 1 freq
kurds - 1 freq
cairtes - 1 freq
cooarts - 1 freq
greedius - 1 freq
cairds- - 1 freq
grids - 1 freq
kurtas - 1 freq
caird's - 2 freq
grits - 1 freq
cortese - 1 freq
Manually curated lemma - QUARTZ
Time to execute Levenshtein function - 0.580782 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
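As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` and the row-by-row layout are choices made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word as b so each row stays as small as possible.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


for word in ("quarts", "quarter", "hearts"):
    print(word, levenshtein("quartz", word))
```

Run against the search word, this gives 1 for quarts, 2 for quarter and 3 for hearts, in line with the bracketed figures in the Levenshtein list.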
Time to execute Double Levenshtein function - 0.785329 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
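The two-pass idea can be sketched in Python as follows. This is an assumption-laden illustration, not the site's own code: the vowel set is taken to be a, e, i, o, u, and the names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the page does not say which letters count


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("quarts", "quarter", "hertz", "quet"):
    print(word, double_levenshtein("quartz", word))
```

Under these assumptions the sketch reproduces the bracketed figures in the Double Levenshtein list: 2 for quarts, 3 for quarter, 4 for hertz and 5 for quet.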
Time to execute SoundEx function - 0.035862 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
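For readers who want to see how such a code is built, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the following consonants to digits, skip vowels, collapse adjacent repeats, and pad to four characters. The function name `soundex` is illustrative, and the site's own implementation may differ in edge cases.

```python
# Digit assigned to each consonant group; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        # Only emit a digit when it differs from the one just before it.
        if digit and digit != previous:
            code += digit
        # h and w do not separate repeated digits; vowels do.
        if ch not in "hw":
            previous = digit
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(soundex("quartz"), soundex("quarts"))
```

Both quartz and quarts come out as Q632 here, which is why they are the only two words in the SoundEx list.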
Time to execute MetaPhone function - 0.071562 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000799 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
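A lookup of this kind can be sketched in Python as a plain dictionary. Everything below is hypothetical except the single quartz entry, which is taken from the result shown on this page; the names `LEXICON` and `curated_lemma`, and the choice to lower-case the query before looking it up, are assumptions made for the example.

```python
from typing import Dict, Optional

# Hypothetical lexicon: only the quartz entry comes from the page above.
LEXICON: Dict[str, str] = {
    "quartz": "QUARTZ",
}


def curated_lemma(word: str, lexicon: Dict[str, str] = LEXICON) -> Optional[str]:
    """Return the hand-assigned lemma for word, or None if it is not covered."""
    # Assumption: lookups ignore case and surrounding whitespace.
    return lexicon.get(word.strip().lower())


print(curated_lemma("quartz"))
print(curated_lemma("sonsie"))  # not in this toy lexicon, so None
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having the shortest execution time of the five.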