A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to citris in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
citris (0) - 3 freq
citric (1) - 2 freq
citrus (1) - 1 freq
citrusy (2) - 1 freq
pitril (2) - 7 freq
city's (2) - 10 freq
itis (2) - 1 freq
cities (2) - 43 freq
airis (2) - 1 freq
cxhris (2) - 1 freq
tetris (2) - 1 freq
litres (2) - 3 freq
iris (2) - 3 freq
citie (2) - 10 freq
cit's (2) - 1 freq
citin (2) - 1 freq
citit (2) - 2 freq
witries (2) - 2 freq
chris (2) - 253 freq
viris (2) - 1 freq
cites (2) - 12 freq
cittie (2) - 2 freq
cits (2) - 1 freq
ciaran (3) - 4 freq
cigs (3) - 1 freq
citris (0) - 3 freq
citrus (1) - 1 freq
citric (2) - 2 freq
citrusy (2) - 1 freq
litres (3) - 3 freq
cites (3) - 12 freq
tetris (3) - 1 freq
chris (3) - 253 freq
cit's (3) - 1 freq
cities (3) - 43 freq
city's (3) - 10 freq
witries (3) - 2 freq
cits (3) - 1 freq
cars (4) - 100 freq
extras (4) - 12 freq
chorus (4) - 69 freq
caers (4) - 1 freq
curries (4) - 7 freq
corrs (4) - 2 freq
cuts (4) - 45 freq
tries (4) - 74 freq
coories (4) - 5 freq
camras (4) - 1 freq
cutties (4) - 3 freq
intries (4) - 1 freq
SoundEx code - C362
cattrick - 1 freq
chitters - 7 freq
catharsis - 1 freq
cottaries - 1 freq
citris - 3 freq
citrus - 1 freq
cedars - 1 freq
cottars - 19 freq
citrusy - 1 freq
cottar-hooses - 1 freq
catter's - 1 freq
cutter's - 1 freq
cheeters - 1 freq
cataracts - 1 freq
cataract - 5 freq
cottars' - 3 freq
cottar's - 1 freq
cottar-hoose - 5 freq
citric - 2 freq
cheaters - 1 freq
cathersmcg - 5 freq
cheddars - 1 freq
MetaPhone code - STRS
storeys - 3 freq
stairs - 209 freq
stars - 106 freq
stories - 359 freq
stours - 3 freq
steers - 11 freq
staris - 1 freq
stress - 41 freq
story's - 1 freq
soutar's - 6 freq
stares - 41 freq
staurs - 3 freq
stirs - 6 freq
stores - 30 freq
citris - 3 freq
strays - 5 freq
citrus - 1 freq
cedars - 1 freq
citrusy - 1 freq
straws - 6 freq
star's - 1 freq
steirs - 4 freq
strae's - 1 freq
strahs - 1 freq
steer's - 1 freq
satires - 5 freq
stars' - 1 freq
sters - 1 freq
straes - 3 freq
stores-' - 1 freq
stoor's - 1 freq
strauss - 93 freq
setter's - 1 freq
saeter's - 1 freq
steeers - 1 freq
satyrs - 3 freq
stair's - 2 freq
sterrs - 7 freq
stories' - 3 freq
soutars' - 1 freq
sottars - 3 freq
storeys' - 1 freq
stres - 1 freq
suitors - 4 freq
strassa' - 1 freq
stors - 1 freq
souders - 1 freq
storees - 1 freq
straas - 1 freq
'stars' - 1 freq
CITRIS
Time to execute Levenshtein function - 0.226368 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.379581 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028604 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038292 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000947 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.