A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to chipt in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein distance
chipt (0) - 1 freq
chist (1) - 68 freq
chip (1) - 41 freq
chips (1) - 117 freq
chippt (1) - 1 freq
chirpt (1) - 1 freq
whipt (1) - 2 freq
clipt (1) - 1 freq
chit (1) - 4 freq
whips (2) - 22 freq
chink (2) - 13 freq
chibs (2) - 5 freq
china (2) - 53 freq
whip (2) - 14 freq
cuit (2) - 2 freq
chists (2) - 1 freq
chik (2) - 11 freq
crypt (2) - 2 freq
chaft (2) - 13 freq
chilpit (2) - 2 freq
clipe (2) - 9 freq
chop (2) - 25 freq
chaist (2) - 3 freq
coupt (2) - 3 freq
cit (2) - 4 freq
Double Levenshtein distance
chipt (0) - 1 freq
whipt (2) - 2 freq
chit (2) - 4 freq
chapit (2) - 1 freq
chirpt (2) - 1 freq
clipt (2) - 1 freq
chist (2) - 68 freq
chippt (2) - 1 freq
chip (2) - 41 freq
chips (2) - 117 freq
chape (3) - 6 freq
chuft (3) - 2 freq
chert (3) - 2 freq
chitty (3) - 1 freq
cowpt (3) - 21 freq
chyst (3) - 1 freq
campt (3) - 2 freq
chairt (3) - 17 freq
chett (3) - 1 freq
chapo (3) - 1 freq
choppt (3) - 2 freq
chippit (3) - 7 freq
cheat (3) - 6 freq
chait (3) - 1 freq
chops (3) - 15 freq
SoundEx code - C130
cowpit - 35 freq
chappit - 45 freq
cowpt - 21 freq
cept - 25 freq
chuffed - 84 freq
chapped - 40 freq
coupt - 3 freq
copied - 13 freq
cowped - 48 freq
cupped - 9 freq
cooped - 7 freq
caved - 9 freq
chipped - 14 freq
caped - 2 freq
chippit - 7 freq
chappt - 3 freq
cuffed - 1 freq
chuffd - 2 freq
coft - 9 freq
cupid - 3 freq
'cept - 9 freq
coupit - 16 freq
covid - 73 freq
couped - 9 freq
chaaved - 3 freq
chaved - 1 freq
chuffit - 2 freq
chaft - 13 freq
chuff't - 3 freq
chappet - 2 freq
cheviot - 3 freq
chibbed - 3 freq
coppit - 3 freq
capita - 1 freq
choppt - 2 freq
coopit - 3 freq
covet - 3 freq
covid- - 10 freq
chipt - 1 freq
chopped - 6 freq
chippid - 1 freq
cheepit - 3 freq
cuppid - 1 freq
caveat - 1 freq
cavity - 2 freq
cop-oot - 1 freq
coftee - 1 freq
ciabatta - 1 freq
cave-heid - 2 freq
chaffed - 2 freq
chaift - 1 freq
chuft - 2 freq
cofft - 1 freq
cuppit - 3 freq
chap't - 1 freq
co'peth - 2 freq
cappit - 3 freq
cowboy-hat - 1 freq
cpd - 3 freq
“cept - 1 freq
caputh - 1 freq
chapit - 1 freq
chafft - 1 freq
chafed - 1 freq
choped - 2 freq
coped - 2 freq
chaffit - 1 freq
chauvit - 1 freq
chippt - 1 freq
chufft - 2 freq
cvht - 1 freq
cbd - 1 freq
coovid - 1 freq
cowpat - 1 freq
cvda - 1 freq
cpt - 1 freq
cbeath - 1 freq
covid” - 1 freq
chuffed… - 1 freq
MetaPhone code - XPT
shapit - 14 freq
chappit - 45 freq
chapped - 40 freq
shaped - 34 freq
chipped - 14 freq
chippit - 7 freq
chappt - 3 freq
shipped - 4 freq
shippit - 1 freq
shaipet - 1 freq
shaypet - 1 freq
chappet - 2 freq
shaepid - 1 freq
shape't - 3 freq
choppt - 2 freq
shapet - 1 freq
chipt - 1 freq
chopped - 6 freq
chippid - 1 freq
cheepit - 3 freq
shaipit - 1 freq
shaepit - 2 freq
shappit - 2 freq
'shappit' - 1 freq
'shappit - 1 freq
chap't - 1 freq
shippet - 1 freq
shuppit - 1 freq
chapit - 1 freq
choped - 2 freq
chippt - 1 freq
Manually curated - CHIPT
Time to execute Levenshtein function - 0.312623 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
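As an illustration of the definition above, here is a minimal Python sketch of the usual dynamic-programming calculation. It is not the code this site runs (that is not shown here); the function name `levenshtein` is chosen only for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # the distance is symmetric; keep the inner row short

    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("chipt", "chist"))    # 1 (replace p with s)
print(levenshtein("chipt", "chilpit"))  # 2 (insert l, insert i)
```

Both results agree with the distances shown in the Levenshtein list for chipt.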
Time to execute Double Levenshtein function - 0.502505 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
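The idea can be sketched in a few lines of Python. This is a reconstruction under assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set `aeiouy` is a guess that happens to reproduce the scores listed for chipt (for instance, chyst only scores 3 if y counts as a vowel).

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: replacements, insertions and deletions."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, free if equal
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Whole-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("chipt", "chapit"))   # 2: the skeletons are identical
print(double_levenshtein("chipt", "chippit"))  # 3: 2 for the word, 1 for the skeleton
```

Because a vowel change only counts in the first pass while a consonant change counts in both, chapit (2) ends up as close to chipt as chist (2) does, although its plain Levenshtein distance is larger.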
Time to execute SoundEx function - 0.027724 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
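To show how a word such as chipt ends up with the code C130, here is a minimal Python sketch of the standard American Soundex rules. Treat it as an illustration: the site's own implementation may differ in edge cases, and the handling of non-letters (simply dropped) is an assumption made for this example.

```python
# Consonant groups that share a Soundex digit
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code: first letter plus three digits."""
    # Assumption: apostrophes, hyphens and other non-letters are ignored
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    last_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same code
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != last_code:
            result += code
            if len(result) == 4:
                break
        last_code = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


print(soundex("chipt"))    # C130
print(soundex("chuffed"))  # C130
```

Only the first letter and up to three consonant classes survive, which is why words as different as covid and chuffed share a code with chipt.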
Time to execute MetaPhone function - 0.038941 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000841 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.