A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to child in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
child (0) - 56 freq
chile (1) - 15 freq
chill (1) - 45 freq
chield (1) - 15 freq
childe (1) - 1 freq
childs (1) - 1 freq
cuild (1) - 7 freq
child' (1) - 1 freq
'child (1) - 2 freq
ochils (2) - 8 freq
cricd (2) - 1 freq
coiled (2) - 2 freq
shuld (2) - 29 freq
chise (2) - 2 freq
chieh (2) - 2 freq
chimp (2) - 3 freq
chip (2) - 41 freq
chib (2) - 8 freq
chins (2) - 5 freq
cold (2) - 44 freq
caird (2) - 96 freq
chiao (2) - 2 freq
chad (2) - 3 freq
cuils (2) - 1 freq
beild (2) - 41 freq
child (0) - 56 freq
childe (1) - 1 freq
chield (1) - 15 freq
'child (2) - 2 freq
cuild (2) - 7 freq
child' (2) - 1 freq
childs (2) - 1 freq
chile (2) - 15 freq
chill (2) - 45 freq
chided (3) - 3 freq
heild (3) - 48 freq
cawld (3) - 4 freq
cowld (3) - 52 freq
chimed (3) - 8 freq
haild (3) - 2 freq
hilda (3) - 23 freq
could (3) - 2637 freq
chilly (3) - 12 freq
held (3) - 505 freq
chalk (3) - 20 freq
cuiled (3) - 3 freq
chide (3) - 4 freq
hold (3) - 65 freq
chilled (3) - 9 freq
chiels (3) - 110 freq
SoundEx code - C430
could - 2637 freq
cooled - 4 freq
cauld - 842 freq
clood - 79 freq
clad - 14 freq
claith - 59 freq
cloath - 1 freq
cool't - 1 freq
clowt - 1 freq
cloot - 129 freq
couldae - 4 freq
clatty - 26 freq
chield - 15 freq
cleid - 10 freq
ceilidh - 23 freq
culd - 41 freq
clattie - 11 freq
'clatty - 1 freq
called - 195 freq
cloud - 67 freq
cold - 44 freq
cloth - 16 freq
claaed - 2 freq
cled - 34 freq
'cauld - 4 freq
'callet' - 1 freq
clud - 9 freq
child - 56 freq
clootie - 33 freq
clyth - 1 freq
'could - 16 freq
cald - 8 freq
coulda - 29 freq
cluit - 1 freq
caald - 28 freq
clyde - 66 freq
clutha - 2 freq
clyte - 1 freq
clatt - 5 freq
claa't - 1 freq
clout - 18 freq
cowld - 52 freq
caalt - 24 freq
callit - 3 freq
clathy - 1 freq
coilit - 1 freq
cloody - 6 freq
coiled - 2 freq
cowl-eed - 1 freq
clawed - 5 freq
chilehuid - 1 freq
'child - 2 freq
couldhae - 1 freq
couled - 1 freq
clod - 14 freq
clawit - 1 freq
clat - 6 freq
cowlt - 4 freq
cal'd - 2 freq
clooth - 1 freq
culled - 3 freq
claithe - 1 freq
chilled - 9 freq
callt - 14 freq
collate - 1 freq
cult - 8 freq
clooty - 3 freq
clathe - 1 freq
cuillied - 1 freq
cledd - 1 freq
caaled - 5 freq
cawld - 4 freq
cleed - 2 freq
clot - 1 freq
cloutie - 2 freq
clowtd - 1 freq
claethe - 1 freq
clothe - 1 freq
collit - 1 freq
cuiled - 3 freq
'cold - 1 freq
cold' - 1 freq
cladh - 1 freq
colled - 2 freq
cleat - 1 freq
coled - 1 freq
€˜clyde - 1 freq
cilt - 7 freq
colt - 1 freq
€œcauld - 1 freq
chalet - 5 freq
cweeled - 4 freq
celt - 2 freq
cluitie - 1 freq
€˜child - 3 freq
call-oot - 1 freq
€˜cauld - 1 freq
€˜could - 2 freq
childe - 1 freq
ceildh - 3 freq
€œceildih - 1 freq
couldo - 1 freq
€œcould - 4 freq
cuild - 7 freq
cöllied - 1 freq
'chilled - 1 freq
clothie - 1 freq
claude - 1 freq
clued - 1 freq
cheilidh - 2 freq
clyde- - 1 freq
clydeu - 2 freq
cauld” - 1 freq
cloudy - 3 freq
'clootie - 1 freq
child' - 1 freq
cloot' - 1 freq
chilloot - 3 freq
chillout - 6 freq
MetaPhone code - XLT
should - 907 freq
chield - 15 freq
shield - 26 freq
shelled - 2 freq
child - 56 freq
shalt - 6 freq
'should - 2 freq
shoulda - 29 freq
shaldou - 1 freq
shallt - 2 freq
'shield - 1 freq
should'a - 2 freq
'child - 2 freq
shuld - 29 freq
shouldae - 1 freq
sheltie - 31 freq
shooled - 1 freq
chilled - 9 freq
sheeld - 1 freq
shelt - 28 freq
shaald - 3 freq
chalet - 5 freq
€˜child - 3 freq
childe - 1 freq
shawled - 1 freq
€˜should - 2 freq
€œshould - 1 freq
shilt - 2 freq
'chilled - 1 freq
'shult' - 1 freq
'shultie' - 1 freq
should' - 1 freq
cheilidh - 2 freq
shouldda - 2 freq
child' - 1 freq
chilloot - 3 freq
chillout - 6 freq
CHILD
Time to execute Levenshtein function - 0.192498 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.326085 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027409 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036720 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000821 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.