A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to connect in Corpus

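Results are listed in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. For the first two methods the distance is shown in parentheses.
Levenshtein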
connect (0) - 15 freq
conneck (1) - 3 freq
connects (1) - 4 freq
connecit (1) - 2 freq
connecks (2) - 3 freq
consent (2) - 11 freq
contact (2) - 83 freq
concept (2) - 37 freq
context (2) - 79 freq
conneks (2) - 2 freq
donnert (2) - 20 freq
contest (2) - 16 freq
collect (2) - 28 freq
content (2) - 117 freq
connacht (2) - 10 freq
convent (2) - 3 freq
bonnet (2) - 34 freq
conner (2) - 2 freq
coreect (2) - 1 freq
sonnet (2) - 12 freq
conceit (2) - 22 freq
conneckit (2) - 2 freq
connery (2) - 5 freq
connekit (2) - 2 freq
cornet (2) - 4 freq
Double Levenshtein
connect (0) - 15 freq
connecit (1) - 2 freq
conneck (2) - 3 freq
connects (2) - 4 freq
connectet (3) - 2 freq
conduct (3) - 13 freq
connach (3) - 9 freq
conneckit (3) - 2 freq
connekit (3) - 2 freq
conract (3) - 1 freq
connectit (3) - 18 freq
conceit (3) - 22 freq
connectin (3) - 4 freq
connected (3) - 20 freq
reconnect (3) - 2 freq
connacht (3) - 10 freq
contact (3) - 83 freq
connecks (4) - 3 freq
cannit (4) - 1 freq
coantact (4) - 1 freq
convent (4) - 3 freq
concept (4) - 37 freq
consent (4) - 11 freq
cunnecten (4) - 1 freq
connotit (4) - 1 freq
SoundEx code - C523
chynged - 131 freq
considered' - 1 freq
concedin - 1 freq
chinged - 18 freq
chance'd - 1 freq
constant - 71 freq
consider - 90 freq
cannigait - 3 freq
consternation - 6 freq
cheenged - 39 freq
'constipation's - 1 freq
chanced - 6 freq
changed - 137 freq
constantly - 24 freq
considerin - 24 freq
conceit - 22 freq
conseider - 9 freq
camsteerie - 2 freq
constrictor - 9 freq
constrictors - 5 freq
canniest - 1 freq
connectin - 4 freq
connects - 4 freq
constable - 24 freq
conseeder - 14 freq
conseiderin - 3 freq
'conseider - 1 freq
chance-whit - 1 freq
concait - 6 freq
'comestibling' - 1 freq
camstairy - 2 freq
connections - 35 freq
coincidence - 16 freq
conseederit - 1 freq
cheingit - 2 freq
constrictin - 2 freq
consait - 28 freq
chainged - 26 freq
conceded - 4 freq
coincidences - 1 freq
consty - 1 freq
connectit - 18 freq
cheynged - 3 freq
constituency - 8 freq
constituencies - 8 freq
consideration - 15 freq
chnstian - 2 freq
connected - 20 freq
chyngit - 14 freq
connaught - 1 freq
connection - 41 freq
cheengit - 3 freq
constanta - 1 freq
conseederatioun - 2 freq
cunnectien - 1 freq
cunnecten - 1 freq
connect - 15 freq
connectet - 2 freq
connecit - 2 freq
concedes - 4 freq
concedit - 3 freq
considered - 28 freq
coincide - 4 freq
considerate - 4 freq
constructs - 3 freq
constantin - 2 freq
'considerin - 1 freq
constriction - 2 freq
changeit - 1 freq
chemist - 11 freq
conseediration - 1 freq
chunced - 2 freq
conseederin - 5 freq
construction - 16 freq
connacht - 10 freq
constitutional - 10 freq
constitution - 15 freq
considerit - 5 freq
constituents - 10 freq
constitute - 3 freq
constituent - 3 freq
chemistry - 7 freq
chenged - 9 freq
considert - 8 freq
constantlie - 1 freq
constructions - 6 freq
conceited - 1 freq
considerable - 8 freq
conseederable - 1 freq
constables - 5 freq
constonant - 2 freq
conked - 1 freq
constructive - 2 freq
construct - 4 freq
concaitie - 1 freq
conseidert - 2 freq
conseederation - 3 freq
conseedert - 12 freq
constrained - 1 freq
constellation - 3 freq
coincidentally - 1 freq
constellations - 4 freq
chemist's - 1 freq
considers - 7 freq
constancy - 4 freq
constructit - 4 freq
conceitit - 1 freq
constitutionals - 1 freq
concedan - 1 freq
concede - 5 freq
consate - 4 freq
consates - 2 freq
consither - 10 freq
conseiderautioun - 1 freq
chaenged - 5 freq
constraints - 3 freq
connectors - 1 freq
constrict - 1 freq
constipated - 2 freq
connectedness - 7 freq
chansts - 31 freq
chanst - 22 freq
canigait - 1 freq
canst - 2 freq
cheinged - 6 freq
constellatiouns - 1 freq
considerations - 1 freq
constitutions - 1 freq
constructed - 5 freq
consitherin - 2 freq
consithert - 2 freq
consaity-adjectives - 1 freq
constantinople - 2 freq
conseiders - 2 freq
construck - 1 freq
consternatioun - 1 freq
camsteirie - 2 freq
consithered - 1 freq
conseiderable - 2 freq
'constructs - 1 freq
considering - 6 freq
considerance - 1 freq
consaits - 20 freq
conseidered - 1 freq
constitutioun - 1 freq
constitutiounal - 1 freq
constitouensie - 5 freq
constitouents - 1 freq
constitouent - 1 freq
conseedered - 7 freq
connached - 5 freq
connecktion - 1 freq
consíderâtion - 1 freq
cinched - 1 freq
conchoidal-fracture - 1 freq
concaity - 1 freq
chingged - 1 freq
connekkit - 1 freq
connekit - 2 freq
connectioun - 1 freq
connectivity - 1 freq
conseideration - 1 freq
conseether - 2 freq
conseeders - 2 freq
consíder - 1 freq
consíders - 1 freq
‘considerable - 1 freq
chaingit - 1 freq
consiederable - 1 freq
coincided - 2 freq
consid'rit - 1 freq
canister - 3 freq
cannister - 1 freq
conneckit - 2 freq
constraint - 1 freq
canisters - 1 freq
conceits - 2 freq
constructively - 1 freq
consitutuency - 1 freq
consitutional - 1 freq
“chynged - 1 freq
cum-stained - 1 freq
chewingthefat - 1 freq
‘constancie - 4 freq
constancie - 3 freq
‘constanter - 1 freq
“constancie - 1 freq
‘constance - 1 freq
'changed - 1 freq
‘changed - 1 freq
considerin' - 1 freq
constitutes - 1 freq
coniched - 1 freq
changit - 2 freq
cammiescott - 1 freq
connswater - 2 freq
considine - 6 freq
constitution-writing - 1 freq
const - 1 freq
cjohnston - 1 freq
consteetuent - 1 freq
consteetutional - 1 freq
'constructively - 1 freq
constipation - 1 freq
connectionnature - 1 freq
cjohnstonni - 1 freq
cnhjyatt - 1 freq
chinascotlink - 1 freq
cxnseot - 1 freq
MetaPhone code - KNKT
konked - 1 freq
king'd - 1 freq
cannigait - 3 freq
concait - 6 freq
connect - 15 freq
gunkt - 2 freq
kingoodie - 1 freq
conked - 1 freq
concaitie - 1 freq
gunked - 1 freq
canigait - 1 freq
concaity - 1 freq
connekkit - 1 freq
connekit - 2 freq
kincaid - 1 freq
conneckit - 2 freq
gangt - 1 freq
gunkit - 1 freq
Manually curated
CONNECT
Time to execute Levenshtein function - 0.188691 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
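The definition above can be sketched with the usual dynamic-programming approach. This is a minimal illustration in Python, not the code this site runs; the function name `levenshtein` and the sample word list are chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions or deletions."""
    # previous[j] is the distance between the prefix of a seen so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


# Rank a few words from the results by their distance to "connect".
words = ["contact", "conneck", "bonnet", "connects", "connect"]
for word in sorted(words, key=lambda w: levenshtein("connect", w)):
    print(word, levenshtein("connect", word))
```

Only one row of the table is kept at a time, which keeps memory use proportional to the length of the second word.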
Time to execute Double Levenshtein function - 0.330328 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
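A rough sketch of that idea in Python follows. It assumes the vowels are exactly a, e, i, o and u (how the site treats y or accented letters is not stated), and the names `strip_vowels` and `double_levenshtein` are invented for this example.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, one table row at a time."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["connecit", "conneck", "connectet", "contact"]:
    print(word, double_levenshtein("connect", word))
```

Under these assumptions, connecit scores 1 (the inserted i disappears in the second pass) while conneck scores 2 (the changed consonant is counted twice), which agrees with the values listed for those words.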
Time to execute SoundEx function - 0.027630 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
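As an illustration of how such a code is built, here is a small Python sketch of the standard American Soundex rules. It is an assumption that the site follows these exact rules; the function name `soundex` and the choice to ignore non-ASCII letters are made for this example only.

```python
# Map each consonant to its Soundex digit; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character code: first letter plus up to three digits."""
    # Assumption: anything that is not an ASCII letter is ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so a repeated digit can follow
    return (code + "000")[:4]  # pad with zeros or cut to four characters


for word in ["connect", "chynged", "constant", "chemist"]:
    print(word, soundex(word))
```

All four sample words come out as C523 under these rules, which is why spellings as different as chynged and constant share a list with connect.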
Time to execute MetaPhone function - 0.037217 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000813 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
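A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical and chosen only to illustrate the shape of the table; the real lexicon is not shown on this page, and the names `LEXICON`, `lemma_for` and `variants_by_lemma` are invented for the example.

```python
from collections import defaultdict

# Hypothetical entries for illustration only, not the site's actual lexicon.
LEXICON = {
    "connect": "CONNECT",
    "conneck": "CONNECT",
    "connecit": "CONNECT",
    "connectit": "CONNECT",
}


def lemma_for(word: str):
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_by_lemma(lexicon: dict) -> dict:
    """Invert the table so each lemma lists its known spellings."""
    groups = defaultdict(list)
    for variant, lemma in lexicon.items():
        groups[lemma].append(variant)
    return dict(groups)


print(lemma_for("conneck"))   # a word covered by the sample table
print(lemma_for("donnert"))   # a word missing from the sample table
```

A dictionary lookup takes roughly constant time regardless of corpus size, which fits with this method reporting the shortest execution time of the five.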