A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gorgeous in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gorgeous (0) - 68 freq
georgeous (1) - 1 freq
porteous (2) - 5 freq
gorgons (2) - 1 freq
forgets (3) - 9 freq
grugous (3) - 2 freq
goges (3) - 5 freq
careous (3) - 1 freq
surgeons (3) - 4 freq
gordons (3) - 16 freq
forges (3) - 2 freq
glorious (3) - 58 freq
gorbel's (3) - 1 freq
george’s (3) - 2 freq
murgeons (3) - 2 freq
grous (3) - 2 freq
grievous (3) - 7 freq
oreos (3) - 1 freq
dormous (3) - 39 freq
rteous (3) - 3 freq
gorge (3) - 5 freq
georges (3) - 2 freq
george's (3) - 2 freq
courageous (3) - 3 freq
gurges (3) - 1 freq
gorgeous (0) - 68 freq
georgeous (1) - 1 freq
gurges (3) - 1 freq
grugous (3) - 2 freq
georges (3) - 2 freq
gorgons (3) - 1 freq
bourgeois (4) - 5 freq
gorie's (4) - 1 freq
ootrageous (4) - 3 freq
courageous (4) - 3 freq
gorge (4) - 5 freq
garages (4) - 5 freq
egregious (4) - 1 freq
gorgon (4) - 3 freq
gorehouse (4) - 1 freq
gouges (4) - 1 freq
gorged (4) - 2 freq
george's (4) - 2 freq
grous (4) - 2 freq
gregs (4) - 1 freq
porteous (4) - 5 freq
grievous (4) - 7 freq
forges (4) - 2 freq
goges (4) - 5 freq
griens (5) - 1 freq
SoundEx code - G622
graces - 16 freq
grasses - 8 freq
greeshoch - 3 freq
gorgeous - 68 freq
gorjis - 1 freq
georgeson - 14 freq
'georgeson - 1 freq
gresses - 7 freq
grace's - 3 freq
greasiest - 1 freq
georges - 2 freq
greeshach - 2 freq
'grices' - 1 freq
graciously - 5 freq
gorgeousist - 1 freq
greesaugh - 1 freq
greesauch - 2 freq
gracious - 9 freq
gress-keekers - 1 freq
grassic - 8 freq
garages - 5 freq
gressis - 1 freq
gurges - 1 freq
grugous - 2 freq
grieshoch - 1 freq
george's - 2 freq
grey-gizzed - 1 freq
georgeous - 1 freq
greesheugh - 1 freq
george’s - 2 freq
greigexvs - 10 freq
garyhughesie - 1 freq
gergaskman - 2 freq
georgegalloway - 5 freq
gersoise - 5 freq
georgiaaustinxx - 1 freq
greece's - 1 freq
“gurzies” - 1 freq
graces' - 1 freq
grice's - 1 freq
georgesq - 1 freq
gerrykeogh - 2 freq
'george's - 1 freq
georgecaulkin - 1 freq
georgecunningh - 1 freq
georgesfloyd - 2 freq
MetaPhone code - KRJS
cairridges - 1 freq
gorgeous - 68 freq
gorjis - 1 freq
cairriages - 4 freq
craigy's - 4 freq
courageous - 3 freq
craigie's - 1 freq
craigies - 3 freq
cairrages - 1 freq
grudges - 2 freq
carriages - 1 freq
garages - 5 freq
gurges - 1 freq
cragis - 1 freq
GORGEOUS
Time to execute Levenshtein function - 0.184416 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.367998 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027537 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041214 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000858 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.