A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Words similar to gcurryphotos in the corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein - distance in parentheses
gcurryphotos (0) - 1 freq
gryphons (5) - 1 freq
currots (5) - 1 freq
curroos (5) - 1 freq
gryphon (6) - 335 freq
'photos (6) - 1 freq
cairry-oots (6) - 1 freq
current's (6) - 1 freq
scurry's (6) - 1 freq
currants (6) - 7 freq
photos (6) - 40 freq
natgeophotos (6) - 1 freq
carry-ons (6) - 1 freq
coreyhook (6) - 7 freq
currys (6) - 1 freq
gerrydotp (6) - 1 freq
scurtfoos (6) - 3 freq
carrots (6) - 30 freq
scurrilous (6) - 2 freq
claypotts (6) - 8 freq
curlysoo (6) - 1 freq
robgrayphoto (6) - 1 freq
cerry-oot (6) - 1 freq
burrows (7) - 5 freq
xsimxphoto (7) - 1 freq
Double Levenshtein - distance in parentheses
gcurryphotos (0) - 1 freq
currots (8) - 1 freq
gryphons (8) - 1 freq
carrots (9) - 30 freq
cairry-oots (9) - 1 freq
currants (9) - 7 freq
curroos (9) - 1 freq
prophits (10) - 24 freq
corruptit (10) - 2 freq
scripts (10) - 3 freq
corrupt (10) - 13 freq
corruption (10) - 27 freq
corrects (10) - 1 freq
cairrots (10) - 7 freq
carruthers (10) - 1 freq
grumphies (10) - 14 freq
graphics (10) - 3 freq
proaphits (10) - 8 freq
cairpets (10) - 7 freq
crumpets (10) - 1 freq
corruptin (10) - 1 freq
corrputin (10) - 1 freq
prophetis (10) - 1 freq
graphs (10) - 6 freq
graphemes (10) - 3 freq
SoundEx code - G613
grabbed - 77 freq
grafts - 3 freq
grubbed - 1 freq
graft - 35 freq
graived - 1 freq
grafters - 3 freq
gruppit - 23 freq
grippit - 58 freq
gravat - 6 freq
grupt - 2 freq
grabbit - 28 freq
grupped - 8 freq
gruppt - 10 freq
griped - 1 freq
gripped - 19 freq
gravedigger - 11 freq
graavit - 2 freq
graftin - 6 freq
gravity - 29 freq
grippt - 3 freq
grieved - 3 freq
grave-diggin - 1 freq
grafted - 4 freq
graffiti - 7 freq
gropit - 1 freq
gruftalo - 1 freq
grabbid - 3 freq
gravediggers - 3 freq
gravediggin - 1 freq
garevitch - 1 freq
gript - 1 freq
garbed - 3 freq
grouped - 1 freq
grafton - 1 freq
groped - 1 freq
grafting - 2 freq
gravitaetional - 1 freq
gravietaetional - 1 freq
gravity's - 1 freq
grippid - 1 freq
grafter - 1 freq
gravit - 4 freq
grave-digger - 1 freq
grevit - 1 freq
gravitas - 2 freq
gravedigger'd - 1 freq
garbutt - 1 freq
groaped - 1 freq
graffitologists - 1 freq
gravitation - 1 freq
graved - 1 freq
grabed - 1 freq
gravitatit - 1 freq
gravitate - 1 freq
gravits - 1 freq
graftinthemourn - 1 freq
griffiths - 1 freq
gcurryphotos - 1 freq
garyfooty - 1 freq
MetaPhone code - KKRFTS
gcurryphotos - 1 freq
Manually curated - GCURRYPHOTOS
Time to execute Levenshtein function - 0.358829 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
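As a rough illustration of the definition above, here is a minimal Python sketch of the distance using the usual dynamic-programming table, with every replace, insert or delete costing 1. The function names `levenshtein` and `nearest` are made up for this example, and this is not necessarily how the site computes its figures.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions or
    deletions needed to turn a into b (each edit costs 1)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    # previous[j] = distance between the empty string and the first j chars of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or free if equal
            ))
        previous = current
    return previous[-1]


def nearest(query, vocabulary, limit=5):
    """Return (word, distance) pairs from vocabulary, closest first."""
    scored = [(word, levenshtein(query, word)) for word in vocabulary]
    return sorted(scored, key=lambda pair: pair[1])[:limit]


# A handful of words taken from the result list on this page.
words = ["photos", "gryphons", "currots", "carrots"]
for word, distance in nearest("gcurryphotos", words):
    print(f"{word} ({distance})")
```

Ranking a whole vocabulary this way means one distance computation per word, which is why the lookup is timed separately for each method.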
Time to execute Double Levenshtein function - 0.567276 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
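The idea can be sketched in Python as follows. The names `strip_vowels` and `double_levenshtein` are invented for this example, and the vowel set is an assumption: treating y as a vowel is a guess, although it is consistent with distances in the list above such as currots (8) and gryphons (8).

```python
VOWELS = set("aeiouy")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance; each replace, insert or delete costs 1."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(strip_vowels("gcurryphotos"))
print(double_levenshtein("gcurryphotos", "currots"))
```

Because consonant edits show up in both passes and vowel edits only in the first, spelling variants that differ mainly in their vowels rank closer than words with different consonants.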
Time to execute SoundEx function - 0.036215 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
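For illustration, a minimal Python sketch of the classic four-character Soundex code follows. It assumes the standard American Soundex rules (h and w do not separate repeated codes, vowels do) and ignores anything that is not a letter a to z; the site's own function may differ in such details.

```python
# Consonant groups that share a Soundex digit.
GROUPS = ["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"]
CODES = {ch: str(digit)
         for digit, letters in enumerate(GROUPS, start=1)
         for ch in letters}


def soundex(word: str) -> str:
    """Return the first letter plus up to three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of identical codes
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # vowels reset this to "", allowing a repeat
    return result.ljust(4, "0")


for word in ["gcurryphotos", "grabbed", "garyfooty"]:
    print(word, soundex(word))
```

Only the first few consonant sounds survive in the code, which is why words as different as grabbed and garyfooty land in the same bucket as gcurryphotos.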
Time to execute MetaPhone function - 0.043452 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001027 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
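A lookup of this kind can be sketched in Python as a plain dictionary. The entries, the upper-case lemma convention and the name `lemma_for` are all hypothetical, chosen only to show the shape of the table; they are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical, hand-written entries: spelling variant -> lemma.
LEXICON = {
    "carrots": "CARROT",
    "cairrots": "CARROT",
    "photos": "PHOTO",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("cairrots"))
print(lemma_for("gcurryphotos"))
```

A dictionary lookup takes constant time, which fits with this method being by far the fastest of the five timings, at the cost of covering only the words someone has entered by hand.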