A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Words similar to "girls" in the corpus

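Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. The number in brackets is the distance from the search word.

Levenshtein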
girls (0) - 58 freq
girss (1) - 16 freq
girlw (1) - 1 freq
giros (1) - 1 freq
birls (1) - 31 freq
girl's (1) - 1 freq
dirls (1) - 7 freq
pirls (1) - 4 freq
girns (1) - 19 freq
girl (1) - 73 freq
girly (1) - 1 freq
gills (1) - 8 freq
gurls (1) - 1 freq
nirls (1) - 1 freq
girds (1) - 5 freq
girs (1) - 4 freq
girl' (1) - 1 freq
tirls (1) - 4 freq
girls' (1) - 2 freq
girdles (2) - 1 freq
giess (2) - 4 freq
rls (2) - 7 freq
gies (2) - 501 freq
aircs (2) - 1 freq
tirlt (2) - 2 freq
Double Levenshtein
girls (0) - 58 freq
gurls (1) - 1 freq
nirls (2) - 1 freq
gills (2) - 8 freq
girds (2) - 5 freq
girl' (2) - 1 freq
grals (2) - 1 freq
girls' (2) - 2 freq
girly (2) - 1 freq
girs (2) - 4 freq
tirls (2) - 4 freq
girss (2) - 16 freq
girl (2) - 73 freq
girlw (2) - 1 freq
birls (2) - 31 freq
giros (2) - 1 freq
girl's (2) - 1 freq
girns (2) - 19 freq
dirls (2) - 7 freq
pirls (2) - 4 freq
gar's (3) - 2 freq
earls (3) - 12 freq
girssy (3) - 3 freq
gorms (3) - 2 freq
hurls (3) - 4 freq
SoundEx code - G642
growls - 14 freq
girls - 58 freq
gurls - 1 freq
garlic - 13 freq
grolsch - 1 freq
gralloch - 2 freq
grilsie - 1 freq
gralloched - 3 freq
gorilla's - 1 freq
girl's - 1 freq
grulshes - 1 freq
gorillas - 2 freq
girls' - 2 freq
garlogie - 1 freq
graylicht - 1 freq
gairleke - 1 freq
grallochin - 1 freq
garrulous - 1 freq
grals - 1 freq
girlchampjinty - 2 freq
grulsh - 1 freq
grølek - 1 freq
grealish - 1 freq
gerrywilson - 1 freq
girlssexygirls - 3 freq
MetaPhone code - JRLS
girls - 58 freq
jarls - 11 freq
jarl's - 3 freq
girl's - 1 freq
girls' - 2 freq
jarl’s - 1 freq
Manually curated - GIRLS
Time to execute Levenshtein function - 0.339941 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
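The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the code the site itself runs; the function name `levenshtein` is an assumption made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


for word in ("girls", "girss", "girl", "girdles", "gies"):
    print(word, levenshtein("girls", word))
```

Only two rows of the table are kept in memory at a time, which is enough because each cell depends only on the current and the previous row.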
Time to execute Double Levenshtein function - 0.401035 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
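A rough sketch of that idea follows. It is an illustration under stated assumptions rather than the site's implementation: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set (a, e, i, o, u) is a guess, since the page does not say which letters are stripped.

```python
VOWELS = frozenset("aeiou")  # assumption: the page does not define the vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, keeping consonants and punctuation."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("gurls", "nirls", "girl", "earls"):
    print(word, double_levenshtein("girls", word))
```

A vowel-only change such as gurls costs nothing in the second pass, so it stays closer to girls than a consonant change such as nirls, which is counted twice.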
Time to execute SoundEx function - 0.027233 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
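As an illustration of how such a code is built, here is a minimal sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The function name `soundex` is an assumption, and the site's own function may differ in edge cases such as non-ASCII letters, which this sketch simply ignores.

```python
# Standard Soundex digit groups; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if there are no letters."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        # Emit a digit only when it differs from the previous coded letter.
        if digit and digit != prev:
            code += digit
        # h and w do not separate repeats; vowels reset the previous digit.
        if ch not in "hw":
            prev = digit
    return (code + "000")[:4]


for word in ("girls", "gurls", "growls", "garlic"):
    print(word, soundex(word))
```

All four example words share the code G642 shown on this page, which is why words as different as growls and garlic end up in the same list.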
Time to execute MetaPhone function - 0.036995 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000799 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
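A lookup of this kind can be sketched as a plain dictionary. The entries and the names `LEXICON` and `lemma_for` below are invented for illustration and are not taken from the corpus's actual lexicon; returning `None` models a word the table does not cover.

```python
from typing import Optional

# Hypothetical hand-made entries: surface form -> lemma.
LEXICON = {
    "girls": "GIRLS",
    "gurls": "GIRLS",   # spelling variation (illustrative entry)
    "girlw": "GIRLS",   # obvious typo (illustrative entry)
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("girls"))
print(lemma_for("sonsie"))
```

Because this is a single dictionary access with no string comparison, it is the cheapest of the five methods, in line with the timings reported above.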