A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gorillas in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gorillas (0) - 2 freq
gorilla (1) - 7 freq
gorilla's (1) - 1 freq
tortillas (2) - 2 freq
arillas (2) - 1 freq
gillis (3) - 1 freq
thrills (3) - 6 freq
movilla's (3) - 1 freq
grillin (3) - 2 freq
gorbel's (3) - 1 freq
godzilla (3) - 1 freq
pillas (3) - 3 freq
gorie's (3) - 1 freq
grill (3) - 5 freq
drills (3) - 14 freq
gill's (3) - 2 freq
rills (3) - 1 freq
trills (3) - 2 freq
grilled (3) - 2 freq
gills (3) - 8 freq
gorblins (3) - 1 freq
gallas (3) - 1 freq
follas (3) - 1 freq
gormless (3) - 9 freq
formulas (3) - 2 freq
gorillas (0) - 2 freq
gorilla's (2) - 1 freq
gorilla (2) - 7 freq
arillas (3) - 1 freq
grilled (4) - 2 freq
trills (4) - 2 freq
rills (4) - 1 freq
gallas (4) - 1 freq
guillys (4) - 3 freq
grille (4) - 1 freq
drills (4) - 14 freq
gorbals (4) - 15 freq
frills (4) - 1 freq
gills (4) - 8 freq
gillis (4) - 1 freq
grillin (4) - 2 freq
tortillas (4) - 2 freq
grill (4) - 5 freq
trolls (5) - 6 freq
frillies (5) - 1 freq
girls (5) - 59 freq
grals (5) - 1 freq
girdles (5) - 1 freq
rolls (5) - 96 freq
grullyan (5) - 1 freq
SoundEx code - G642
growls - 14 freq
girls - 59 freq
gurls - 1 freq
garlic - 13 freq
grolsch - 1 freq
gralloch - 2 freq
grilsie - 1 freq
gralloched - 3 freq
gorilla's - 1 freq
girl's - 1 freq
grulshes - 1 freq
gorillas - 2 freq
girls' - 2 freq
garlogie - 1 freq
graylicht - 1 freq
gairleke - 1 freq
grallochin - 1 freq
garrulous - 1 freq
grals - 1 freq
girlchampjinty - 2 freq
grulsh - 1 freq
grølek - 1 freq
grealish - 1 freq
gerrywilson - 1 freq
girlssexygirls - 3 freq
MetaPhone code - KRLS
careless - 10 freq
curls - 37 freq
carroll's - 1 freq
growls - 14 freq
gurls - 1 freq
creels - 24 freq
carlie's - 1 freq
carlo's - 2 freq
carles - 6 freq
creel's - 3 freq
grilsie - 1 freq
carlos - 3 freq
gorilla's - 1 freq
gorillas - 2 freq
curlew's - 1 freq
crowls - 1 freq
carl's - 1 freq
kerless - 1 freq
crewless - 1 freq
crawls - 1 freq
craals - 1 freq
carols - 3 freq
quarrels - 1 freq
creils - 1 freq
crawlies - 1 freq
garrulous - 1 freq
grals - 1 freq
craalies - 3 freq
karla’s - 1 freq
carrolls - 1 freq
curlysoo - 1 freq
carl’s - 1 freq
curlews - 1 freq
GORILLAS
Time to execute Levenshtein function - 0.334294 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.613375 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030888 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040734 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000962 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.