A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to guillemots in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
guillemots (0) - 11 freq
guilletmot (2) - 1 freq
gimlets (4) - 1 freq
gullert (4) - 18 freq
guidless (4) - 3 freq
gigglymots (4) - 1 freq
billets (4) - 3 freq
fillets (4) - 3 freq
guiltless (4) - 1 freq
guller't (4) - 2 freq
guillys (4) - 3 freq
guiless (4) - 1 freq
giglets (4) - 1 freq
giblets (4) - 1 freq
gullet (4) - 5 freq
bullets (4) - 29 freq
williemoir (5) - 2 freq
shallots (5) - 1 freq
bulletins (5) - 2 freq
gillon (5) - 1 freq
trillions (5) - 1 freq
keillers (5) - 1 freq
gisless (5) - 1 freq
guzzlers (5) - 1 freq
gollops (5) - 1 freq
guillemots (0) - 11 freq
guilletmot (4) - 1 freq
giblets (6) - 1 freq
guillys (6) - 3 freq
bullets (6) - 29 freq
gallants (6) - 1 freq
gullet (6) - 5 freq
giglets (6) - 1 freq
gullert (6) - 18 freq
fillets (6) - 3 freq
gimlets (6) - 1 freq
gigglymots (6) - 1 freq
billets (6) - 3 freq
galleys (7) - 3 freq
gillypants (7) - 149 freq
gollywogs (7) - 1 freq
diplomats (7) - 3 freq
galleries (7) - 8 freq
gallers (7) - 25 freq
illums (7) - 1 freq
mallets (7) - 5 freq
gullies (7) - 3 freq
gillhaus (7) - 1 freq
collects (7) - 2 freq
galileo's (7) - 1 freq
SoundEx code - G453
glint - 29 freq
glent - 30 freq
glentin - 34 freq
glents - 6 freq
glints - 6 freq
glintin - 15 freq
glentit - 8 freq
glo-in-the-daurk - 1 freq
glinted - 2 freq
glented - 3 freq
gleamed - 4 freq
glands - 2 freq
glunterpuddin - 4 freq
gallantly - 2 freq
guillemots - 11 freq
gleened - 1 freq
glinderin - 4 freq
glunta-flesh - 1 freq
glentan - 1 freq
glaumed - 2 freq
glinderan - 1 freq
glammed - 1 freq
glentsome - 1 freq
glendale - 3 freq
glendinning's - 1 freq
glinder - 4 freq
gluntin - 1 freq
glintit - 1 freq
glentin- - 1 freq
gallant - 3 freq
glendonnen's - 1 freq
glendonnen - 1 freq
glenty - 1 freq
glintie - 1 freq
glintan - 1 freq
€˜gallantry - 1 freq
gallantry - 1 freq
gleaned - 1 freq
glinda - 7 freq
glendinning - 4 freq
gillanders - 1 freq
glentoran - 1 freq
glnutxxiri - 1 freq
glendarroch - 1 freq
glenda's - 1 freq
gallants - 1 freq
glowmeet - 1 freq
gillen'oot - 1 freq
gaelamadan - 1 freq
MetaPhone code - KLMTS
climates - 6 freq
guillemots - 11 freq
climate's - 1 freq
calamitous - 1 freq
GUILLEMOTS
Time to execute Levenshtein function - 0.189328 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.365327 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029205 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044348 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001361 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.