A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to orcas in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
orcas (0) - 5 freq
morcas (1) - 1 freq
orras (1) - 1 freq
orca (1) - 7 freq
sorras (2) - 6 freq
orra's (2) - 1 freq
eras (2) - 2 freq
orams (2) - 1 freq
oriam (2) - 1 freq
orrae (2) - 1 freq
ora (2) - 1 freq
areas (2) - 94 freq
croas (2) - 1 freq
arcs (2) - 1 freq
lucas (2) - 2 freq
cas (2) - 4 freq
orra (2) - 172 freq
orts (2) - 3 freq
orrals (2) - 13 freq
oreat (2) - 1 freq
craas (2) - 6 freq
bras (2) - 2 freq
marcas (2) - 3 freq
orchis (2) - 1 freq
or's (2) - 1 freq
orcas (0) - 5 freq
arcs (2) - 1 freq
rcs (2) - 2 freq
orca (2) - 7 freq
morcas (2) - 1 freq
orras (2) - 1 freq
or's (3) - 1 freq
raas (3) - 10 freq
marcas (3) - 3 freq
fracas (3) - 2 freq
orchis (3) - 1 freq
oercam (3) - 1 freq
arras (3) - 24 freq
races (3) - 17 freq
aircs (3) - 1 freq
becas (3) - 5 freq
oreos (3) - 1 freq
forces (3) - 42 freq
incas (3) - 1 freq
cas (3) - 4 freq
lucas (3) - 2 freq
areas (3) - 94 freq
orts (3) - 3 freq
orams (3) - 1 freq
eras (3) - 2 freq
SoundEx code - O622
ower-sized - 1 freq
oersicht - 1 freq
orchestrator - 1 freq
orcas - 5 freq
o'ercast - 1 freq
owercaist - 2 freq
orchestrae's - 1 freq
ower-cosy - 1 freq
orgasm - 1 freq
owersicht - 7 freq
orgies - 2 freq
owerseas - 5 freq
orchestra' - 1 freq
owerseys - 8 freq
owersexed - 1 freq
owresicht - 6 freq
orchestra - 4 freq
owreseas - 4 freq
orchis - 1 freq
owerkeikin - 1 freq
'owersicht' - 1 freq
orchestration - 1 freq
owersichts - 1 freq
owercast - 2 freq
orchestral - 1 freq
orchestrated - 1 freq
MetaPhone code - ORKS
orcas - 5 freq
ooihrox - 3 freq
ORCAS
Time to execute Levenshtein function - 0.315151 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.633294 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.081937 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.100102 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001201 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.