A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to zodiac in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
zodiac (0) - 1 freq
dic (3) - 1 freq
'odin (3) - 1 freq
soda (3) - 20 freq
modils (3) - 1 freq
odige (3) - 1 freq
lndia (3) - 1 freq
poliacs (3) - 1 freq
oodie (3) - 1 freq
topic (3) - 25 freq
disc (3) - 5 freq
pontiac (3) - 1 freq
sonia (3) - 2 freq
onic (3) - 1 freq
logic (3) - 22 freq
india' (3) - 1 freq
dia (3) - 2 freq
modal (3) - 7 freq
bondic (3) - 1 freq
jovial (3) - 1 freq
doric (3) - 480 freq
ionic (3) - 2 freq
zinc (3) - 5 freq
colic (3) - 4 freq
ioniae (3) - 1 freq
Double Levenshtein (summed distance in brackets)
zodiac (0) - 1 freq
zadar (4) - 4 freq
medic (4) - 3 freq
zac (4) - 4 freq
zijc (4) - 1 freq
dac (4) - 1 freq
doac (4) - 19 freq
zodr (4) - 1 freq
zinc (4) - 5 freq
zdxc (4) - 1 freq
bodice (4) - 1 freq
dic (4) - 1 freq
bodies (5) - 184 freq
dial (5) - 36 freq
today (5) - 161 freq
sonic (5) - 6 freq
xdc (5) - 1 freq
joda (5) - 3 freq
sodas (5) - 1 freq
oda (5) - 5 freq
bodine (5) - 1 freq
dia- (5) - 1 freq
india (5) - 21 freq
zoink (5) - 16 freq
soda (5) - 20 freq
SoundEx code - Z320
zits - 1 freq
zadok - 4 freq
zkots - 4 freq
zxqtkeai - 1 freq
zodiac - 1 freq
zdxc - 1 freq
zwtc - 1 freq
zqtj - 2 freq
zsyitc - 1 freq
MetaPhone code - STK
stookie - 25 freq
stuck - 342 freq
steek - 20 freq
stick - 377 freq
sticky - 27 freq
steak - 46 freq
stock - 102 freq
stag - 50 freq
'stag - 1 freq
stake - 31 freq
steck - 1 freq
staggie - 1 freq
stack - 50 freq
'steek - 1 freq
stak - 9 freq
stakk - 5 freq
s-t-a-k-e - 1 freq
stooky - 2 freq
stocky - 7 freq
sadko - 1 freq
stoic - 8 freq
stuckie - 3 freq
steik - 10 freq
stuik - 1 freq
stoke - 6 freq
stik - 5 freq
stuk - 1 freq
stook - 9 freq
sutck - 1 freq
'stick - 4 freq
steg - 1 freq
staak - 1 freq
zadok - 4 freq
stïck - 4 freq
st'ak - 1 freq
stuggy - 1 freq
sie-dug' - 1 freq
stuggie - 1 freq
staek - 3 freq
stikk - 1 freq
staig - 3 freq
stickie - 2 freq
stic - 1 freq
stuc - 2 freq
stug - 1 freq
‘stick - 1 freq
stukk - 2 freq
stok - 2 freq
stoukie - 1 freq
steekie - 5 freq
“stick - 2 freq
sudoku - 1 freq
stag- - 1 freq
stike - 1 freq
xxtk - 1 freq
xdc - 1 freq
zodiac - 1 freq
stick' - 1 freq
zwtc - 1 freq
soteag - 2 freq
stc - 1 freq
cedk - 1 freq
Manually curated
ZODIAC
Time to execute Levenshtein function - 0.197435 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
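The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code the site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character inserts, deletes and replacements
    needed to turn word a into word b."""
    # previous[j] = distance between the prefix of a handled so far
    # (initially empty) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletes
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("zodiac", "zinc"))   # 3, as in the Levenshtein list
print(levenshtein("zodiac", "topic"))  # 3
```

Both values agree with the distances shown in brackets in the Levenshtein results for zodiac.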
Time to execute Double Levenshtein function - 0.375258 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
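A rough sketch of that idea in Python follows. It is an illustration only: the page does not say which letters count as vowels, so treating a, e, i, o and u as the vowels is an assumption, and the names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: the page does not list its vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (inserts, deletes, replacements)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("zodiac", "zinc"))   # 3 + 1 = 4
print(double_levenshtein("zodiac", "today"))  # 3 + 2 = 5
```

Under that vowel assumption the sketch gives the same scores as the Double Levenshtein list for zinc (4) and today (5).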
Time to execute SoundEx function - 0.027860 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
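To show how a code such as Z320 comes about, here is a small Python sketch of the common American Soundex rules. It is not the function the site calls, and the name `soundex` and the handling of non a-z characters (they are simply ignored) are assumptions made for the example.

```python
# Letter groups that share a Soundex digit; vowels, h, w and y have none.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()          # first letter is kept as a letter
    last = CODES.get(letters[0], "")   # but its digit still blocks repeats
    for ch in letters[1:]:
        if ch in "hw":
            continue                   # h and w do not separate equal digits
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit                   # a vowel resets this to ""
    return code.ljust(4, "0")          # pad short codes with zeros


print(soundex("zodiac"))  # Z320
print(soundex("zkots"))   # Z320
```

Words whose consonants fall into the same digit groups in the same order, such as zodiac and zkots, end up with the same code, which is why they appear together in the SoundEx list.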
Time to execute MetaPhone function - 0.037919 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001330 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
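A lookup of this kind can be pictured as a plain dictionary. The sketch below is hypothetical: the table name `LEXICON`, the function `lemma_for` and the entry for the misspelling "zodiak" are invented for illustration, and only the zodiac to ZODIAC pairing reflects what this page appears to show.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
LEXICON = {
    "zodiac": "ZODIAC",  # pairing suggested by the result on this page
    "zodiak": "ZODIAC",  # invented typo entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Zodiak"))  # ZODIAC
print(lemma_for("ahint"))   # None, because this sketch has no entry for it
```

A single dictionary lookup needs no comparison against the rest of the vocabulary, which is consistent with this method having the shortest execution time reported above.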