A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ane-an-twuntie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ane-an-twuntie (0) - 1 freq
echt-an-twuntie (4) - 1 freq
threi-an-twuntiet (5) - 1 freq
five-an-echtie (7) - 1 freq
near-white (7) - 1 freq
aye-an-oan (7) - 2 freq
ane-nite (7) - 1 freq
ane-tae-ane (7) - 4 freq
twuntie (7) - 8 freq
janeannie (7) - 1 freq
fower-an-twenty (7) - 1 freq
twintie (8) - 7 freq
inanimitie (8) - 1 freq
kinnen-huntin (8) - 1 freq
eiss-shintie (8) - 1 freq
a-wantin (8) - 1 freq
aidanctweets (8) - 1 freq
inchantment (8) - 1 freq
announcin (8) - 4 freq
reconstructit (8) - 1 freq
anesthetic (8) - 1 freq
annuncit (8) - 1 freq
agentbailie (8) - 1 freq
wanchauncie (8) - 2 freq
lang-wundit (8) - 1 freq
ane-an-twuntie (0) - 1 freq
echt-an-twuntie (7) - 1 freq
threi-an-twuntiet (9) - 1 freq
fower-an-twenty (10) - 1 freq
on-an-on (11) - 1 freq
twuntie (11) - 8 freq
anointment (11) - 1 freq
ane-nite (11) - 1 freq
near-white (11) - 1 freq
ane-tae-ane (11) - 4 freq
aye-an-oan (11) - 2 freq
twentie (12) - 1 freq
open-windae (12) - 1 freq
undauntit (12) - 2 freq
in-betweenies (12) - 2 freq
meattwenty (12) - 1 freq
up-an-rinnin (12) - 1 freq
inconstant (12) - 1 freq
annotatit (12) - 1 freq
annette (12) - 1 freq
eontentit (12) - 1 freq
untentie (12) - 2 freq
near-front (12) - 1 freq
ae-ane-yin-wan (12) - 1 freq
no-taen-wi-it (12) - 1 freq
SoundEx code - A553
amoont - 44 freq
anyhin'd - 1 freq
amount - 50 freq
anent - 529 freq
amanda - 14 freq
ammunition - 8 freq
annandale - 2 freq
animated - 6 freq
amounts - 6 freq
'amanda - 1 freq
amanda's - 1 freq
'amanda's - 1 freq
'amanda'll - 1 freq
anoint - 1 freq
amendit - 2 freq
amoonts - 10 freq
ane-nite - 1 freq
amenities - 4 freq
ammonite - 1 freq
amminadab - 7 freq
anointit - 2 freq
anointing - 1 freq
amends - 3 freq
anninted - 2 freq
annint - 1 freq
anunder - 44 freq
annand - 10 freq
amounted - 1 freq
anundir - 1 freq
anointment - 1 freq
an'intherdependent - 1 freq
animatin' - 1 freq
anynted - 1 freq
an'under - 3 freq
animate - 1 freq
animation - 6 freq
amendment - 3 freq
anyont - 1 freq
anointin - 1 freq
ainimatit - 1 freq
amounting - 1 freq
amountin - 1 freq
€œanent - 1 freq
anyntit - 1 freq
amend - 3 freq
ane-an-twuntie - 1 freq
amoontae - 1 freq
amandafbelfast - 1 freq
amandamacaula - 2 freq
annanwaterlamb - 1 freq
animatit - 3 freq
ammonites - 1 freq
amandamcdonn - 2 freq
MetaPhone code - ANNTWNT
ane-an-twuntie - 1 freq
ANE-AN-TWUNTIE
Time to execute Levenshtein function - 0.426397 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.786692 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027917 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.085961 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000931 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.