A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pillagin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pillagin (0) - 3 freq
pillaging (1) - 1 freq
killagan (2) - 4 freq
pillowin (2) - 1 freq
pillage (2) - 4 freq
pillaged (2) - 2 freq
villain (2) - 3 freq
pingin (3) - 3 freq
pilla (3) - 11 freq
gillan (3) - 5 freq
villaij (3) - 1 freq
pollin (3) - 10 freq
pallin (3) - 2 freq
pleasin (3) - 11 freq
plaein (3) - 1 freq
pillion (3) - 1 freq
willin (3) - 59 freq
pilin (3) - 9 freq
plungin (3) - 3 freq
milligan (3) - 4 freq
pullin (3) - 107 freq
villainy (3) - 1 freq
follaein (3) - 171 freq
willan (3) - 5 freq
pilgim (3) - 1 freq
pillagin (0) - 3 freq
pillaging (2) - 1 freq
pillage (3) - 4 freq
pillaged (3) - 2 freq
pillowin (3) - 1 freq
killagan (3) - 4 freq
pillion (4) - 1 freq
plaguin (4) - 1 freq
milligan (4) - 4 freq
pallin (4) - 2 freq
pullin (4) - 107 freq
plungin (4) - 3 freq
pluggin (4) - 3 freq
pledgin (4) - 1 freq
pollutin (4) - 3 freq
pollin (4) - 10 freq
villain (4) - 3 freq
pullan (4) - 6 freq
plain (5) - 178 freq
village (5) - 163 freq
playin (5) - 347 freq
pillas (5) - 3 freq
packagin (5) - 6 freq
claagin (5) - 2 freq
bilangin (5) - 1 freq
SoundEx code - P425
plashin - 3 freq
pleasin - 11 freq
pleisance - 1 freq
placename - 1 freq
placin - 17 freq
pluggin - 3 freq
plasma - 5 freq
pleisantly - 1 freq
pluckin - 4 freq
pleisant - 2 freq
pleughman - 2 freq
pleesant - 2 freq
plaisant - 5 freq
polisman's - 1 freq
polisman - 22 freq
place-mats - 1 freq
pilgim - 1 freq
polishin' - 2 freq
pluckin' - 1 freq
polyshen - 1 freq
pleughing - 1 freq
pleughman's - 1 freq
pleasant - 19 freq
pulsin - 9 freq
polishin - 10 freq
ploughman - 9 freq
poliswummin - 1 freq
pleasanter - 1 freq
pollisman - 2 freq
pelicans - 2 freq
pulsing - 3 freq
plaisin - 1 freq
polkemmet - 2 freq
polisnut - 2 freq
place-names - 10 freq
placenames - 4 freq
place-name - 2 freq
ploughin - 2 freq
polismen - 2 freq
polygamists - 3 freq
polygamist - 1 freq
placement - 6 freq
policeman - 3 freq
pleasing - 6 freq
pluckan - 1 freq
placements - 5 freq
placenaimes - 2 freq
policyan' - 1 freq
placenaime - 1 freq
placin' - 1 freq
pillagin - 3 freq
pleasan - 1 freq
plaessment - 1 freq
'placement' - 1 freq
plaessments - 2 freq
plesaunce - 1 freq
pleasance - 3 freq
polishing - 3 freq
pleesantries - 1 freq
play-sangs - 1 freq
policy-makkars - 1 freq
pliss-nem - 1 freq
plaesant - 3 freq
polissman - 2 freq
pleisand - 1 freq
policymakkin - 1 freq
pleasantries - 1 freq
pilsner - 1 freq
placing - 2 freq
pillaging - 1 freq
plaisent - 1 freq
pleasantly - 1 freq
€œpolishin - 1 freq
plesant - 2 freq
place-nemmes - 2 freq
policeman's - 1 freq
plaguin - 1 freq
pljmxstz - 1 freq
plusnethelp - 1 freq
plusnet - 1 freq
pleasence - 1 freq
paulajmossie - 1 freq
polygonbooks - 1 freq
placenamesni - 4 freq
policing - 1 freq
MetaPhone code - PLJN
pledgin - 1 freq
pillagin - 3 freq
PILLAGIN
Time to execute Levenshtein function - 0.321686 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.539260 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.042885 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.050881 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001139 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.