A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pakistan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pakistan (0) - 3 freq
pakistani (1) - 2 freq
paintan (2) - 2 freq
turkistan (3) - 2 freq
pairtin (3) - 19 freq
taisten (3) - 1 freq
castan (3) - 1 freq
christan (3) - 1 freq
prodistan (3) - 2 freq
insistan (3) - 1 freq
pastin (3) - 2 freq
pantan (3) - 3 freq
papists (3) - 2 freq
lampstan (3) - 1 freq
pastas (3) - 1 freq
waittan (3) - 1 freq
vanishan (3) - 1 freq
fake-tan (3) - 1 freq
partisan (3) - 3 freq
wastan (3) - 1 freq
paintin (3) - 45 freq
resistan (3) - 1 freq
pakis (3) - 1 freq
raisan (3) - 3 freq
kistin (3) - 3 freq
pakistan (0) - 3 freq
pakistani (1) - 2 freq
pastin (4) - 2 freq
kistin (4) - 3 freq
piston (4) - 2 freq
paintan (4) - 2 freq
passan (5) - 7 freq
puritan (5) - 2 freq
partan (5) - 17 freq
faisten (5) - 1 freq
gakstoun (5) - 1 freq
paisty (5) - 1 freq
caistin (5) - 1 freq
paist (5) - 1 freq
pointan (5) - 6 freq
inkstaun (5) - 2 freq
peston (5) - 2 freq
palestyne (5) - 1 freq
palestine (5) - 4 freq
postin (5) - 21 freq
kestin (5) - 2 freq
waistin (5) - 1 freq
pokiest (5) - 1 freq
papist (5) - 3 freq
lairistan (5) - 3 freq
SoundEx code - P223
pokiest - 1 freq
possessed - 16 freq
psychiatrist's - 1 freq
psychiatric - 6 freq
psychiatrically - 1 freq
pakkaged - 1 freq
'psychotic - 1 freq
psychotic - 3 freq
pakistan - 3 freq
pakistani - 2 freq
psychedelic - 2 freq
psychiatry - 1 freq
poashoxter - 1 freq
psychiatrists - 1 freq
poke-saiddil - 1 freq
poshest - 2 freq
psychiatrist - 1 freq
packaged - 2 freq
pussycat - 3 freq
psychotherapy - 1 freq
possesed - 1 freq
possesst - 1 freq
pish-stained - 2 freq
pakaged - 1 freq
psychodrama - 1 freq
posessed - 1 freq
MetaPhone code - PKSTN
pakistan - 3 freq
pakistani - 2 freq
paxton - 2 freq
PAKISTAN
Time to execute Levenshtein function - 0.308682 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.509118 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029461 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039880 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001075 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.