📦 @bntk/pos

UniversalPOSTag​

Defined in: index.ts:11

Universal POS tags based on the Universal Dependencies (UD) v2 specification. Source: https://universaldependencies.org/u/pos/

Enumeration Members​

| Enumeration Member | Value | Description | Defined in |
| --- | --- | --- | --- |
| ADJ | "ADJ" | Adjectives are words that typically modify nouns. Example: বড়/boro, সুন্দর/sundor, নতুন/notun | index.ts:14 |
| ADP | "ADP" | Adpositions are prepositions and postpositions. Example: মধ্যে/moddhe, থেকে/theke, উপরে/upore | index.ts:17 |
| ADV | "ADV" | Adverbs are words that typically modify verbs, adjectives or other adverbs. Example: খুব/khub, ধীরে/dhire, ভালোভাবে/bhalobhabe | index.ts:20 |
| AUX | "AUX" | Auxiliary verbs are used to form tenses, moods, etc. Example: আছে/ache, হয়/hoi, হবে/hobe | index.ts:23 |
| CCONJ | "CCONJ" | Coordinating conjunctions connect words, phrases, clauses of equal status. Example: এবং/ebong, কিন্তু/kintu, অথবা/othoba | index.ts:26 |
| DET | "DET" | Determiners are words that modify nouns or noun phrases. Example: এই/ei, সেই/sei, কোন/kon | index.ts:29 |
| INTJ | "INTJ" | Interjections are exclamatory words. Example: ওহ/oh, বাহ/bah, হায়/hay | index.ts:32 |
| NOUN | "NOUN" | Nouns are words denoting all physical objects and materials. Example: বই/boi, মানুষ/manush, বাড়ি/bari | index.ts:35 |
| NUM | "NUM" | Numerals represent numbers, quantities, etc. Example: এক/ek, দুই/dui, প্রথম/prothom | index.ts:38 |
| PART | "PART" | Particles are function words that must be associated with another word. Example: না/na, তো/to, কি/ki | index.ts:41 |
| PRON | "PRON" | Pronouns substitute for nouns or noun phrases. Example: আমি/ami, তুমি/tumi, সে/se | index.ts:44 |
| PROPN | "PROPN" | Proper nouns are names of specific persons, places, organizations. Example: ঢাকা/dhaka, রবীন্দ্রনাথ/robindronath, বাংলাদেশ/bangladesh | index.ts:47 |
| PUNCT | "PUNCT" | Punctuation marks. Example: ।, ?, !, , | index.ts:50 |
| SCONJ | "SCONJ" | Subordinating conjunctions link dependent clauses to independent ones. Example: যদি/jodi, কারণ/karon, যখন/jokhon | index.ts:53 |
| SYM | "SYM" | Symbols represent currency, math operators, etc. Example: ৳, +, = | index.ts:56 |
| VERB | "VERB" | Verbs denote actions and processes. Example: যাই/jai, খাই/khai, পড়ি/pori | index.ts:59 |
| X | "X" | Other words that don't fit into above categories. Example: ইত্যাদি/ittyadi, প্রভৃতি/probhriti | index.ts:62 |

TaggedWord​

Defined in: index.ts:68

Represents a word with its POS tag and additional linguistic features

Properties​

| Property | Type | Description | Defined in |
| --- | --- | --- | --- |
| features? | Record<string, string> | Additional linguistic features (e.g., gender, number, case) | index.ts:74 |
| tag | UniversalPOSTag | The POS tag for the word | index.ts:72 |
| word | string | The original word | index.ts:70 |
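
To make the shape of TaggedWord concrete, here is a minimal sketch of two values that satisfy it, one without and one with the optional features property. The enum and interface are redeclared locally only so the snippet runs on its own; real code would presumably import them from @bntk/pos. The feature keys and values, and the featureOf helper, are illustrative assumptions rather than part of the documented API.

```typescript
// Local mirrors of the documented types so the sketch runs standalone.
enum UniversalPOSTag {
  NOUN = "NOUN",
  PRON = "PRON",
}

interface TaggedWord {
  word: string;
  tag: UniversalPOSTag;
  features?: Record<string, string>;
}

// `features` is optional, so both objects satisfy the interface.
const plain: TaggedWord = { word: "বই", tag: UniversalPOSTag.NOUN };
const detailed: TaggedWord = {
  word: "আমি",
  tag: UniversalPOSTag.PRON,
  // Illustrative keys and values only; the real feature inventory is not documented here.
  features: { number: "singular", person: "first" },
};

// Hypothetical helper: read a feature, falling back when it is absent.
function featureOf(tw: TaggedWord, key: string, fallback = "unknown"): string {
  return tw.features?.[key] ?? fallback;
}

console.log(featureOf(detailed, "number")); // singular
console.log(featureOf(plain, "number")); // unknown
```

Reading features through a helper like this avoids repeating the undefined check at every call site.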

getPOSTagFrequencies()​

function getPOSTagFrequencies(taggedWords): Map<UniversalPOSTag, number>;

Defined in: index.ts:196

Counts how often each POS tag occurs in a sequence of tagged words

Parameters​

| Parameter | Type | Description |
| --- | --- | --- |
| taggedWords | TaggedWord[] | Array of tagged words |

Returns​

Map<UniversalPOSTag, number>

Map of POS tags to their frequencies

Description​

This function analyzes a sequence of tagged words and returns a map showing how many times each POS tag appears.

Example​

import { getPOSTagFrequencies, UniversalPOSTag } from "@bntk/pos";

const tagged = [
  { word: "আমি", tag: UniversalPOSTag.PRON },
  { word: "বাংলায়", tag: UniversalPOSTag.ADP },
  { word: "গান", tag: UniversalPOSTag.NOUN },
  { word: "গাই", tag: UniversalPOSTag.VERB },
];
const frequencies = getPOSTagFrequencies(tagged);
console.log(frequencies);
// Output: Map(4) {
//   UniversalPOSTag.PRON => 1,
//   UniversalPOSTag.ADP => 1,
//   UniversalPOSTag.NOUN => 1,
//   UniversalPOSTag.VERB => 1
// }
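
The returned map is not ordered by frequency, so finding the most common tags takes one more step. The sketch below shows one way to count and then rank tags. countTags is a hypothetical stand-in written only so the snippet runs without the package (the actual implementation of getPOSTagFrequencies is not shown in this reference), and rankTags is likewise an assumed helper.

```typescript
// Local mirrors of the documented types so the sketch runs standalone.
enum UniversalPOSTag {
  ADP = "ADP",
  NOUN = "NOUN",
  PRON = "PRON",
  VERB = "VERB",
}

interface TaggedWord {
  word: string;
  tag: UniversalPOSTag;
}

// Hypothetical stand-in for getPOSTagFrequencies: one counter per tag.
function countTags(taggedWords: TaggedWord[]): Map<UniversalPOSTag, number> {
  const counts = new Map<UniversalPOSTag, number>();
  for (const { tag } of taggedWords) {
    counts.set(tag, (counts.get(tag) ?? 0) + 1);
  }
  return counts;
}

// Assumed helper: sort the map entries from most to least frequent.
function rankTags(counts: Map<UniversalPOSTag, number>): [UniversalPOSTag, number][] {
  return Array.from(counts.entries()).sort((a, b) => b[1] - a[1]);
}

const sample: TaggedWord[] = [
  { word: "আমি", tag: UniversalPOSTag.PRON },
  { word: "বই", tag: UniversalPOSTag.NOUN },
  { word: "পড়ি", tag: UniversalPOSTag.VERB },
  { word: "গান", tag: UniversalPOSTag.NOUN },
];

const ranked = rankTags(countTags(sample));
console.log(ranked[0]); // [ 'NOUN', 2 ]
```

Tags that never occur are simply absent from the map, so callers should treat a missing key as a count of zero.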

tagText()​

function tagText(text): TaggedWord[];

Defined in: index.ts:162

Tags a Bangla text string with parts of speech

Parameters​

| Parameter | Type | Description |
| --- | --- | --- |
| text | string | The Bangla text to tag |

Returns​

TaggedWord[]

Array of words with their POS tags and features

Description​

This function first tokenizes the input text into words and then performs POS tagging on the resulting word sequence.

Example​

import { tagText } from "@bntk/pos";

const text = "আমি বাংলায় গান গাই";
const tagged = tagText(text);
console.log(tagged);
// Output: [
//   { word: "আমি", tag: UniversalPOSTag.PRON },
//   { word: "বাংলায়", tag: UniversalPOSTag.ADP },
//   { word: "গান", tag: UniversalPOSTag.NOUN },
//   { word: "গাই", tag: UniversalPOSTag.VERB }
// ]
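
The tokenize-then-tag pipeline described above can be sketched as follows. This is not the package's tokenizer, whose rules are not documented here: tokenize is a hypothetical whitespace-and-punctuation splitter, tagTextSketch is an assumed wrapper, and punctuationOnly is a placeholder tagger used purely so the snippet runs without @bntk/pos.

```typescript
// Hypothetical tokenizer: splits on whitespace and separates common
// punctuation, including the Bangla danda "।", into its own token.
function tokenize(input: string): string[] {
  return input
    .replace(/([।?!,])/g, " $1 ")
    .split(/\s+/)
    .filter((token) => token.length > 0);
}

// Tokenize first, then hand the whole word sequence to a tagger.
// The tagger is a parameter so the sketch does not depend on the real tagWords().
function tagTextSketch<T>(input: string, tagger: (words: string[]) => T[]): T[] {
  return tagger(tokenize(input));
}

// Placeholder tagger for demonstration only: it flags punctuation and nothing else.
const punctuationOnly = (words: string[]) =>
  words.map((word) => ({ word, tag: /^[।?!,]$/.test(word) ? "PUNCT" : "X" }));

console.log(tokenize("আমি বাংলায় গান গাই।"));
// [ 'আমি', 'বাংলায়', 'গান', 'গাই', '।' ]
console.log(tagTextSketch("গান গাই।", punctuationOnly).map((t) => t.tag));
// [ 'X', 'X', 'PUNCT' ]
```

Passing the full token array to the tagger, rather than tagging token by token, keeps neighbouring words available for any context-dependent decisions.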

tagWord()​

function tagWord(word): TaggedWord;

Defined in: index.ts:102

Tags a single Bangla word with its part of speech

Parameters​

| Parameter | Type | Description |
| --- | --- | --- |
| word | string | The Bangla word to tag |

Returns​

TaggedWord

The word with its POS tag and features

Description​

This function performs POS tagging for a single Bangla word. It uses a combination of rules and dictionary lookup to determine the most likely POS tag for the given word.

Examples​

import { tagWord } from "@bntk/pos";

const noun = tagWord("বাংলা");
console.log(noun);
// Output: { word: "বাংলা", tag: UniversalPOSTag.NOUN }

const adjective = tagWord("সুন্দর");
console.log(adjective);
// Output: { word: "সুন্দর", tag: UniversalPOSTag.ADJ }
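
As a rough illustration of how dictionary lookup and rules can be combined, the sketch below checks a tiny lexicon first and falls back to a single suffix rule. Everything in it is an assumption made for illustration: the lexicon contents, the rule that a word ending in ভাবে (-bhabe) is treated as an adverb, the fallback to X, and the name tagWordSketch. The dictionary and rules actually used by tagWord() are not described in this reference.

```typescript
// Local mirrors of the documented types so the sketch runs standalone.
enum UniversalPOSTag {
  ADJ = "ADJ",
  ADV = "ADV",
  NOUN = "NOUN",
  X = "X",
}

interface TaggedWord {
  word: string;
  tag: UniversalPOSTag;
}

// Toy lexicon with two entries taken from the examples above.
const LEXICON = new Map<string, UniversalPOSTag>([
  ["বাংলা", UniversalPOSTag.NOUN],
  ["সুন্দর", UniversalPOSTag.ADJ],
]);

function tagWordSketch(word: string): TaggedWord {
  // Step 1: an exact dictionary hit wins.
  const known = LEXICON.get(word);
  if (known !== undefined) return { word, tag: known };

  // Step 2: toy suffix rule, e.g. ভালোভাবে/bhalobhabe is tagged as an adverb.
  if (word.length > 4 && word.endsWith("ভাবে")) {
    return { word, tag: UniversalPOSTag.ADV };
  }

  // Step 3: nothing matched, so fall back to the catch-all tag.
  return { word, tag: UniversalPOSTag.X };
}

console.log(tagWordSketch("সুন্দর").tag); // ADJ
console.log(tagWordSketch("ভালোভাবে").tag); // ADV
console.log(tagWordSketch("ঢাকা").tag); // X
```

Checking the dictionary before the rules means a listed word is never overridden by a suffix that merely looks productive.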

tagWords()​

function tagWords(words): TaggedWord[];

Defined in: index.ts:134

Tags a sequence of Bangla words with their parts of speech

Parameters​

| Parameter | Type | Description |
| --- | --- | --- |
| words | string[] | Array of Bangla words to tag |

Returns​

TaggedWord[]

Array of words with their POS tags and features

Description​

This function performs POS tagging for a sequence of Bangla words. It takes into account the context of surrounding words to improve tagging accuracy.

Example​

import { tagWords } from "@bntk/pos";

const words = ["আমি", "বাংলায়", "গান", "গাই"];
const tagged = tagWords(words);
console.log(tagged);
// Output: [
//   { word: "আমি", tag: UniversalPOSTag.PRON },
//   { word: "বাংলায়", tag: UniversalPOSTag.ADP },
//   { word: "গান", tag: UniversalPOSTag.NOUN },
//   { word: "গাই", tag: UniversalPOSTag.VERB }
// ]
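
One way surrounding words can inform a tag is a second pass over context-free guesses. The sketch below is a toy under stated assumptions, not the algorithm used by tagWords(), which is not documented here. It assumes a three-word postposition list, and a single rule that an otherwise unknown word directly before a postposition is retagged as a noun. The names guess and tagWordsSketch are hypothetical.

```typescript
// Local mirrors of the documented types so the sketch runs standalone.
enum UniversalPOSTag {
  ADP = "ADP",
  NOUN = "NOUN",
  X = "X",
}

interface TaggedWord {
  word: string;
  tag: UniversalPOSTag;
}

// Postpositions taken from the ADP examples above.
const POSTPOSITIONS = new Set<string>(["থেকে", "উপরে", "মধ্যে"]);

// Pass 1: context-free guess for a single word.
function guess(word: string): UniversalPOSTag {
  return POSTPOSITIONS.has(word) ? UniversalPOSTag.ADP : UniversalPOSTag.X;
}

// Pass 2: toy contextual rule. An unknown word right before a postposition
// is assumed to be a noun.
function tagWordsSketch(words: string[]): TaggedWord[] {
  const firstPass: TaggedWord[] = words.map((word) => ({ word, tag: guess(word) }));
  return firstPass.map((current, i) => {
    const next = firstPass[i + 1];
    if (
      current.tag === UniversalPOSTag.X &&
      next !== undefined &&
      next.tag === UniversalPOSTag.ADP
    ) {
      return { word: current.word, tag: UniversalPOSTag.NOUN };
    }
    return current;
  });
}

console.log(tagWordsSketch(["বাড়ি", "থেকে"]).map((t) => t.tag)); // [ 'NOUN', 'ADP' ]
console.log(tagWordsSketch(["বাড়ি"]).map((t) => t.tag)); // [ 'X' ]
```

The second call shows why the sequence matters: the same word receives a different tag once its neighbour is missing.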