Abstract
This document presents the BALAGHAScore.com Arabic Word Tokenisation Scheme, a customised set of rules for segmenting Arabic text into word units. The resulting word units serve as the basis for rhetorical density calculations such as the BALAGHA Score.
Publication Info
- Year: 2025
- Type: article
- Citations: 0
- Access: Closed
Identifiers
- DOI: 10.64393/balagha-score.tokenisation-v0.1.0