A Novel Approach: Tokenization Framework based on Sentence Structure in Indonesian Language

This study proposes a new approach in the sentence tokenization process. Sentence tokenization, which is known so far, is the process of breaking sentences based on spaces as separators. Space-based sentence tokenization only generates single word tokens. In sentences consisting of five words, token...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of advanced computer science & applications Vol. 14; no. 2
Main Authors Petrus, Johannes, -, Ermatita, -, Sukemi, -, Erwin
Format Journal Article
LanguageEnglish
Published West Yorkshire Science and Information (SAI) Organization Limited 2023
Subjects
Online AccessGet full text
ISSN2158-107X
2156-5570
DOI10.14569/IJACSA.2023.0140264

Cover

Loading…