Kurdish standard EMNIST-like character dataset

A dataset was created by collecting handwritten samples of distinct Kurdish characters. The dataset consists primarily of 58 characters, and approximately 3800 adult volunteers who are native Kurdish speakers participated in the collection process. Each participant was requested to fill two rows in...

Full description

Saved in:

Bibliographic Details
Published in	Data in brief Vol. 52; p. 110038
Main Authors	Majeed, Hamsa D., Nariman, Goran Saman, Azeez, Renas Sardar, Abdulqadir, Bawar Bilal
Format	Journal Article
Language	English
Published	Netherlands Elsevier Inc 01.02.2024 Elsevier
Subjects	Central Kurdish Data Handwritten character recognition Kurdish characters Kurdish handwritten Optical character recognition Kurdish handwritten Kurdish characters Handwritten character recognition Optical character recognition Central Kurdish
Online Access	Get full text

Cover

Loading…

More Information
Summary:	A dataset was created by collecting handwritten samples of distinct Kurdish characters. The dataset consists primarily of 58 characters, and approximately 3800 adult volunteers who are native Kurdish speakers participated in the collection process. Each participant was requested to fill two rows in a character form printed on A4 landscape papers. These papers were divided into sets of four pages, with 18 columns and 10 rows of characters on each page, except for the fourth page in each set, which had 40 cells. To ensure a comprehensive dataset, over 760 sets were prepared and distributed across various universities and institutions. The collected samples underwent scanning, cropping, and preprocessing procedures following the characteristics established by the EMNIST project. The purpose of these procedures was to standardize the dataset and ensure uniformity in the representation of all characters.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	2352-3409 2352-3409
DOI:	10.1016/j.dib.2024.110038