SIMD 명령어를 활용한 정렬된 데이터에 대한 효율적인 그룹 연산 처리

With advances in computing technology, most computers now support vector processing that was previously only supported by certain supercomputers which were made up of vector processors. In data analytics applications, there is a high demand for the ability to group data by specific criteria, which r...

Full description

Saved in:

Bibliographic Details
Published in	정보과학회 컴퓨팅의 실제 논문지 Vol. 24; no. 7; pp. 375 - 380
Main Authors	이훈순(Hun Soon Lee), 김창수(Chang Su Kim)
Format	Journal Article
Language	Korean
Published	Korean Institute of Information Scientists and Engineers 2018 한국정보과학회
Subjects	컴퓨터학 vectorization 벡터화 SIMD group operation vector processing 그룹 연산. 벡터처리
Online Access	Get full text

Cover

Loading…

More Information
Summary:	With advances in computing technology, most computers now support vector processing that was previously only supported by certain supercomputers which were made up of vector processors. In data analytics applications, there is a high demand for the ability to group data by specific criteria, which requires a lot of time. In this paper, we propose a method for the efficient processing of group operations on sorted data through vector processing using SIMD instructions. We applied the proposed method to the group operation of MonetDB which is a column-oriented DBMS and confirmed that the performance of the group operation in MonetDB was improved through the experiment using the TPC-H query. 근래의 대부분의 컴퓨터에서는 컴퓨팅 기술의 발달로 벡터 처리 유닛이 포함됨으로 인해 과거에 벡터 프로세서로 구성된 일부 슈퍼 컴퓨터들만이 지원했던 벡터 처리 기능을 지원하고 있다. 데이터 분석 응용에서는 데이터를 특정 기준으로 그룹핑하는 요구가 많으며 이는 많은 시간을 필요로 한다. 본 논문에서는 SIMD 명령어를 활용하여 정렬된 데이터에 대한 그룹 연산을 벡터 처리 기반으로 효율적으로 처리하는 방법을 제안한다. 제안하는 방법을 열기반 DBMS인 MonetDB의 그룹 연산에 적용한 후 실험을 통해 그룹 연산의 성능이 스칼라 기반 방법에 비해 향상되었음을 확인하였다.
ISSN:	2383-6318 2383-6326
DOI:	10.5626/KTCP.2018.24.7.375