STEM Approach to Enhance Robot-Human Interaction Through AI Large Language Models and Reinforcement Learning

Humanoid Robots with their limitless capabilities have revolutionized the world. Their applications range from household assistance to advertising. As these technologies age however, the use of their sensors, motors, cameras, all become outdated; making previous humanoids a thing of the past. This p...

Full description

Saved in:
Bibliographic Details
Published inIntegrated STEM Education Conference (Online) pp. 1 - 2
Main Authors Shibi, Siddhartha, Zaidi, Sohail
Format Conference Proceeding
LanguageEnglish
Published IEEE 09.03.2024
Subjects
Online AccessGet full text
ISSN2473-7623
DOI10.1109/ISEC61299.2024.10665163

Cover

Abstract Humanoid Robots with their limitless capabilities have revolutionized the world. Their applications range from household assistance to advertising. As these technologies age however, the use of their sensors, motors, cameras, all become outdated; making previous humanoids a thing of the past. This project takes a STEM approach towards enhancing these robots by tackling the most crucial issue that such humanoids face; their adequacy in human-robot interactions. This study explores the promise of integrating LLMs (Large Language Models)-such as Google PaLM2 and ChatGPT-to supplement the capabilities of such robots; as well as bringing CoT (Chain of Thought). The subject of this project is the humanoid robot Pepper, by SoftBank Robotics, a popular robot designed to interact with humans; however, due to its weak natural language processing (NLP) capabilities, it struggles to adequately articulate responses in human-robot conversation. For instance, the robot was capable of easily listing responses to simple questions such as, "What is your name?", or "What are you", yet struggled with providing adequate responses to queries such as, "Who is the president of the United States", or "When is the next World Cup?". By AI/ML LLM integration, such questions were handled by using much improved LLMs in place of the previous built-in responses the robot had. This demonstration has been shown in the video that is uploaded at: https://youtu.be/hF7aRlQmnqs. Our approach targeted the main weak points of the robot; it's ability to provide responses to asked questions, and remembering prior questions/conversation. By intercepting the robot's own NLP Dialog Module, the asked prompt can be connected through a chatAdapter, bringing conversations to a chat database for context as well as LLM of choice. This approach, implemented through use of Android Studio to create an appropriate application for their procedure, addresses the contextual-based reasoning by pulling from the chat database as well as provides adequate responses limited only by the AI/ML model of choice. This project involved integrating ChatGPT /PaLM2 into Pepper's existing system to enable generation of more natural and engaging responses. In addition to the pre-existing development in bringing artificial intelligence into these humanoid robots, further work has been in the process; the aim being to develop a way for the robot to simultaneously extract other situational data from conversation such as facial and tonal expressions, bringing human feedback in order for responses to be further fine-tuned. Aside from the work-in-progress development of integrating RLHF (Reinforcement Learning with Human Feedback), the effectiveness of the aforementioned approach was further evaluated through a user study, comparing it with and without integration. The results indicated that integrating LLM/s into the robot's NLP system significantly improved its ability to generate more coherent responses, leading to more natural human-robot interactions. Overall, this presentation will demonstrate the potential of using LLMs to enhance the NLP capability of human robots like Pepper. It's believed that the proposed approach can pave the way for developing more intelligent human-robot interactions in the future.
AbstractList Humanoid Robots with their limitless capabilities have revolutionized the world. Their applications range from household assistance to advertising. As these technologies age however, the use of their sensors, motors, cameras, all become outdated; making previous humanoids a thing of the past. This project takes a STEM approach towards enhancing these robots by tackling the most crucial issue that such humanoids face; their adequacy in human-robot interactions. This study explores the promise of integrating LLMs (Large Language Models)-such as Google PaLM2 and ChatGPT-to supplement the capabilities of such robots; as well as bringing CoT (Chain of Thought). The subject of this project is the humanoid robot Pepper, by SoftBank Robotics, a popular robot designed to interact with humans; however, due to its weak natural language processing (NLP) capabilities, it struggles to adequately articulate responses in human-robot conversation. For instance, the robot was capable of easily listing responses to simple questions such as, "What is your name?", or "What are you", yet struggled with providing adequate responses to queries such as, "Who is the president of the United States", or "When is the next World Cup?". By AI/ML LLM integration, such questions were handled by using much improved LLMs in place of the previous built-in responses the robot had. This demonstration has been shown in the video that is uploaded at: https://youtu.be/hF7aRlQmnqs. Our approach targeted the main weak points of the robot; it's ability to provide responses to asked questions, and remembering prior questions/conversation. By intercepting the robot's own NLP Dialog Module, the asked prompt can be connected through a chatAdapter, bringing conversations to a chat database for context as well as LLM of choice. This approach, implemented through use of Android Studio to create an appropriate application for their procedure, addresses the contextual-based reasoning by pulling from the chat database as well as provides adequate responses limited only by the AI/ML model of choice. This project involved integrating ChatGPT /PaLM2 into Pepper's existing system to enable generation of more natural and engaging responses. In addition to the pre-existing development in bringing artificial intelligence into these humanoid robots, further work has been in the process; the aim being to develop a way for the robot to simultaneously extract other situational data from conversation such as facial and tonal expressions, bringing human feedback in order for responses to be further fine-tuned. Aside from the work-in-progress development of integrating RLHF (Reinforcement Learning with Human Feedback), the effectiveness of the aforementioned approach was further evaluated through a user study, comparing it with and without integration. The results indicated that integrating LLM/s into the robot's NLP system significantly improved its ability to generate more coherent responses, leading to more natural human-robot interactions. Overall, this presentation will demonstrate the potential of using LLMs to enhance the NLP capability of human robots like Pepper. It's believed that the proposed approach can pave the way for developing more intelligent human-robot interactions in the future.
Author Shibi, Siddhartha
Zaidi, Sohail
Author_xml – sequence: 1
  givenname: Siddhartha
  surname: Shibi
  fullname: Shibi, Siddhartha
  email: in.siddhartha@gmail.com
  organization: Washington High School,Fremont,USA
– sequence: 2
  givenname: Sohail
  surname: Zaidi
  fullname: Zaidi, Sohail
  email: syed.zaidi@sjsu.edu
  organization: Mechanical Engineering San Jose State University,San Jose,USA
BookMark eNo1kMtKAzEYhaMoqLVvIJgXmJrkn9yWpYx2YIrQzr6k6T8XaZOSmS58ewfUzTln9cH5nshdiAEJeeVswTmzb-WuWCkurF0IJvIFZ0pJruCGzK22BiQDKQzjt-RR5BoyrQQ8kPkwfDHGgGutDDyS064uNnR5uaTofEfHSIvQueCRbuMhjtn6enaBlmHE5PzYx0DrLsVr29FlSSuXWpwytFc3jU084mmgLhzpFvvQxOTxjGGkFboU-tA-k_vGnQac__WM1O9FvVpn1edHuVpWWW_5mBkmuASFMm9QGX84GNQAHgU01oGVzEmm3XTZ-ibPpdTW5swJI7XBZhIDM_Lyi-0RcX9J_dml7_2_H_gBZpFa_g
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISEC61299.2024.10665163
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Education
EISBN 9798350352801
EISSN 2473-7623
EndPage 2
ExternalDocumentID 10665163
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-i91t-8021536e54fe68cbb8e733ce23f9a3950a507a0249cf445579940a28578ef1093
IEDL.DBID RIE
IngestDate Wed Aug 27 02:00:09 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i91t-8021536e54fe68cbb8e733ce23f9a3950a507a0249cf445579940a28578ef1093
PageCount 2
ParticipantIDs ieee_primary_10665163
PublicationCentury 2000
PublicationDate 2024-March-9
PublicationDateYYYYMMDD 2024-03-09
PublicationDate_xml – month: 03
  year: 2024
  text: 2024-March-9
  day: 09
PublicationDecade 2020
PublicationTitle Integrated STEM Education Conference (Online)
PublicationTitleAbbrev ISEC
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003177683
Score 1.8650972
Snippet Humanoid Robots with their limitless capabilities have revolutionized the world. Their applications range from household assistance to advertising. As these...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Human-robot interaction
Humanoid
Humanoid robots
Large language models
LLMs
NLP
Oral communication
Reinforcement learning
RLHF
Robot vision systems
Title STEM Approach to Enhance Robot-Human Interaction Through AI Large Language Models and Reinforcement Learning
URI https://ieeexplore.ieee.org/document/10665163
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1dS8MwFA1uTz75NfGbPPjaujYfbR7H6NjEDdkq7G0kaaKitKLdi7_em7SbKAi-lFIohNyEc3Ny7rkIXVNmJTcQASIjEUB-q4IU1lEAJwcea0oLq92N7nTGxw_0dsmWbbG6r4UxxnjxmQndq7_LLyq9dlQZ7HDOGSQQHdSBddYUa20JFQBCSJ1Jq-GK-uJmssiGAODC1aPENNz8_aOPioeR0R6abQbQqEdewnWtQv35y5vx3yPcR73vij18v8WiA7RjykPXkblVbxyh10WeTfGgdRDHdYWz8slFHM8rVdWBJ_Ox5webUgecNx188GCC75xcHJ4NtYld_7TXDyzLAs-Nd17VnmTErVnrYw_loywfjoO200LwLKIaUAqAn3DDqDU81UqlJiFEm5hYIYlgfQlZo3TmgtpSylgiBO3LOIXdbqzzozpG3bIqzQnCSiaKMKmEKGJaRFZFPCk4nMmkLCxL-6eo52Zt9dZ4aaw2E3b2x_dztOuC51Vf4gJ16_e1uYQ0oFZXPvxfcUewOA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1dS8MwFA06H_TJr4nf5sHX1rX5aPM4Rsem25Ctgm8jSVMVRyvavfjrvUm7iYLgSymFQshNODcn556L0DVlueQGIkBkIDzIb5UXwzry4OTAQ01plmt7ozue8MEDvX1kj02xuquFMcY48Znx7au7y89KvbRUGexwzhkkEJtoC4Cfsrpca02pABRC8kwaFVfQETfDWdIDCBe2IiWk_ur_H51UHJD0d9FkNYRaP_LqLyvl689f7oz_HuMean_X7OH7NRrtow1THNiezI1-4xAtZmkyxt3GQxxXJU6KZxtzPC1VWXmOzseOIayLHXBa9_DB3SEeWcE4PGtyE9sOaosPLIsMT43zXtWOZsSNXetTG6X9JO0NvKbXgvciggpwCqCfcMNobnislYpNRIg2IcmFJIJ1JOSN0toL6pxSxiIhaEeGMex3k1tHqiPUKsrCHCOsZKQIk0qILKRZkKuARxmHU5mUWc7izglq21mbv9VuGvPVhJ3-8f0KbQ_S8Wg-Gk7uztCODaTTgIlz1Krel-YCkoJKXbql8AXauLOF
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Integrated+STEM+Education+Conference+%28Online%29&rft.atitle=STEM+Approach+to+Enhance+Robot-Human+Interaction+Through+AI+Large+Language+Models+and+Reinforcement+Learning&rft.au=Shibi%2C+Siddhartha&rft.au=Zaidi%2C+Sohail&rft.date=2024-03-09&rft.pub=IEEE&rft.eissn=2473-7623&rft.spage=1&rft.epage=2&rft_id=info:doi/10.1109%2FISEC61299.2024.10665163&rft.externalDocID=10665163