STEM Approach to Enhance Robot-Human Interaction Through AI Large Language Models and Reinforcement Learning

Bibliographic Details
Published in	Integrated STEM Education Conference (Online), pp. 1-2
Main Authors	Shibi, Siddhartha; Zaidi, Sohail
Format	Conference Proceeding
Language	English
Published	IEEE, 9 March 2024
Subjects	Human-robot interaction; Humanoid; Humanoid robots; Large language models; LLMs; NLP; Oral communication; Reinforcement learning; RLHF; Robot vision systems
ISSN	2473-7623
DOI	10.1109/ISEC61299.2024.10665163

Abstract	Humanoid robots, with their wide-ranging capabilities, have revolutionized the world. Their applications range from household assistance to advertising. As these technologies age, however, their sensors, motors, and cameras become outdated, making earlier humanoids a thing of the past. This project takes a STEM approach to enhancing these robots by tackling the most crucial issue they face: their adequacy in human-robot interaction. The study explores the promise of integrating large language models (LLMs), such as Google PaLM2 and ChatGPT, to supplement the capabilities of such robots, as well as introducing chain-of-thought (CoT) reasoning. The subject of the project is Pepper, a popular humanoid robot from SoftBank Robotics that is designed to interact with humans. Because of its weak natural language processing (NLP) capabilities, however, it struggles to articulate adequate responses in human-robot conversation. For instance, the robot could easily answer simple questions such as "What is your name?" or "What are you?", yet struggled to respond adequately to queries such as "Who is the president of the United States?" or "When is the next World Cup?". With LLM integration, such questions were handled by much-improved LLMs in place of the robot's built-in responses. A demonstration is shown in the video uploaded at https://youtu.be/hF7aRlQmnqs. Our approach targeted the robot's main weak points: its ability to respond to questions and to remember prior questions and conversation. By intercepting the robot's own NLP Dialog Module, the spoken prompt is routed through a chatAdapter, which passes the conversation to a chat database for context as well as to the LLM of choice.
This approach, implemented as an Android application built in Android Studio, addresses context-based reasoning by pulling from the chat database and provides responses whose quality is limited only by the chosen AI/ML model. The project involved integrating ChatGPT/PaLM2 into Pepper's existing system to enable more natural and engaging responses. Beyond this existing work on bringing artificial intelligence into humanoid robots, further work is in progress: the aim is to let the robot simultaneously extract other situational data from a conversation, such as facial and tonal expressions, so that human feedback can be used to fine-tune responses further. Aside from the work-in-progress integration of RLHF (Reinforcement Learning with Human Feedback), the effectiveness of the approach was evaluated through a user study comparing the robot with and without LLM integration. The results indicated that integrating LLMs into the robot's NLP system significantly improved its ability to generate coherent responses, leading to more natural human-robot interactions. Overall, this presentation will demonstrate the potential of using LLMs to enhance the NLP capability of humanoid robots like Pepper. We believe the proposed approach can pave the way for more intelligent human-robot interactions in the future.
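The routing described in the abstract (intercepted prompt, chat adapter, chat database for context, LLM of choice) can be illustrated with a minimal sketch. This is not the authors' implementation, which is an Android application. It is a Python illustration under stated assumptions: the class name `ChatAdapter`, the `turns` table layout, the `max_turns` window, and the `echo_backend` stand-in are all hypothetical, an in-memory SQLite database stands in for the chat database, and no real LLM service is called.

```python
import sqlite3
from typing import Callable, List, Tuple

# Hypothetical type: any function that maps a list of (role, text) turns to a reply.
LLMBackend = Callable[[List[Tuple[str, str]]], str]


class ChatAdapter:
    """Routes an intercepted prompt through a chat database to an LLM backend."""

    def __init__(self, backend: LLMBackend, max_turns: int = 10) -> None:
        self.backend = backend
        self.max_turns = max_turns
        # Assumption: an in-memory SQLite table stands in for the chat database.
        self.db = sqlite3.connect(":memory:")
        self.db.execute(
            "CREATE TABLE turns ("
            "id INTEGER PRIMARY KEY AUTOINCREMENT, role TEXT, text TEXT)"
        )

    def _store(self, role: str, text: str) -> None:
        self.db.execute("INSERT INTO turns (role, text) VALUES (?, ?)", (role, text))
        self.db.commit()

    def _history(self) -> List[Tuple[str, str]]:
        # Fetch the most recent turns, then restore chronological order.
        rows = self.db.execute(
            "SELECT role, text FROM turns ORDER BY id DESC LIMIT ?",
            (self.max_turns,),
        ).fetchall()
        return list(reversed(rows))

    def handle(self, prompt: str) -> str:
        """Store the prompt, query the backend with recent context, store the reply."""
        self._store("user", prompt)
        reply = self.backend(self._history())
        self._store("robot", reply)
        return reply


def echo_backend(turns: List[Tuple[str, str]]) -> str:
    """Stand-in for a real LLM call; reports how much context it received."""
    last_user_text = turns[-1][1]
    return f"[{len(turns)} turns of context] You asked: {last_user_text}"


if __name__ == "__main__":
    adapter = ChatAdapter(echo_backend)
    print(adapter.handle("Who is the president of the United States?"))
    print(adapter.handle("When is the next World Cup?"))
```

Because the backend is an ordinary callable, swapping one model for another would only mean passing a different function to `ChatAdapter`, which mirrors the abstract's point that response quality is limited only by the model chosen.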
Author	Shibi, Siddhartha (Washington High School, Fremont, USA; in.siddhartha@gmail.com); Zaidi, Sohail (Mechanical Engineering, San Jose State University, San Jose, USA; syed.zaidi@sjsu.edu)
Discipline	Education
EISBN	9798350352801
Genre	orig-research
IsPeerReviewed	false
IsScholarly	false
URI	https://ieeexplore.ieee.org/document/10665163