Stochastic Learning and Optimization: A Sensitivity-Based Approach

Bibliographic Details
Main Author	Cao, Xi-Ren
Format	eBook Book
Language	English
Published	New York Springer-Verlag 2007 Springer Springer US
Edition	1st ed.
Subjects	Artificial Intelligence Calculus of Variations and Optimal Control; Optimization Computer Science Control and Systems Theory Datenverarbeitung Discrete Mathematics in Computer Science Engineering Design Learning models (Stochastic processes) Lernendes System Mathematical optimization Modèles stochastiques d'apprentissage Optimierung Optimisation mathématique Performanz (Linguistik) Probability Theory and Stochastic Processes Stochastisches System Technisches System Computer Informatik

Abstract	"Performance optimization is vital in the design and operation of modern engineering systems, including communications, manufacturing, robotics, and logistics. Most engineering systems are too complicated to model, or the system parameters cannot be easily identified, so learning techniques have to be applied. This is a multi-disciplinary area which has been attracting wide attention across many disciplines. Areas such as perturbation analysis (PA) in discrete event dynamic systems (DEDSs), Markov decision processes (MDPs) in operations research, reinforcement learning (RL) or neuro-dynamic programming (NDP) in computer science, identification and adaptive control (IAC) in control systems, share the common goal: to make the 'best decision' to optimize system performance. This book provides a unified framework based on a sensitivity point of view. It also introduces new approaches and proposes new research topics within this sensitivity-based framework."
AbstractList	Performance optimization is vital in the design and operation of modern engineering systems. This book provides a unified framework based on a sensitivity point of view. It introduces new approaches and proposes new research topics. "Performance optimization is vital in the design and operation of modern engineering systems, including communications, manufacturing, robotics, and logistics. Most engineering systems are too complicated to model, or the system parameters cannot be easily identified, so learning techniques have to be applied. This is a multi-disciplinary area which has been attracting wide attention across many disciplines. Areas such as perturbation analysis (PA) in discrete event dynamic systems (DEDSs), Markov decision processes (MDPs) in operations research, reinforcement learning (RL) or neuro-dynamic programming (NDP) in computer science, identification and adaptive control (IAC) in control systems, share the common goal: to make the 'best decision' to optimize system performance. This book provides a unified framework based on a sensitivity point of view. It also introduces new approaches and proposes new research topics within this sensitivity-based framework."
Author	Cao, Xi-Ren
Author_xml	– sequence: 1 fullname: Cao, Xi-Ren
BackLink	https://cir.nii.ac.jp/crid/1130282269408728064$$DView record in CiNii
ContentType	eBook Book
Copyright	Springer-Verlag US 2007
Copyright_xml	– notice: Springer-Verlag US 2007
DBID	08O RYH
DEWEY	519.23
DOI	10.1007/978-0-387-69082-7
DatabaseName	ciando eBooks CiNii Complete
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Mathematics Computer Science Applied Sciences
EISBN	0387690824 9780387690827
Edition	1st ed. 1
ExternalDocumentID	139397 EBC337875 BA83735788 ciando236329
GroupedDBID	08O 0D6 0DA 38. 7M 7P A4I AABBV AABFA AAHDE AAUKK ACFGI ADQVG ADVHH AETDV AEZAY AGNDD AHMWK ALMA_UNASSIGNED_HOLDINGS AZZ BBABE BG CZZ IEZ JJU LZA MYL NUC NUP SAO SBO Z7X Z83 Z88 Z8R Z8W Z92 -T. 0E8 AAJYQ AATVQ ABBUY ABCYT ABMNI ACAMX ACBPT ACDTA ACDUY AEHEY AEJLV AEKFX AEVYL AHNNE ATJMZ E6I RYH TPJZQ
ID	FETCH-LOGICAL-a44325-e50252562bb30edb4e54445a5b43aaba42b519d077f9733b8dabe9c5108953f83
ISBN	038736787X 9780387367873
IngestDate	Tue Jul 29 20:32:04 EDT 2025 Fri May 30 21:22:49 EDT 2025 Thu Jun 26 22:59:47 EDT 2025 Mon Feb 07 09:33:34 EST 2022
IsPeerReviewed	false
IsScholarly	false
Keywords	Computer Informatik
LCCN	2007928372
LCCallNum_Ident	QA76.9.M35
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-a44325-e50252562bb30edb4e54445a5b43aaba42b519d077f9733b8dabe9c5108953f83
Notes	With 119 figures, 27 tables, and 212 problems Includes bibliographical references and index
OCLC	209982521
PQID	EBC337875
PageCount	575
ParticipantIDs	springer_books_10_1007_978_0_387_69082_7 proquest_ebookcentral_EBC337875 nii_cinii_1130282269408728064 ciando_primary_ciando236329
PublicationCentury	2000
PublicationDate	2007 c2007
PublicationDateYYYYMMDD	2007-01-01
PublicationDate_xml	– year: 2007 text: 2007
PublicationDecade	2000
PublicationPlace	New York
PublicationPlace_xml	– name: New York – name: New York, NY – name: Boston, MA
PublicationYear	2007
Publisher	Springer-Verlag Springer Springer US
Publisher_xml	– name: Springer-Verlag – name: Springer – name: Springer US
SSID	ssj0000251550
Score	2.3905337
Snippet	"Performance optimization is vital in the design and operation of modern engineering systems, including communications, manufacturing, robotics, and logistics.... Performance optimization is vital in the design and operation of modern engineering systems. This book provides a unified framework based on a sensitivity...
SourceID	springer proquest nii ciando
SourceType	Publisher
SubjectTerms	Artificial Intelligence Calculus of Variations and Optimal Control; Optimization Computer Science Control and Systems Theory Datenverarbeitung Discrete Mathematics in Computer Science Engineering Design Learning models (Stochastic processes) Lernendes System Mathematical optimization Modèles stochastiques d'apprentissage Optimierung Optimisation mathématique Performanz (Linguistik) Probability Theory and Stochastic Processes Stochastisches System Technisches System
Subtitle	A Sensitivity-Based Approach
TableOfContents	Intro -- Preface -- Contents -- Introduction -- An Overview of Learning and Optimization -- Problem Description -- Optimal Policies -- Fundamental Limitations of Learning and Optimization -- A Sensitivity-Based View of Learning and Optimization -- Problem Formulations in Different Disciplines -- Perturbation Analysis (PA) -- Markov Decision Processes (MDPs) -- Reinforcement Learning (RL) -- Identification and Adaptive Control (I&AC) -- Event-Based Optimization and Potential Aggregation -- A Map of the Learning and Optimization World -- Terminology and Notation -- Problems -- Part I Four Disciplines in Learning and Optimization -- Perturbation Analysis -- Perturbation Analysis of Markov Chains -- Constructing a Perturbed Sample Path -- Perturbation Realization Factors and Performance Potentials -- Performance Derivative Formulas -- Gradients with Discounted Reward Criteria -- Higher-Order Derivatives and the MacLaurin Series -- Performance Sensitivities of Markov Processes -- Performance Sensitivities of Semi-Markov Processes* -- Fundamentals for Semi-Markov Processes* -- Performance Sensitivity Formulas* -- Perturbation Analysis of Queueing Systems -- Constructing a Perturbed Sample Path -- Perturbation Realization -- Performance Derivatives -- Remarks on Theoretical Issues* -- Other Methods* -- Problems -- Learning and Optimization with Perturbation Analysis -- The Potentials -- Numerical Methods -- Learning Potentials from Sample Paths -- Coupling* -- Performance Derivatives -- Estimating through Potentials -- Learning Directly -- Optimization with PA -- Gradient Methods and Stochastic Approximation -- Optimization with Long Sample Paths -- Applications -- Problems -- Markov Decision Processes -- Ergodic Chains -- Policy Iteration -- Bias Optimality -- MDPs with Discounted Rewards -- Multi-Chains -- Policy Iteration -- Bias Optimality -- MDPs with Discounted Rewards -- The nth-Bias Optimization* -- nth-Bias Difference Formulas* -- Optimality Equations* -- Policy Iteration* -- nth-Bias Optimal Policy Spaces* -- Problems -- Sample-Path-Based Policy Iteration -- Motivation -- Convergence Properties -- Convergence of Potential Estimates -- Sample Paths with a Fixed Number of Regenerative Periods -- Sample Paths with Increasing Lengths -- "Fast" Algorithms* -- The Algorithm That Stops in a Finite Number of Periods* -- With Stochastic Approximation* -- Problems -- Reinforcement Learning -- Stochastic Approximation -- Finding the Zeros of a Function Recursively -- Estimating Mean Values -- Temporal Difference Methods -- TD Methods for Potentials -- Q-Factors and Other Extensions -- TD Methods for Performance Derivatives -- TD Methods and Performance Optimization -- PA-Based Optimization -- Q-Learning -- Optimistic On-Line Policy Iteration -- Value Iteration -- Summary of the Learning and Optimization Methods -- Problems -- Adaptive Control Problems as MDPs -- Control Problems and MDPs -- Control Systems Modelled as MDPs -- A Comparison of the Two Approaches -- MDPs with Continuous State Spaces -- Operators on Continuous Spaces -- Potentials and Policy Iteration -- Linear Control Systems and the Riccati Equation -- The LQ Problem -- The JLQ Problem* -- On-Line Optimization and Adaptive Control -- Discretization and Estimation -- Discussion -- Problems -- Part II The Event-Based Optimization - A New Approach -- Event-Based Optimization of Markov Systems -- An Overview -- Summary of Previous Chapters -- An Overview of the Event-Based Approach -- Events Associated with Markov Chains -- The Event and Event Space -- The Probabilities of Events -- The Basic Ideas Illustrated by Examples -- Classification of Three Types of Events -- Event-Based Optimization -- The Problem Formulation -- Performance Difference Formulas -- Performance Derivative Formulas -- Optimization -- Learning: Estimating Aggregated Potentials -- Aggregated Potentials -- Aggregated Potentials in the Event-Based Optimization -- Applications and Examples -- Manufacturing -- Service Rate Control -- General Applications -- Problems -- Constructing Sensitivity Formulas -- Motivation -- Markov Chains on the Same State Space -- Event-Based Systems -- Sample-Path Construction* -- Parameterized Systems: An Example -- Markov Chains with Different State Spaces* -- One Is a Subspace of the Other* -- A More General Case -- Summary -- Problems -- Part III Appendices: Mathematical Background -- Probability and Markov Processes -- Probability -- Markov Processes -- Problems -- Stochastic Matrices -- Canonical Form -- Eigenvalues -- The Limiting Matrix -- Problems -- Queueing Theory -- Single-Server Queues -- Queueing Networks -- Some Useful Techniques -- Problems -- Notation and Abbreviations -- References -- Index
Title	Stochastic Learning and Optimization
URI	http://ebooks.ciando.com/book/index.cfm/bok_id/236329 https://cir.nii.ac.jp/crid/1130282269408728064 https://ebookcentral.proquest.com/lib/[SITE_ID]/detail.action?docID=337875 http://link.springer.com/10.1007/978-0-387-69082-7
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
linkProvider	Library Specific Holdings
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=book&rft.title=Stochastic+learning+and+optimization+%3A+a+sensitivity-based+approach&rft.au=Cao%2C+Xi-Ren&rft.date=2007-01-01&rft.pub=Springer&rft.isbn=9780387367873&rft_id=info:doi/10.1007%2F978-0-387-69082-7&rft.externalDocID=BA83735788
thumbnail_s	http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fmedia.springernature.com%2Fw306%2Fspringer-static%2Fcover-hires%2Fbook%2F978-0-387-69082-7