学位论文详细信息
Developing trust and managing uncertainty in partially observable sequential decision-making environments
Stochastic optimization;Dynamic programming;Supply chain design;Trust;Data-driven decision-making;Structure;Value of information;Robust optimization;Game theory;Simulation
Bishop, Robert Reid ; White, Chelsea C. Industrial and Systems Engineering Zhou, Enlu Ayhan, Hayriye Wang, He Eames, Brandon Outkin, Alexander ; White, Chelsea C.
University:Georgia Institute of Technology
Department:Industrial and Systems Engineering
关键词: Stochastic optimization;    Dynamic programming;    Supply chain design;    Trust;    Data-driven decision-making;    Structure;    Value of information;    Robust optimization;    Game theory;    Simulation;   
Others  :  https://smartech.gatech.edu/bitstream/1853/62302/1/BISHOP-DISSERTATION-2019.pdf
美国|英语
来源: SMARTech Repository
PDF
【 摘 要 】

This dissertation consists of three distinct, although conceptually related, papers that are unified in their focus on data-driven, stochastic sequential decision-making environments, but differentiated in their respective applications. In Chapter 2, we discuss a special class of partially observable Markov decision processes (POMDPs) in which the sources of uncertainty can be naturally separated into a hierarchy of effects — controllable, completely observable effects and exogenous, partially observable effects. For this class of POMDPs, we provide conditions under which value and policy function structural properties are inherited from an analogous class of MDPs, and discuss specialized solution procedures. In Chapter 3, we discuss an inventory control problem in which actions are time-lagged, and there are three explicit sources of demand uncertainty — the state of the macroeconomy, product-specific demand variability, and information quality. We prove that a base stock policy — defined with respect to pipeline inventory and a Bayesian belief distribution over states of the macroeconomy — is optimal, and demonstrate how to compute these base stock levels efficiently using support vector machines and Monte Carlo simulation. Further, we show how to use these results to determine how best to strategically allocate capital toward a better information infrastructure or a more agile supply chain. Finally, in Chapter 4, we consider how to generate trust in so-called development processes, such as supply chains, certain artificial intelligence systems, and maintenance processes, in which there can be adversarial manipulation and we must hedge against the risk of misapprehension of attacker objectives and resources. We show how to model dynamic agent interaction using a partially-observable Markov game (POMG) framework, and present a heuristic solution procedure, based on self-training concepts, for determining a robust defender policy.

【 预 览 】
附件列表
Files Size Format View
Developing trust and managing uncertainty in partially observable sequential decision-making environments 4732KB PDF download
  文献评价指标  
  下载次数:16次 浏览次数:26次