清华大学:不确定型决策(PPT讲稿)Decision Making under Uncertainty

Decision Making under Uncertainty Jun zhu AI Lab, Tsinghua University http://cs.cmu.edu/-iunzhu Nov6.2011
Jun Zhu AI Lab, Tsinghua University http://cs.cmu.edu/~junzhu Nov 6, 2011 Decision Making under Uncertainty

Decision Making is Ubiquitous THE PERKS OF ONLINE STOCK INVESTING determining which decision. from a set of Pe ossible alternatives, is optimal MLA 2011, Tsinghua university
Decision Making is Ubiquitous determining which decision, from a set of possible alternatives, is optimal 2 MLA 2011, Tsinghua University

Two Challenges o The set of outcomes can be large and complex a The agent must weigh different factors in determining the most referred outcomes a E. g, when deciding which job to take we must consider o Work time, amount of salary, company reputation, officemates utility Theor e The outcome of an action is not fully determined a We must consider the probabilities of various outcomes and the preferences of the agent between these outcomes Probability Theory MLA 2011, Tsinghua University
Two Challenges The set of outcomes can be large and complex ❑ The agent must weigh different factors in determining the most preferred outcomes ❑ E.g., when deciding which job to take we must consider: Work time, amount of salary, company reputation, officemates, … The outcome of an action is not fully determined ❑ We must consider the probabilities of various outcomes and the preferences of the agent between these outcomes Utility Theory Probability Theory 3 MLA 2011, Tsinghua University

Decision Theory o the theory of statistical decision functions [Wald, 1950 +.. is concerned with the process of making decisions and explicitly includes the payoffs that may result determining which decision, from a set of possible alternatives, is optimal for a particular set of conditions MLA 2011, Tsinghua University
Decision Theory the theory of statistical decision functions [Wald, 1950] … is concerned with the process of making decisions and explicitly includes the payoffs that may result ❑ determining which decision, from a set of possible alternatives, is optimal for a particular set of conditions 4 MLA 2011, Tsinghua University

Decision Making in Statistical Learning o distribution-free only training examples are provided a use data to estimate the complete distribution, and then derive decision rule( density estimation is hard P(a Cu) a use training examples to directly make decision, e. g, empirical risk minimization o DT: assumes the complete data distribution is qiven MLA 2011, Tsinghua University
Decision Making in Statistical Learning distribution-free; only training examples are provided ❑ use data to estimate the complete distribution, and then derive decision rule (density estimation is hard!) ❑ use training examples to directly make decision, e.g., empirical risk minimization DT: assumes the complete data distribution is given 5 MLA 2011, Tsinghua University

Outline Utilities and Decisions( Chap 22) Maximum Expected Utility(MEU) principle a Utility theor o Structured decision problems( Chap 23 a Decision tree a Influence diagrams MLA 2011, Tsinghua University
Outline Utilities and Decisions (Chap 22) ❑ Maximum Expected Utility (MEU) principle ❑ Utility theory Structured decision problems (Chap 23) ❑ Decision tree ❑ Influence diagrams 6 MLA 2011, Tsinghua University

Decision-making Situation o A decision-making situation D consists of et of outcomes o-f ON a a set of possible actions that an agent can take A=fal,.,aKJ a a probabilistic outcome model p: A-A0, which defines a lottery Ta Utility function U: 0+R, where U(O) preference fo the outcome o (o1);…;oN:丌a(ON) a a probability distribution over outcomes given the action a was taken Preference ordering: T1 7 T2 if the agent prefers T1 丌1~丌2 if the agent is indifferent between丌iand丌2 MLA 2011, Tsinghua University Compound lottery
Decision-making Situation A decision-making situation consists of ❑ a set of outcomes ❑ a set of possible actions that an agent can take ❑ a probabilistic outcome model , which defines a lottery ❑ a utility function , where is the agent’s preference for the outcome Lottery ❑ a probability distribution over outcomes given the action a was taken ❑ Preference ordering: 7 MLA 2011, Tsinghua University Compound Lottery

Maximum Expected Utility (MEU Principle o The Meu principle asserts that, in a decision-making situation D we should choose the action a that maximizes the expected utility EUDd]=∑xa(oU MLA 2011, Tsinghua University
Maximum Expected Utility (MEU) Principle The MEU principleasserts that, in a decision-making situation , we should choose the action that maximizes the expected utility 8 MLA 2011, Tsinghua University

One state/One Action Example SO EU[S0]=100x0.2+50x0.7+70x0.1 20+35+7 A1 62 S1 0.2 0.7 0.1 100 70 MLA 2011, Tsinghua University
One State/One Action Example s0 s1 s2 s3 A1 0.2 0.7 0.1 100 50 70 EU[S0] = 100 x 0.2 + 50 x 0.7 + 70 x 0.1 = 20 + 35 + 7 = 62 9 MLA 2011, Tsinghua University

One state/Two Actions Example SO ElsO=62 EU2S0=74 euso=maxEUlSo,Eu2ISOB 74 Al A2 S2 S4 0.2 0.70.20.1 100 50 70 MLA 2011, Tsinghua University
One State/Two Actions Example s0 s1 s2 s3 A1 0.2 0.7 0.1 100 50 70 A2 s4 0.2 0.8 80 • EU1[S0] = 62 • EU2[S0] = 74 • EU[S0] = max{EU1[S0],EU2[S0]} = 74 10 MLA 2011, Tsinghua University
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 《Computer Networking:A Top Down Approach》英文教材教学资源(PPT课件讲稿,4th edition)Chapter 1 Introduction.ppt
- 《神经网络和模糊系统》课程教学资源(PPT讲稿)第四章 突触动力学、非监督学习.ppt
- 南京航空航天大学:《C++》课程电子教案(PPT课件讲稿)第4章 类的高级部分.ppt
- 西安电子科技大学:《计算机网络 Computer Networks》课程教学资源(PPT课件讲稿)概述(主讲:岳鹏).ppt
- 计算机语言的学科形态与发展历程(PPT课件讲稿).ppt
- 西安电子科技大学:《计算机网络 Computer Networks》课程教学资源(PPT课件讲稿)第一章 概述(主讲:马涛).pptx
- 北京航空航天大学:动态拼车服务中的高效插入操作(PPT讲稿)An Efficient Insertion Operator in Dynamic Ridesharing Services.pptx
- 中国科学技术大学:《计算机体系结构》课程教学资源(PPT课件讲稿)第三章 流水线技术.pptx
- 《网络算法学》课程教学资源(PPT课件讲稿)第二部分 端节点算法学 第五章 拷贝数据.ppt
- 中国科学技术大学:《数值分析》课程教学资源(PPT课件讲稿)第1章 插值.ppt
- 中国科学技术大学:《算法基础》课程教学资源(PPT课件讲稿)第四讲 递归和分治策略(主讲人:吕敏).pptx
- 中国科学技术大学:《现代密码学理论与实践》课程教学资源(PPT课件讲稿)第10章 密钥管理与其他公钥体制.pptx
- 《数据结构 Data Structure》课程教学资源(PPT课件讲稿)第四章 数组、串与广义表.ppt
- 《Computer Networking:A Top Down Approach》英文教材教学资源(PPT课件讲稿,6th edition)Chapter 1 Introduction.ppt
- 《数据结构》课程教学资源(PPT课件讲稿)第五章 树.ppt
- 《Computer Networking:A Top Down Approach》英文教材教学资源(PPT课件讲稿,6th edition)Chapter 2 Application Layer.ppt
- 中国科学技术大学:A Practical Verification Framework for Preemptive OS Kernels(PPT讲稿).ppt
- 《算法设计与分析基础》课程教学课件(PPT讲稿)Chapter 2 Fundamentals of the Analysis of Algorithm Efficiency.ppt
- 中国医科大学:《计算机基础》课程教学资源(PPT课件)第8章 Internet应用基础.ppt
- RDA Testing & Comparison with AACR2(session 1).ppt
- 西安电子科技大学:《计算机网络 Computer Networks》课程教学资源(PPT课件讲稿)第五章 传输层.pptx
- 《机器学习》课程教学资源(PPT课件讲稿)第七章 贝叶斯分类器 MACHINE LEARNING.pptx
- 清华大学:计算机科学与技术(PPT讲稿)组播 Multicast.pptx
- 《网络算法学》课程教学资源(PPT课件讲稿)第四章 原则的运用.ppt
- 《电子商务概论》课程教学资源(PPT课件讲稿)第7章 电子商务与物流.ppt
- 《计算机网络》课程教学资源(PPT课件讲稿,第三版)Chapter 04 网络层 Network Layer.ppt
- Efficient Algorithms for Optimal Location Queries in Road Networks.ppt
- 北京大学:《项目成本管理》课程教学资源(PPT课件讲稿)质量管理计划(主讲:周立新).ppt
- Flexible Online Task Assignment in Real-Time Spatial Data.pptx
- 《机器学习》课程教学资源(PPT课件讲稿)第10讲 决策树.ppt
- 《物联网技术导论》课程教学资源(PPT讲稿)Continuous Scanning with Mobile Reader in RFID Systems - an Experimental Study.pptx
- 西安电子科技大学:《计算机网络 Computer Networks》课程教学资源(PPT课件讲稿)第六章 应用层.pptx
- 沈阳理工大学:《计算机网络技术及应用》课程教学资源(PPT课件讲稿)第一章 互联网与网站 Interent & Website(主讲:廉哲).ppt
- 香港科技大学:Transaction Management、Serializability Theory and Concurrency Control、Lock-Based Protocols、Deadlock Problems、Recovery.ppt
- 《大学计算机基础》课程电子教案(PPT教学课件)第5章 多媒体技术基础.ppt
- 中国科学院:超级计算平台Linux初级培训(PPT讲稿,2009.11).ppt
- Routing in Vehicular Ad Hoc Network(PPT课件讲稿).ppt
- Linux操作系统初级培训(PPT讲稿)DSC认证培训体系.ppt
- Linux操作系统使用(PPT讲稿,简明基础教程,共七章).ppt
- 香港科技大学:Overviewof the Internet of Things(IoTs,PPT课件讲稿).ppsx