发表在 Operations Research, 1990. DOI: https://doi.org/10.1287/opre.38.2.278. Subject classification: Inventory/production, multi-item/echelon/stage: multilocation models with lateral transshipments. This paper examines appropriate inventory policies when transshipments among multiple outlets are permitted as recourse actions once demands are observed. $x_t \in \mathrm{R}^n$: starting inventory……
发表在 Management Science, 2022. DOI: https://doi.org/10.1287/mnsc.2020.3922. Area of review: optimization. Key words: prescriptive analytics; data-driven optimization; machine learning; linear regression This paper proposes a general framework called Smart “Predict, then Op……
This lecture we consider methods that learn a parameterized policy that can select actions without consulting a value function. We denote $\boldsymbol{\theta}$ the policy parameter, and write the policy $$ \pi(a \mid s, \boldsymbol{\theta})=\operatorname{Pr}\left\{A_t=a \mid S_t=s, \boldsymbol{\theta}_t=\boldsymbol{\theta}\right\} $$ If the performance measure……
Stochastic dominance is a partial order between random variables. The concept arises in decision theory and decision analysis in situations where one gamble (a probability distribution over possible outcomes) can be ranked as superior to another gamble for a broad class of……