主页

Markov Perfect Equilibrium

2023/1/20 2025/12/16 运筹与优化 947

Stochastic Games A (discounted) stochastic game with $N$ players consists of the following elements. A state space $\mathcal{X}$. For each player $i$ and state $x$, a set $A_i(x)$ of actions available to player $i$ in state $x$. For each player $i$, state……

Online Linear Programming

2023/1/8 2025/12/16 运筹与优化 234

Consider a generic LP problem: $$ \begin{aligned} \max \;& \sum_{j=1}^n r_j x_j \\ \text {s.t. } & \sum_{j=1}^n a_{i j} x_j \leq b_i, \quad i=1, \ldots, m \\ & 0 \leq x_j \leq 1, j=1, \ldots, n, \end{aligned} \qquad \Longleftrightarrow \qquad \begin{aligned}……

Supply Chain Management Basics

2023/1/1 2025/12/16 运筹与优化 2662

经济批量订货模型（EOQ） Economic order quantity (EOQ) 的基本假设：需求确定，单位时间内的需求是 $\lambda $ 不允许……

minimax 定理

2022/11/16 2025/12/16 分析与概率 341

Max–min inequality 一个矩阵，行最小值的最大值，不超过其列最大值的最小值。设 $f:X \times Y\to \ma……

Dirichlet Process

2022/11/14 2025/12/16 分析与概率 218

Dirichlet Process The original deﬁnition of the DP is due to Ferguson (1973)1 . Given a measurable space $(\Omega, \mathcal{F})$, a random distribution (measure) $G$ is said to follow a Dirichlet pocess with a……

Multi-armed Bandits (3)

2022/11/12 2025/12/16 运筹与优化/MAB 814

Lower Bound 证明 lower bound，我们需要构造一些问题实例（problem instances, $\mathcal{……

Analytics for an Online Retailer: Demand Forecasting and Price Optimization

2022/10/25 2025/12/16 论文简读 2317

发表在 Manufacturing & Service Operations Management, 2016. DOI: https://doi.org/10.1287/msom.2015.0561. Keywords: online retailing; ﬂash sales; initial pricing; revenue management; price optimization; machine learning; regression trees; demand forecasting; demand interdependency; model implementation 这篇文章属于数……

Nonparametric learning methods

2022/10/1 2025/12/16 运筹与优化 3994

Robust Dynamic Pricing with Demand Learning in the Presence of Outlier Customers OR, 2022. Articles in Advance. Xi Chen, Yining Wang. 这篇文章研究在 model misspecification 下的动态定价问题。在 $(1-\epsilon)T$ 段……

Parametric learning methods

2022/10/1 2025/12/16 运筹与优化 2859

Weak aggregating algorithm for the distribution-free perishable inventory problem ORL 2010 In this article, we propose a novel approach to the distributionfree, multi-period problem that utilizes recent advances in the theory of prediction and learning with expert advice. Weak Aggregating Algorithm (WAA)……

Multi-armed Bandits (2)

2022/9/30 2025/12/16 运筹与优化/MAB 321

Contextual Bandits Lipschitz Bandits Continuum-armed bandits 先考虑 arm 是连续变量的特殊情况（CAB），不妨假设 arm 是 $X=[0, 1]$，其均值 $\mu(x)$ 满……