An Optimal Clustering Algorithm for the Labeled Stochastic Block Model
Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
Fair Matrix Factorisation for Large-Scale Recommender Systems
強化学習一般
Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games
Anytime Capacity Expansion in Medical Residency Match by Monte Carlo Tree Search
Computing Strategies of American Football via Counterfactual Regret Minimization
Thresholded Lasso Bandit
Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games
Mean Variance Efficient Reinforcement Learning
見間違えのある繰り返し囚人のジレンマにおける方策勾配法に関する研究
強化学習
Online Learning for Bidding Agent in First Price Auction