Chunpai Wang, PhD @ SUNY-Albany

This is a note about finding the connection between structured prediction problems and reinforcement learning, which starts from the structured-svm and conditional random fields, and ends with the expected reward maximization with entropy regularization and reward augmented maximum likelihood.

Structured-SVM

Derive Multi-class SVM from Logistic Regression

Reward Augmented Maximum Likelihood