Archive

reinforcement-learning

Top-k Off-Policy Correction for a REINFORCE Recommender System (10 Oct 2020)
Multi-armed Bandits: Thompson Sampling (06 May 2020)
Structured Prediction and Reinforcement Learning (25 Aug 2019)
Stochastic Policy Gradient (20 Apr 2019)
Q Learning (12 Apr 2019)

deep-learning

Attention Mechanism [3]: Memory Networks (21 Jun 2020)
Attention Mechanism [2]: Transformer and Graph Attention Networks (17 Jun 2020)
Attention Mechanism [1]: Seq2Seq Models (15 Jun 2020)
Generative Adversarial Networks (21 Nov 2018)

approximate-inference

Gibbs Sampling with Data Augmentation for Normal Ogive Item Response Theory (18 May 2020)
Variational Inference (22 May 2018)
Expectation Maximization Explaination (27 Apr 2018)
Markov Chain Monte Carlo (01 Apr 2018)

structured-prediction

Structured Prediction and Reinforcement Learning (25 Aug 2019)
Energy Based Models (03 Oct 2018)
Conditional Random Fields (22 Aug 2017)

probabilistic-machine-learning

Logistic Regression (08 Aug 2017)
Probabilistic Interpretation of Ridge Regression and LASSO (15 Mar 2017)
Point Estimation and Statistical Inference (28 Jan 2017)

statistics

Probabilistic Interpretation of Ridge Regression and LASSO (15 Mar 2017)
Point Estimation and Statistical Inference (28 Jan 2017)

generalized-linear-model

Logistic Regression (08 Aug 2017)
Probabilistic Interpretation of Ridge Regression and LASSO (15 Mar 2017)

recommender-system

Top-k Off-Policy Correction for a REINFORCE Recommender System (10 Oct 2020)

programming

Wrapping C Program For Python (07 Jan 2018)

optimization

First Order and Second Order Methods for Convex Optimization (30 May 2017)

counterfactual-evaluation

Top-k Off-Policy Correction for a REINFORCE Recommender System (10 Oct 2020)