Blog

Notes on statistical modeling, machine learning systems, and the tools I am learning from.

Data Science

How LASSO Work

A practical walkthrough of LASSO regularization.

GLMStatisticsModeling
How LLMs Work

Notes on next-token prediction, tokenization, embeddings, attention, MLPs, training, inference, hallucination, and nondeterminism.

LLMAttentionTransformers
Model Interpretability: SHAP, LIME, PDP, and ICE Plots

How I think about local explanations, global explanations, marginal effects, and individual-level model behavior.

InterpretabilitySHAPLIME
Double Lift Charts for Insurance Model Evaluation

How double lift charts compare actual loss cost, current pricing, and a challenger model.

InsuranceModel EvaluationLift
Calibration vs. Model Power

Why a model can rank risks well but still produce unreliable probabilities.

CalibrationModel EvaluationRisk
Controls, Offsets, and Omitted Variable Bias in GLMs

When to estimate a variable as a control, when to use an offset, and why omitted variables can bias insurance models.

GLMControlsOffsets
Variance Inflation Factor: Why Adding a Constant Matters

How VIF diagnoses multicollinearity, why it affects standard errors, and why the auxiliary regression should include an intercept.

VIFMulticollinearityRegression
Understanding GLM Coefficients

How GLM coefficients are interpreted, and why correlated predictors can make that interpretation unstable.

GLMCoefficientsMulticollinearity
How GLMs Work

A practical walkthrough of GLM likelihood, link functions, mean-variance relationships, deviance, dispersion, and optimization.

GLMStatisticsModeling

ML Systems