Kerrick's AI safety + ML reading list

This is a list of papers and blog posts that I've read and want to read related to AI safety and ML (and a few broader SWE-related entries). I'm publishing this in an effort to learn in public. Last updated 2026-05-17.

The star ratings are entirely for me and are mostly based on how relevant the work is to my interests and how easily I was able to digest useful insights from the work. A low rating does not mean the work or results are "bad". The commentary is dictated off-the-cuff and all opinions are weakly held.

Read

To Read