Kerrick's AI safety + ML reading list

This is a list of papers and blog posts that I've read and want to read related to AI safety and ML (and a few broader SWE-related entries). I'm publishing this in an effort to learn in public. Last updated 2026-06-22.

The star ratings are entirely for me and are mostly based on how relevant the work is to my interests and how easily I was able to digest useful insights from the work. The rating is roughly how likely I am to refer back to this piece in the future or recommend it to others (directly vs a summary). A low rating does not mean the work or results are "bad". The commentary is dictated off-the-cuff and all opinions are weakly held.

Read

To Read