One topic that came up in Jay's talk was reinforcement learning with human feedback (RLHF). For anyone who would like to go deeper on the topic, here are two sources I'd recommend:
Andrej Karpathy's "State of GPT" talk
https://www.youtube.com/watch?v=bZQun8Y4L2A
Chip Huyen's RLHF explainer
https://huyenchip.com/2023/05/02/rlhf.html