One topic that came up in Jay's talk was reinforcement learning with human feedback (RLHF). For anyone who would like to go deeper on the topic, here are two sources I'd recommend:
Andrej Karpathy's "State of GPT" talk
State of GPT | BRK216HFSChip Huyen's RLHF explainer