One topic that came up in Jay's talk was reinforcement learning from human feedback (RLHF). For anyone who would like to go deeper, here are two sources I'd recommend:
Andrej Karpathy's "State of GPT" talk

Chip Huyen's RLHF explainer