Article • Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022 • 197
Interpretability & Analysis of LMs • Collection • Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated about 14 hours ago • 97