Supervised Fine Tuning on Curated Data Is Reinforcement Learning independentresearch.ai 3 points by saijajin 7 hours ago