AARTHI ANBALAGAN; MUTHURAMAN SAMINATHAN; VINCENT KANKA. Reinforcement Learning from Human Feedback for Enhanced Code Generation and Debugging Capabilities in LLMs. Journal of Computational Intelligence and Robotics, Ahmedabad, India, v. 4, n. 1, p. 152–193, 2024. Available at: https://nucleuscorp.org/jcir/article/view/563. Accessed: 15 Jun. 2025.