On the second day of ship-mas, my AI sent to me... reinforcement fine-tuning.

OpenAI just announced an alpha program for a new tool, reinforcement fine-tuning, that lets developers train models on specific tasks using example problems and answers. In a post after the livestream announcement, CEO Sam Altman said this will make it “really easy to create expert models in specific domains with very little training data.”