K
On the second day of ship-mas, my AI sent to me... reinforcement fine-tuning.
OpenAI just announced an alpha program for a new tool (called reinforcement fine-tuning) that lets developers train models on specific tasks, using example problems and answers. In a post after the livestream announcement, CEO Sam Altman said this will make it “really easy to create expert models in specific domains with very little training data.”
Most Popular
Most Popular