
The Data Exchange with Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
The Data Exchange with Ben Lorica
The Evolution of Reinforcement Fine-Tuning in AI
•
Ben Lorica
Travis Addair is Co-Founder & CTO at Predibase. In this episode, the discussion centers on transforming pre-trained foundation models into domain-specific assets through advanced customization techniques.
Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/
Support our work by leaving a small tip 💰 https://buymeacoffee.com/gradientflow
Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.