Automated Evaluation of LLMs
Infinite Machine Learning: Artificial Intelligence | Startups | Technology
May 07, 2024
Prateek Joshi

Anand Kannappan is the cofounder and CEO of Patronus AI, an automated AI evaluation and security company. Patronus AI has raised funding from Lightspeed Venture Partners, Replit CEO Amjad Masad, Gokul Rajaram, and Fortune 500 executives. Anand was previously at Meta and Vertis. He also cofounded Kyber Technologies, a service for systematically predicting market events using AI and remote sensing data, which evolved into a futures quant hedge fund managing $15M for partners.

Anand's favorite book: Harry Potter series (Author: J.K. Rowling)

(00:00) Introduction and Common Failure Modes of Large Language Models
(03:02) Challenges of Automated Evaluation in AI Models
(06:08) The Importance of Fine-Tuning and Retrieval Augmented Generation
(09:02) Addressing Copyright Detection in Language Models
(11:51) The Liability of Companies Using AI Models
(15:02) Advancements in Multimodal Models and State Space Models
(20:48) The Role of Fine-Tuning in the Evolution of Language Models
(23:51) The Significance of Adversarial Testing in AI
(25:56) The Role of Retrieval Augmented Generation in AI
(28:05) The Need for Continuous Function Optimization in Prompting
(29:02) Rapid Fire Round

--------
Where to find Prateek Joshi:

Newsletter: https://prateekjoshi.substack.com 
Website: https://prateekj.com 
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 
Twitter: https://twitter.com/prateekvjoshi 
