This blog discusses the emergence of agentic AI, capable of planning and executing complex tasks autonomously, contrasting with traditional generative AI. The post emphasizes the importance of managing expectations, oversight, and ensuring transparency due to the unpredictability, including potential hallucinations associated with these systems. LangGraph is highlighted as a powerful tool for developing agentic workflows.
Exploring the “AI Operational Complexity Cube idea” for Testing Applications integrating LLMs
The post explores the integration of Large Language Models (LLMs) in applications, stressing the need for effective production testing. It introduces the AI Operational Complexity Cube concept, emphasizing new testing dimensions for LLMs, including prompt testing and user engagement. A structured testing approach is proposed to ensure reliability and robustness.
