{"id":7720,"date":"2026-04-16T13:44:40","date_gmt":"2026-04-16T11:44:40","guid":{"rendered":"https:\/\/blog.bart.sk\/en\/?p=7720"},"modified":"2026-04-16T13:45:39","modified_gmt":"2026-04-16T11:45:39","slug":"ai-and-cloud-in-production-why-most-ai-solutions-never-move-beyond-the-demo","status":"publish","type":"post","link":"https:\/\/blog.bart.sk\/en\/ai-and-cloud-in-production-why-most-ai-solutions-never-move-beyond-the-demo\/","title":{"rendered":"AI and Cloud in Production: Why Most AI Solutions Never Move Beyond the Demo"},"content":{"rendered":"<p><strong>AI tools are evolving fast, and in demos they can feel incredibly convincing \u2014 especially when built on a clean slate. The real difference shows up when they need to become part of a real product, one with history, dependencies, and complex integrations.<\/strong><\/p>\n<p>AWS Community Day in Ko\u0161ice wasn\u2019t about shiny new features for me. It was about reality \u2014 what actually works in production and which changes make sense even for existing systems. These are three takeaways I brought back from the conference as a DevOps engineer and former backend developer at Crossuite (along with a few stickers and a notebook \ud83d\ude42).<\/p>\n<h2>1. AI delivers the most value in clearly defined use cases<\/h2>\n<p>Across multiple talks, one theme kept coming up \u2014 the gap between what we can quickly build with AI today and what can actually run reliably in production.<\/p>\n<p>Solutions built on LLMs, agents, or tools like Amazon Bedrock looked very impressive at first. <strong>The turning point came when the discussion shifted to real-world deployment \u2014 specifically how these solutions fit into existing systems, how their behavior is monitored, and how their outputs are controlled.<\/strong><\/p>\n<p><strong>Key idea:<\/strong><\/p>\n<blockquote>\n<p>AI only makes sense when it\u2019s useful, not when it\u2019s trendy.<\/p>\n<\/blockquote>\n<p>At Crossuite, this approach has proven itself in practice. We integrate AI into specific steps, such as data processing or working with text, where inputs and outputs are clearly defined and the impact is immediate. This is where AI becomes a natural part of the product, delivering real value.<\/p>\n<h2>2. Less code, more controlled flow<\/h2>\n<p>In serverless discussions, a clear shift toward simpler and more transparent architecture kept emerging. <strong>A standout moment came from Tom\u00e1\u0161 Sabol\u2019s talk, which clearly articulated how the role of AWS Lambda is evolving.<\/strong><\/p>\n<p>Lambda remains a powerful tool for handling business logic, but the way we use it is changing. Its greatest value comes when it focuses on a specific operation, while orchestration is handled by other parts of the system \u2014 such as native AWS services like API Gateway, EventBridge, or Step Functions.<\/p>\n<p>One recommendation stood out:<\/p>\n<blockquote>\n<p>The less logic hidden inside Lambda functions, the more readable and stable the system becomes.<\/p>\n<\/blockquote>\n<p>Instead of \u201cgluing\u201d services together with custom code, responsibility shifts into the architecture, where flow is explicitly defined and easier to control. This leads to less custom code and systems that are easier to debug, scale, and evolve \u2014 while remaining understandable even as complexity grows.<\/p>\n<figure>\n<img decoding=\"async\" src=\"https:\/\/blog.bart.sk\/wp-content\/uploads\/2026\/04\/AWS-Community-Day-Slovakia-016-2000x1500.jpg\" alt=\"AWS Community Day Slovakia\" \/><figcaption><em>I attended the conference together with my colleagues Ady and Mi\u0161o (pictured here in yellow).<\/em><\/figcaption><\/figure>\n<h2>3. Karpenter shows that scaling is solved. Cost is not.<\/h2>\n<p>Viktor Vedmich\u2019s talk on Karpenter stood out because it didn\u2019t focus on scaling Kubernetes clusters itself, but on everything around it. He prepared two demos for the session.<\/p>\n<p>The first one was smaller \u2014 a cluster running 10 pods. <strong>Using a visualization tool, he showed what was happening inside the cluster \u2014 Karpenter dynamically added and removed nodes and rescheduled pods between them.<\/strong> It was both impressive and easy to grasp.<\/p>\n<p>The second demo aimed higher. When asked how many pods to scale to, someone from the audience suggested \u201c5000.\u201d Viktor kicked off the scripts, and hundreds of pods were supposed to start spinning up in the background. <strong>After about 15 minutes, it became clear that something wasn\u2019t right.<\/strong> A small issue caused the process to stall at the beginning. It happens. The demo didn\u2019t go as planned \u2014 but at Crossuite, we know Karpenter works in practice.<\/p>\n<p>The talk included several practical tips on how to further improve Karpenter through configuration. <strong>By combining Spot and On-Demand instances (for example in a 50\/50 split) and applying consolidation strategies, it\u2019s possible to significantly reduce cluster operating costs.<\/strong><\/p>\n<p>We already run Karpenter in production at Crossuite. The talk confirmed that our foundations are solid, while also showing there\u2019s still room to push further. <strong>Distributing workloads between Spot and On-Demand instances could bring additional cost savings without compromising application stability.<\/strong><\/p>\n<h2>What I\u2019m taking into practice<\/h2>\n<p>After the conference, I realized that many of these topics are already part of what we do. Still, it was valuable to see them clearly named and broken down in detail.<\/p>\n<p>Across AI, serverless, and infrastructure, one pattern kept repeating:<\/p>\n<blockquote>\n<p>The difference doesn\u2019t come from big decisions, but from how the smaller ones are set up \u2014 and how often we revisit them over time.<\/p>\n<\/blockquote>\n<p>Infrastructure stood out the most for me. Scaling works well today, but how efficiently a system runs between load peaks is where attention to detail really pays off.<\/p>\n<figure>\n<img decoding=\"async\" src=\"https:\/\/blog.bart.sk\/wp-content\/uploads\/2026\/04\/IMG_4393-2000x1374.jpg\" alt=\"Conference moment\" \/><br \/>\n<\/figure>\n<h2>Frequently Asked Questions about AI, Cloud, and Production Systems<\/h2>\n<div class=\"faq\">\n<details>\n<summary>Why do many AI solutions never move beyond the demo stage?<\/summary>\n<p>AI solutions often work well in isolation or early prototypes. The challenge comes when integrating them into real products with existing architecture, data, and operational constraints. That\u2019s where production readiness is truly tested.<\/p>\n<\/details>\n<details>\n<summary>When does AI deliver the most value in a product?<\/summary>\n<p>AI delivers the most value in clearly defined use cases with structured inputs and expected outputs \u2014 such as data processing, automation, or text handling.<\/p>\n<\/details>\n<details>\n<summary>What does it mean that AI should be useful, not trendy?<\/summary>\n<p>It means AI should solve a real problem or improve an existing process. Its value lies in practical application, not in simply adopting the latest technology.<\/p>\n<\/details>\n<details>\n<summary>What is AWS Lambda used for?<\/summary>\n<p>AWS Lambda allows you to run code without managing servers. It\u2019s commonly used for handling specific business logic, event-driven processing, and lightweight APIs.<\/p>\n<\/details>\n<details>\n<summary>Why does serverless architecture often lead to less code?<\/summary>\n<p>Serverless architectures rely on managed services like API Gateway, EventBridge, or Step Functions, which take over orchestration and infrastructure concerns, reducing the need for custom code.<\/p>\n<\/details>\n<details>\n<summary>What is Karpenter in Kubernetes?<\/summary>\n<p>Karpenter is a Kubernetes autoscaling tool that dynamically provisions infrastructure based on workload demands, helping optimize performance and cost.<\/p>\n<\/details>\n<details>\n<summary>Why isn\u2019t scaling enough on its own?<\/summary>\n<p>Scaling ensures availability under load, but efficiency depends on how resources are managed outside peak times \u2014 including how quickly capacity is released and how workloads are distributed.<\/p>\n<\/details>\n<details>\n<summary>How can Kubernetes costs be optimized in AWS?<\/summary>\n<p>Costs can be reduced through proper configuration \u2014 including NodePools, Spot vs. On-Demand instance usage, and consolidation strategies.<\/p>\n<\/details>\n<details>\n<summary>What is the benefit of using Spot and On-Demand instances together?<\/summary>\n<p>Combining both allows you to lower costs with Spot instances while maintaining reliability with On-Demand capacity.<\/p>\n<\/details>\n<details>\n<summary>Why are conferences like AWS Community Day valuable?<\/summary>\n<p>They provide real-world insights, practical examples, and deeper understanding of how technologies behave in production environments.<\/p>\n<\/details>\n<\/div>\n","protected":false},"excerpt":{"rendered":"AI tools are evolving fast, and in demos they can feel incredibly convincing \u2014 especially when built on&hellip;","protected":false},"author":53,"featured_media":7722,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":"","csco_display_header_overlay":false,"csco_singular_sidebar":"","csco_page_header_type":""},"categories":[210],"tags":[346,1108,1116,317,1126,1117,570,223,1119,1109,633,1114,1122,432,224,1113,1124,1121,1115,1110,1120,1123,1118,274,1111,1112,1125,1127],"_links":{"self":[{"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/posts\/7720"}],"collection":[{"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/users\/53"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/comments?post=7720"}],"version-history":[{"count":1,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/posts\/7720\/revisions"}],"predecessor-version":[{"id":7723,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/posts\/7720\/revisions\/7723"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/media\/7722"}],"wp:attachment":[{"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/media?parent=7720"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/categories?post=7720"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.bart.sk\/en\/wp-json\/wp\/v2\/tags?post=7720"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}