?This post covers the topic of the video in more detail and includes some code samples. The $9,000 Problem You launch a chatbot powered by one of the popular LLMs like Gemini, Claude or GPT-4. It’s amazing and your users love it. Then you check your API bill at the end of the month: $15,000. […]
04
2026
Semantic Caching for LLM Apps: Reduce Costs by 40-80% and Speed up by 250x
25
2025
Building the Future of MySQL: Announcing Plans for MySQL Vector Support and a MySQL Binlog Server
At Percona, our mission has always been to help you succeed with open source databases. We do that by listening to the community, understanding the challenges you face, and building the solutions you need. Now, after a comprehensive review of market trends and direct feedback from our customers and the MySQL community, we are excited […]
18
2025
How Can AI Talk to My Database Part Two: MySQL and Gemini
My first experiments creating an MCP Server to provide AI access to a PostgreSQL database using the FastMCP Python framework and Anthropic’s and OpenAI’s APIs highlighted an important requirement: for now, these two APIs can only communicate with an MCP server through HTTPS over a public URL. While researching how to make this work (which […]
17
2025
How Can AI Talk to My (PostgreSQL) Database?
I admittedly have some work to do to catch up with the AI “trend”. It’s been around (as in, easily accessible) for a few years now, but I can probably still count on my fingers the number of times I’ve used a prompt to ask it anything. That is, discounting the mostly frustrating and usually […]
25
2025
pgvector: The Critical PostgreSQL Component for Your Enterprise AI Strategy
You’re likely racing to enhance your applications with more intelligent, data-driven capabilities, whether through AI-powered models (which have moved into “must implement now!” territory), advanced search functions, real-time fraud detection, or geospatial analysis. As these demands grow, you face a significant challenge: efficiently storing, managing, and querying high-dimensional vector data within your existing database infrastructure. […]
18
2025
The Percona Perspective: A Vision for the Future of Open Source Databases in the Age of AI
From punch cards to containerization, database technologies have changed tremendously over the years. And there has been no shortage of watershed moments along the way. However, we believe the emergence of advanced AI may very well be the most significant one yet. Percona certainly isn’t alone in that belief. But, when it comes to the […]
07
2024
Open Source AI Database Agent Part 2: OSADA
In the first part of this series, we built a basic AI-powered database agent. Now, it’s time to make a minimal viable product out of it. We’ll refine our scripts, add support for MySQL, incorporate OpenAI, and wrap everything in a user-friendly interface with Docker for easy deployment. Get ready to transform your data interactions […]
05
2024
Open Source AI Database Agent Part 1: Introduction
Generative AI is top of mind for many engineers. The questions of how it can be applied to solve business problems and boost productivity are still up in the air. Recently I wrote a blog post about the impact of AI on platform engineers, where I talked about various AI Agents and how they can […]
23
2024
Beyond Cloud: The Impact of AI Agents on Platform Engineering
In November 2023, Bill Gates wrote an article about how AI would change the way we interact with computers and personal devices through so-called Agents: “Agents are not only going to change how everyone interacts with computers. They’re also going to upend the software industry, bringing about the biggest revolution in computing since we went from […]
24
2024
An Introduction to Vector Databases
Imagine that winter is coming to the south of the planet, that you are going on a vacation trip to Patagonia, and you want to buy some cozy clothes. You go to that nice searcher page that says “do no evil” and write in the search field “Jackets for Patagonia weather,” not thinking of a […]