It’s tempting just to replicate all databases in the cloud, but it’s a much better approach to get your data house in order as part of the move. Credit: Thinkstock Last week I discussed database normalization as a best practice in multicloud architecture. Let’s look at this concept within cloud migration, as well. Don’t confuse database normalization with data normalization. Data normalization is about reducing redundancy and defining a more optimized structure. Perhaps you DBAs are aware of this process. I taught it in college more than 30 years ago. Database normalization is the process of reducing redundancy of the databases themselves to create a set of databases that are better focused on serving the needs of the business applications, data scientists, and those performing data analytics. The challenge is that databases and data slated to move to the cloud are overly complex and full of redundancy (few single sources of truth). Also, most people moving that data to the cloud just want to replicate the databases to the public cloud destination—a huge mistake, and here’s why. I understand that most budgets are limited and that the cost of moving and combining data to cloud-native and noncloud-native databases is much higher than simply pushing bad database architectures to the cloud. However, I also understand that you’re better off figuring that out as you move to the cloud, rather than having to fix it later. If you don’t, you’ll have to migrate data twice: First lifting and shifting to a public cloud or clouds, then having to loop back and fix things once you figure out that the database architecture in the cloud is not optimized (because it’s overly complex, or redundant, or too expensive). What should those charged with cloud migration do? Here is the ideal process: Get buy-in from stakeholders. This is first because you will spend at least twice as much for the migration and normalization, including changing applications to process the new database stack. If leaders are not willing to put up the money, explain the risks and future costs. (Try sending an email. You can pull it out of your sent mail folder as proof you tried to warn against the folly of doing nothing.) Make sure you have the necessary human resources. You’ll need people who understand the incumbent database stack, cloud-native database options, database design, and data migration processes. Include data security and governance in there, as well as data operations. Spend enough time on planning. Much of this work is around migration planning, including how the databases will change and which new or existing ones to use. The devil is in the details. If you’re missing a piece of middleware or a data compliance system, it will have to be redone. This is really not as hard as it seems. We’ve moved from platform to platform and simultaneously changed databases in the past. What’s new is that we’re looking more strategically at the data now. It needs to be core analytical data, as well as training data for AI systems. Data is really everything to the business now. Treat it as such. Related content analysis Generative AI won’t fix cloud migration You’ve probably heard how generative AI will solve all cloud migration problems. It’s not that simple. Generative AI could actually make it harder and more costly. By David Linthicum Jul 12, 2024 5 mins Generative AI Artificial Intelligence Cloud Computing analysis All the brilliance of AI on minimalist platforms Buy all the processing and storage you can or go with a minimum viable platform? AI developers and designers are dividing into two camps. By David Linthicum Jul 09, 2024 5 mins Generative AI Cloud Architecture Artificial Intelligence analysis The next 10 years for cloud computing Despite AI's explosive growth, the industry still needs to face facts that customers are unhappy about costs and vendor lock-in. By David Linthicum Jul 05, 2024 5 mins Amazon Web Services Google Cloud Platform Microsoft Azure analysis Serverless cloud technology fades away Serverless was a big deal for a hot minute, but now it seems old-fashioned, even though its basic elements, agility and scalability, are still relevant. By David Linthicum Jul 02, 2024 4 mins Serverless Computing Cloud Computing Software Development Resources Videos