The larger your database, the higher the possibility of data repetition and inaccuracies that compromise the results you pull from the database. Normalization in DBMS exists to counteract those problems by helping you to create more uniform databases in which redundancies are less likely to occur.


Mastering normalization is a key skill in DBMS for the simple fact that an error-strewn database is of no use to an organization. For example, a retailer that has to deal with a database that has multiple entries for phone numbers and email addresses is a retailer that can’t see as effectively as one that has a simple route to the customer. Let’s look at normalization in DBMS and how it helps you to create a more organized database.


The Concept of Normalization


Grab a pack of playing cards and throw them onto the floor. Now, pick up the “Jack of Hearts.” It’s a tough task because the cards are strewn all over the place. Some are facing down and there’s no rhyme, reason, or pattern to how the cards lie, meaning you’re going to have to check every card individually to find the one you want.


That little experiment shows you how critical organization is, even with a small set of “data.” It also highlights the importance of normalization in DBMS. Through normalization, you implement organizational controls using a set of principles designed to achieve the following:


  • Eliminate redundancy – Lower (or eliminate) occurrences of data repeating across different tables, or inside individual tables, in your DBMS.
  • Minimize data anomalies – Better organization makes it easier to spot datasets that don’t fit the “norm,” meaning fewer anomalies.
  • Improve data integrity – More accurate data comes from normalization controls. Database users can feel more confident in their results because they know that the controls ensure integrity.

The Process of Normalization


If normalization in DBMS is all about organization, it stands to reason that they would be a set process to follow when normalizing your tables and database:


  1. Decompose your tables – Break every table down into its various parts, which may lead to you creating several tables out of one. Through decomposition, you separate different datasets, eliminate inconsistencies, and set the stage for creating relationships and dependencies between tables.
  2. Identify functional dependencies – An attribute in one table may be dependent on another to exist. For example, a “Customer ID” number in a retailer’s “Customer” table is functionally dependent on the “Customer Name” field because the ID can’t exist without the customer. Identifying these types of dependencies ensures you don’t end up with empty records (such as a record with a “Customer ID” and no customer attached to it).
  3. Apply normalization rules – Once you’re broken down your table and identified the functional dependencies, you apply relevant normalization rules. You’ll use Normal Forms to do this, with the six highlighted below each having its own rules, structures, and use cases.

Normal Forms in DBMS


There isn’t a “single” way to achieve normalization in DBMS because every database (and the tables it contains) is different. Instead, there are six normal forms you may use, with each having its own rules that you need to understand to figure out which to apply.


First Normal Form (1NF)


If a relation can’t contain multiple values, it’s in 1NF. In other words, each attribute in the table can only contain a single (called “atomic”) value.


Example


If a retailer wants to store the details of its customers, it may have attributes in its table like “Customer Name,” “Phone Number,” and “Email Address.” By applying 1NF to this table, you ensure that the attributes that could contain multiple entries (“Phone Number” and “Email Address”) only contain one, making contacting that customer much simpler.


Second Normal Form (2NF)


A table that’s in 2NF is in 1NF, with the additional condition that none of its non-prime attributes depend on a subset of candidate keys within the table.


Example


Let’s say an employer wants to create a table that contains information about an employee, the skills they have, and their age. An employee may have multiple skills, leading to multiple records for the same employee in the table, with each denoting a skill while the ID number and age of the employee repeat for each record.


In this table, you’ve achieved 1NF because each attribute has an atomic value. However, the employee’s age is dependent on the employee ID number. To achieve 2NF, you’d break this table down into two tables. The first will contain the employee’s ID number and age, with that ID number linking to a second table that lists each of the skills associated with the employee.


Third Normal Form (3NF)


In 3NF, the table you have must already be in 2NF form, with the added rule of removing the transitive functional dependency of the non-prime attribute of any super key. Transitive functional dependency occurs if the dependency is the result of a pair of functional dependencies. For example, the relationship between A and C is a transitive dependency if A depends on B, B depends on C, but B doesn’t depend on A.


Example


Let’s say a school creates a “Students” table with the following attributes:


  • Student ID
  • Name
  • Zip Code
  • State
  • City
  • District

In this case, the “State,” “District,” and “City” attributes all depend on the “Zip Code” attribute. That “Zip” attribute depends on the “Student ID” attribute, making “State,” “District,” and “City” all transitively depending on “Student ID.”


To resolve this problem, you’d create a pair of tables – “Student” and “Student Zip.” The “Student” table contains the “Student ID,” “Name,” and “Zip Code” attributes, with that “Zip Code” attribute being the primary key of a “Student Zip” table that contains the rest of the attributes and links to the “Student” table.



Boyce-Codd Normal Form (BCNF)


Often referred to as 3.5NF, BCNF is a stricter version of 3NF. So, this normalization in DBMS rule occurs if your table is in 3NF, and for every functional dependence between two fields (i.e., A -> B), A is the super key of your table.


Example


Sticking with the school example, every student in a school has multiple classes. The school has a table with the following fields:


  • Student ID
  • Nationality
  • Class
  • Class Type
  • Number of Students in Class

You have several functional dependencies here:


  • Student ID -> Nationality
  • Class -> Number of Students in Class, Class Type

As a result, both the “Student ID” and “Class” attributes are candidate keys but can’t serve as keys alone. To achieve BCNF normalization, you’d break the above table into three – “Student Nationality,” “Student Class,” and “Class Mapping,” allowing “Student ID” and “Class” to serve as primary keys in their own tables.


Fourth Normal Form (4NF)


In 4NF, the database must meet the requirements of BCNF, in addition to containing no more than a single multivalued dependency. It’s often used in academic circles, as there’s little use for 4NF elsewhere.


Example


Let’s say a college has a table containing the following fields:


  • College Course
  • Lecturer
  • Recommended Book

Each of these attributes is independent of the others, meaning each can change without affecting the others. For example, the college could change the lecturer of a course without altering the recommended reading or the course’s name. As such, the existence of the course depends on both the “Lecturer” and “Recommended Book” attributes, creating a multivalued dependency. If a DBMS has more than one of these types of dependencies, it’s a candidate for 4NF normalization.


Fifth Normal Form (5NF)


If your table is in 4NF, has no join dependencies, and all joining is lossless, it’s in 5NF. Think of this as the final form when it comes to normalization in DBMS, as you’ve broken your table down so much that you’ve made redundancy impossible.


Example


A college may have a table that tells them which lecturers teach certain subjects during which semesters, creating the following attributes:


  • Subject
  • Lecturer Name
  • Semester

Let’s say one of the lecturers teaches both “Physics” and “Math” for “Semester 1,” but doesn’t teach “Math” for Semester 2. That means you need to combine all of the fields in this table to get an accurate dataset, leading to redundancy. Add a third semester to the mix, especially if that semester has no defined courses or lecturers, and you have to join dependencies.


The 5NF solution is to break this table down into three tables:


  • Table 1 – Contains the “Semester” and “Subject” attributes to show which subjects are taught in each semester.
  • Table 2 – Contains the “Subject” and “Lecturer Name” attributes to show which lecturers teach a subject.
  • Table 3 – Contains the “Semester” and “Lecturer Name” attributes so you can see which lecturers teach during which semesters.

Benefits of Normalization in DBMS


With normalization in DBMS being so much work, you need to know the following benefits to show that it’s worth your effort:


  • Improved database efficiency
  • Better data consistency
  • Easier database maintenance
  • Simpler query processing
  • Better access controls, resulting in superior security

Limitations and Trade-Offs of Normalization


Normalization in DBMS does have some drawbacks, though these are trade-offs that you accept for the above benefits:


  • The larger your database gets, the more demands it places on system performance.
  • Breaking tables down leads to complexity.
  • You have to find a balance between normalization and denormalization to ensure your tables make sense.

Practical Tips for Mastering Normalization Techniques


Getting normalization in DBMS is hard, especially when you start feeling like you’re dividing tables into so many small tables that you’re losing track of the database. These tips help you apply normalization correctly:


  • Understand the database requirements – Your database exists for you to extract data from it, so knowing what you’ll need to extract indicates whether you need to normalize tables or not.
  • Document all functional dependencies – Every functional dependence that exists in your database makes the table in which it exists a candidate for normalization. Identify each dependency and document it so you know whether you need to break the table down.
  • Use software and tools – You’re not alone when poring through your database. There are plenty of tools available that help you to identify functional dependencies. Many make normalization suggestions, with some even being able to carry out those suggestions for you.
  • Review and refine – Every database evolves alongside its users, so continued refining is needed to identify new functional dependencies (and opportunities for normalization).
  • Collaborate with other professionals – A different set of eyes on a database may reveal dependencies and normalization opportunities that you don’t see.

Make Normalization Your New Norm


Normalization may seem needlessly complex, but it serves the crucial role of making the data you extract from your database more refined, accurate, and free of repetition. Mastering normalization in DBMS puts you in the perfect position to create the complex databases many organizations need in a Big Data world. Experiment with the different “normal forms” described in this article as each application of the techniques (even for simple tables) helps you get to grips with normalization.

Related posts

Sage: The ethics of AI: how to ensure your firm is fair and transparent
OPIT - Open Institute of Technology
OPIT - Open Institute of Technology
Mar 7, 2025 3 min read

Source:


By Chris Torney

Artificial intelligence (AI) and machine learning have the potential to offer significant benefits and opportunities to businesses, from greater efficiency and productivity to transformational insights into customer behaviour and business performance. But it is vital that firms take into account a number of ethical considerations when incorporating this technology into their business operations. 

The adoption of AI is still in its infancy and, in many countries, there are few clear rules governing how companies should utilise the technology. However, experts say that firms of all sizes, from small and medium-sized businesses (SMBs) to international corporations, need to ensure their implementation of AI-based solutions is as fair and transparent as possible. Failure to do so can harm relationships with customers and employees, and risks causing serious reputational damage as well as loss of trust.

What are the main ethical considerations around AI?

According to Pierluigi Casale, professor in AI at the Open Institute of Technology, the adoption of AI brings serious ethical considerations that have the potential to affect employees, customers and suppliers. “Fairness, transparency, privacy, accountability, and workforce impact are at the core of these challenges,” Casale explains. “Bias remains one of AI’s biggest risks: models trained on historical data can reinforce discrimination, and this can influence hiring, lending and decision-making.”

Part of the problem, he adds, is that many AI systems operate as ‘black boxes’, which makes their decision-making process hard to understand or interpret. “Without clear explanations, customers may struggle to trust AI-driven services; for example, employees may feel unfairly assessed when AI is used for performance reviews.”

Casale points out that data privacy is another major concern. “AI relies on vast datasets, increasing the risk of breaches or misuse,” he says. “All companies operating in Europe must comply with regulations such as GDPR and the AI Act, ensuring responsible data handling to protect customers and employees.”

A third significant ethical consideration is the potential impact of AI and automation on current workforces. Businesses may need to think about their responsibilities in terms of employees who are displaced by technology, for example by introducing training programmes that will help them make the transition into new roles.

Olivia Gambelin, an AI ethicist and the founder of advisory network Ethical Intelligence, says the AI-related ethical considerations are likely to be specific to each business and the way it plans to use the technology. “It really does depend on the context,” she explains. “You’re not going to find a magical checklist of five things to consider on Google: you actually have to do the work, to understand what you are building.”

This means business leaders need to work out how their organisation’s use of AI is going to impact the people – the customers and employees – that come into contact with it, Gambelin says. “Being an AI-enabled company means nothing if your employees are unhappy and fearful of their jobs, and being an AI-enabled service provider means nothing if it’s not actually connecting with your customers.”

Read the full article below:

Read the article
Reuters: EFG Watch: DeepSeek poses deep questions about how AI will develop
OPIT - Open Institute of Technology
OPIT - Open Institute of Technology
Feb 10, 2025 4 min read

Source:

  • Reuters, Published on February 10th, 2025.

By Mike Scott

Summary

  • DeepSeek challenges assumptions about AI market and raises new ESG and investment risks
  • Efficiency gains significant – similar results being achieved with less computing power
  • Disruption fuels doubts over Big Tech’s long-term AI leadership and market valuations
  • China’s lean AI model also casts doubt on costly U.S.-backed Stargate project
  • Analysts see DeepSeek as a counter to U.S. tariffs, intensifying geopolitical tensions

February 10 – The launch by Chinese company DeepSeek, opens new tab of its R1 reasoning model last month caused chaos in U.S. markets. At the same time, it shone a spotlight on a host of new risks and challenged market assumptions about how AI will develop.

The shock has since been overshadowed by President Trump’s tariff wars, opens new tab, but DeepSeek is set to have lasting and significant implications, observers say. It is also a timely reminder of why companies and investors need to consider ESG risks, and other factors such as geopolitics, in their investment strategies.

“The DeepSeek saga is a fascinating inflection point in AI’s trajectory, raising ESG questions that extend beyond energy and market concentration,” Peter Huang, co-founder of Openware AI, said in an emailed response to questions.

DeepSeek put the cat among the pigeons by announcing that it had developed its model for around $6 million, a thousandth of the cost of some other AI models, while also using far fewer chips and much less energy.

Camden Woollven, group head of AI product marketing at IT governance and compliance group GRC International, said in an email that “smaller companies and developers who couldn’t compete before can now get in the game …. It’s like we’re seeing a democratisation of AI development. And the efficiency gains are significant as they’re achieving similar results with much less computing power, which has huge implications for both costs and environmental impact.”

The impact on AI stocks and companies associated with the sector was severe. Chipmaker Nvidia lost almost $600 billion in market capitalisation after the DeepSeek announcement on fears that demand for its chips would be lower, but there was also a 20-30% drop in some energy stocks, said Stephen Deadman, UK associate partner at consultancy Sia.

As Reuters reported, power producers were among the biggest winners in the S&P 500 last year, buoyed by expectations of ballooning demand from data centres to scale artificial intelligence technologies, yet they saw the biggest-ever one-day drops after the DeepSeek announcement.

One reason for the massive sell-off was the timing – no-one was expecting such a breakthrough, nor for it to come from China. But DeepSeek also upended the prevailing narrative of how AI would develop, and who the winners would be.

Tom Vazdar, professor of cybersecurity and AI at Open Institute of Technology (OPIT), pointed out in an email that it called into question the premise behind the Stargate Project,, opens new tab a $500 billion joint venture by OpenAI, SoftBank and Oracle to build AI infrastructure in the U.S., which was announced with great fanfare by Donald Trump just days before DeepSeek’s announcement.

“Stargate has been premised on the notion that breakthroughs in AI require massive compute and expensive, proprietary infrastructure,” Vazdar said in an email.

There are also dangers in markets being dominated by such a small group of tech companies. As Abbie Llewellyn-Waters, Investment manager at Jupiter Asset Management, pointed out in a research note, the “Magnificent Seven” tech stocks had accounted for nearly 60% of the index’s gains over the previous two years. The group of mega-caps comprised more than a third of the S&P 500’s total value in December 2024.

Read the full article below:

Read the article