Jump to content

Data management

From Wikipedia, the free encyclopedia
The data lifecycle

Data management comprises all disciplines related to handling data as a valuable resource, it is the practice of managing an organization's data so it can be analyzed for decision making.[1]

Concept

[edit]

The concept of data management emerged alongside the evolution of computing technology. In the 1950s, as computers became more prevalent, organizations began to grapple with the challenge of organizing and storing data efficiently. Early methods relied on punch cards and manual sorting, which were labor-intensive and prone to errors. The introduction of database management systems in the 1970s marked a significant milestone, enabling structured storage and retrieval of data.

By the 1980s, relational database models revolutionized data management, emphasizing the importance of data as an asset and fostering a data-centric mindset in business. This era also saw the rise of data governance practices, which prioritized the organization and regulation of data to ensure quality and compliance. Over time, advancements in technology, such as cloud computing and big data analytics, have further refined data management, making it a cornerstone of modern business operations.

As of 2025, data management encompasses a wide range of practices, from data storage and security to analytics and decision-making, reflecting its critical role in driving innovation and efficiency across industries.[2]

Topics in Data Management

[edit]

The Data Management Body of Knowledge, DMBoK, developed by the Data Management Association, DAMA, outlines key knowledge areas that serve as the foundation for modern data management practices. suggesting a framework for organizations to manage data as a strategic asset.

Data Governance

[edit]

Setting policies, procedures, and accountability frameworks to ensure that data is accurate, secure, and used responsibly throughout the organization.

Data Architecture

[edit]

Focuses on designing the overall structure of data systems. It ensures that data flows are efficient and that systems are scalable, adaptable, and aligned with business needs.

Data Modeling and Design

[edit]

This area centers on creating models that logically represent data relationships. It’s essential for both designing databases and ensuring that data is structured in a way that facilitates analysis and reporting.

Data Storage and Operations

[edit]

Deals with the physical storage of data and its day-to-day management. This includes everything from traditional data centers to cloud-based storage solutions and ensuring efficient data processing.

Data Security

[edit]

Encompasses the practices and technologies needed to protect data from unauthorized access, breaches, and other security threats, ensuring data privacy and compliance with regulations.

Data Integration and Interoperability

[edit]

Ensures that data from various sources can be seamlessly shared and combined across multiple systems, which is critical for comprehensive analytics and decision-making.

Document and Content Management

[edit]

Focuses on managing unstructured data such as documents, multimedia, and other content, ensuring that it is stored, categorized, and easily retrievable.

Data Warehousing, Business Intelligence and Data Analytics

[edit]

Involves consolidating data into repositories that support analytics, reporting, and business insights.

Metadata Management

[edit]

Manages data about data, including definitions, origin, and usage, to enhance the understanding and usability of the organization’s data assets.

Data Quality Management

[edit]

Dedicated to ensuring that data remains accurate, complete, and reliable, this area emphasizes continuous monitoring and improvement practices.

Reference and master data management

[edit]

Reference data comprises standardized codes and values for consistent interpretation across systems. Master data management (MDM) governs and centralizes an organization’s critical data, ensuring a unified, reliable information source that supports effective decision-making and operational efficiency.

Data security

[edit]

Data security refers to a comprehensive set of practices and technologies designed to protect digital information and systems from unauthorized access, use, disclosure, modification, or destruction. It encompasses encryption, access controls, monitoring, and risk assessments to maintain data integrity, confidentiality, and availability.

Data privacy

[edit]

Data privacy involves safeguarding individuals’ personal information by ensuring its collection, storage, and use comply with consent, legal standards, and confidentiality principles. It emphasizes protecting sensitive data from misuse or unauthorized access while respecting users' rights.

Data Management as a foundation of Information Management

[edit]

The distinction between data and derived value is illustrated by the "information ladder" or the DIKAR model.

diagram displays the DIKAR model - Data, Information, Knowledge, Action, Response. A model showing the relationship between data, information and knowledge.
The DIKAR model - Data, Information, Knowledge, Action, Response. A model showing the relationship between data, information and knowledge.

The "DIKAR" model stands for Data, Information, Knowledge, Action, and Result. It is a framework used to bridge the gap between raw data and actionable outcomes. The model emphasizes the transformation of data into information, which is then interpreted to create knowledge. This knowledge guides actions that lead to measurable results. DIKAR is widely applied in organizational strategies, helping businesses align their data management processes with decision-making and performance goals. By focusing on each stage, the model ensures that data is effectively utilized to drive informed decisions and achieve desired outcomes. It is particularly valuable in technology-driven environments.

The "information ladder" illustrates the progression from data (raw facts) to information (processed data), knowledge (interpreted information), and ultimately wisdom (applied knowledge). Each step adds value and context, enabling better decision-making. It emphasizes the transformation of unstructured inputs into meaningful insights for practical use.

Data management in research

[edit]

In research, Data management refers to the systematic process of handling data throughout its lifecycle. This includes activities such as collecting, organizing, storing, analyzing, and sharing data to ensure its accuracy, accessibility, and security.

Effective data management also involves creating a data management plan, DMP, addressing issues like ethical considerations, compliance with regulatory standards, and long-term preservation. Proper management enhances research transparency, reproducibility, and the efficient use of resources, ultimately contributing to the credibility and impact of research findings. It is a critical practice across disciplines to ensure data integrity and usability both during and after a research project.

Big Data

[edit]

However, data has staged a comeback with the popularisation of the term big data, which refers to the collection and analyses of massive sets of data. While big data is a recent phenomenon, the requirement for data to aid decision-making traces back to the early 1970s with the emergence of decision support systems (DSS). These systems can be considered as the initial iteration of data management for decision support.[3]

Several organisations have established data management centers (DMC) for their operations.[4]

Data sources

[edit]

Marketers and marketing organizations have been using data collection and analysis to refine their operations for the last few decades. Marketing departments in organizations and marketing companies conduct data collection and analysis by collecting data from different data sources and analyzing them to come up with insightful data they can use for strategic decision-making (Baier et al., 2012). In the modern business environment, data has evolved into a crucial asset for businesses since businesses use data as a strategic asset that is used regularly to create a competitive advantage and improve customer experiences. Among the most significant forms of data is customer information which is a critical asset used to assess customer behavior and trends and use it for developing new strategies for improving customer experience (Ahmed, 2004). However, data has to be of high quality to be used as a business asset for creating a competitive advantage. Therefore, data governance is a critical element of data collection and analysis since it determines the quality of data while integrity constraints guarantee the reliability of information collected from data sources. Various technologies including Big Data are used by businesses and organizations to allow users to search for specific information from raw data by grouping it based on the preferred criteria marketing departments in organizations could apply for developing targeted marketing strategies (Ahmed, 2004). As technology evolves, new forms of data are being introduced for analysis and classification purposes in marketing organizations and businesses. The introduction of new gadgets such as Smartphones and new-generation PCs has also introduced new data sources from which organizations can collect, analyze and classify data when developing marketing strategies. Retail businesses are the business category that uses customer data from smart devices and websites to understand how their current and targeted customers perceive their services before using the information to make improvements and increase customer satisfaction (Cerchiello and Guidici, 2012). Analyzing customer data is crucial for businesses since it allows marketing teams to understand customer behavior and trends which makes a considerable difference during the development of new marketing campaigns and strategies. Retailers who use customer data from various sources gain an advantage in the market since they can develop data-informed strategies for attracting and retaining customers in the overly competitive business environment. Based on the information on the benefits of data collection and analysis, the following hypotheses are proposed: The sources of data used as the foundation of data collection and analysis have a considerable impact on the data analysis tools used for analyzing and categorizing data.

Financial and economic outcomes

[edit]

Studies indicate that customer transactions account for a 40% increase in the data collected annually, which means that financial data has a considerable impact on business decisions. Therefore, modern organizations are using big data analytics to identify 5 to 10 new data sources that can help them collect and analyze data for improved decision-making. Jonsen (2013) explains that organizations using average analytics technologies are 20% more likely to gain higher returns compared to their competitors who have not introduced any analytics capabilities in their operations. Also, IRI reported that the retail industry could experience an increase of more than $10 billion each year resulting from the implementation of modern analytics technologies. Therefore, the following hypothesis can be proposed: Economic and financial outcomes can impact how organizations use data analytics tools.

See also

[edit]

References

[edit]
  1. ^ "What Is Data Management? Importance & Challenges | Tableau". www.tableau.com. Retrieved 2023-12-04.
  2. ^ Kramer, Robert (20 Mar 2025). "The State Of Enterprise Data Management In Early 2025". Forbes. Retrieved 8 Apr 2025.
  3. ^ Watson, Hugh J.; Marjanovic, Olivera (2013). "Big Data: The Fourth Data Management Generation". Business Intelligence Journal; Seattle. 18 (3): 4–8.
  4. ^ For example: Kumar, Sangeeth; Ramesh, Maneesha Vinodini (2010). "Lightweight Management framework (LMF) for a Heterogeneous Wireless Network for Landslide Detection". In Meghanathan, Natarajan; Boumerdassi, Selma; Chaki, Nabendu; Nagamalai, Dhinaharan (eds.). Recent Trends in Networks and Communications: International Conferences, NeCoM 2010, WiMoN 2010, WeST 2010,Chennai, India, July 23-25, 2010. Proceedings. Communications in Computer and Information Science. Vol. 90. Springer. p. 466. ISBN 9783642144936. Retrieved 2016-06-16. 4.4 Data Management Center (DMC)[:] The Data Management Center is the data center for all of the deployed cluster networks. Through the DMC, the LMF allows the user to list the services in any cluster member belonging to any cluster [...]. Data collecting.
  5. ^ "Data Mesh: Delivering data-driven value at scale".

Further reading

[edit]
  • Sebastian-Coleman, Laura (2018). Navigating the Labyrinth: An Executive Guide to Data Management. New York: Morgan Kaufmann.
  • The DAMA Guide to the Data Management Body of Knowledge (DMBoK): Data Management for Practitioners and Professionals (2 ed.). DAMA International, Technics Publications. 2017.
[edit]