Passionately curious about Data, Databases and Systems Complexity. Data is ubiquitous, the database universe is dichotomous (structured and unstructured), expanding and complex. Find my Database Research at SQLToolkit.co.uk . Microsoft Data Platform MVP

"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein

Tuesday, 10 May 2022

Tech in Ten, On the Road by Venturi!

I was excited to be a part of the Tech in Ten podcast by Venturi discussing SQLBits, Data Governance and Data Toboggan. It has been released as Episode 9.

The Tech in Ten team went down to the SQLBits data platform conference held at ExCeL London. Ben sat down with some of the speakers and data professionals at the event to talk about all things data and the ever-changing industry.

Listen to the podcast on Spotify. 

Thursday, 5 May 2022

CDO and Data Leaders Global Summit

Today was the CDO and data leaders global summit hosted by the EDM Council and CDO Magazine designed specifically for senior and C-level data and analytics executives. 

One of the sessions was about the EDM. 

There were discussion about the Cloud Data Management Capabilities (CDMC) 14 key controls and Automations. These should help build trust and confidence in the industry. 

The EDM Councils areas of Advocacy 

  • Best Practice - DCAM & Cloud CDMC, Data ROI, ESG Data (environmental, social & corporate governance)
  • Driving Standards - Knowledge Graph, Industry Ontologies, shared lab
  • Training and Certification - virtual & elearning and webinars and events
  • Research and Benchmarking - Global Industry Study, Life Sciences, Data Sharing
  • Regulatory Engagement - Regulators participation
  • Networking - Data Visions, CDO Summit, workgroups and forums

Learn more about the CDMC, and download with a free license for internal use, at: https://edmcouncil.org/page/CDMC

For  those who have been very DAMA Book of Knowledge focused the DCAM (Data Management Capability Assessment Model) is closely aligned. CDMC took the DCAM framework and focused on managing data in Cloud.  It contains a number of best practices specific to the challenges (and opportunities) of Cloud. The council has been in regular communication with global regulators about CDMC, on what controls may be needed in ensuring safe and effective Cloud environments.

Controls that will be used in Snowflake to satisfy the EDMC's CDMC framework can be found at the GIT link is: https://github.com/Snowflake-Labs/EDMC-CDMC-v1-14-Control_Mapping . On the Microsoft side in Microsoft Purview Compliance Manager there are templates for CDMC key controls.

The keynote was interesting talking about how the CDMC - Cloud Data Management Capabilities Framework - will have a global impact as the best practice standard for cloud Data.  The session labelled as global industry standard for accelerating trusted cloud adoption created through global industry workgroup between May 2020 to September 2021.  There were 100+ companies and 300+ participants. The panel was made up of 


Two reports to read
EDM Council ESG Corporate Reporting Entities report 
EDM Council ESG Rating Providers and Data Aggregators report

Sunday, 1 May 2022

Different data models and frameworks

 There are 3 different models that can help when thinking about data governance and data management.

The revised DAMA Wheel has data governance at the top

The core components to think about for data governance and data management are: Policy; Stewardship & Ownership; Culture Change; Strategy; Principles and Ethics; Data Valuation; Data Maturity Assessment; Data Classification.

When getting started I look at the fundamentals as a starting place.
1. Data Governance
2. Meta Data Management
3. Data Quality
4. Reference/Master Data

DAMA-DMBOK | Data Management Body of Knowledge

DCAM (Data Management Capability Assessment Model) was first published in 2014 following the Socratic method of question and debate. . CDMC (Cloud Data Management Capabilities Framework) is a playbook of best practice for managing data in the cloud. Version 1.1.1 was released in September 2021 , created by a cross industry workgroup of 100+ firms. It is a framework for best data management practices to accelerate trusted cloud adoption.  It can be downloaded here. The holistic list of capabilities highlighted in the CDMC from the EDM Council is:

Data Cataloguing and Discovery 
Data Classification 
Data Ownership 
Data Security 
Data Sovereignty and Cross-Border Data Sharing 
Data Quality
Data Lifecycle Management 
Data Entitlements and Access Tracking  
Data Lineage  
Data Privacy 
Trusted Source Management and Data Contracts 
Ethical Use and Purpose 
Master Data Management

The Data Management Maturity (DMM)  program from the CMMI Institute has best practices for providing support for the implementation of process for these five categories: strategy; governance; data quality; operations; and platform and architecture. 

Best practice for ensuring broad participation in the practice and senior oversight of the effectiveness of data management. 

Capability Maturity Model Integration  (CMMI) Six themes  The CMMI models are described as collections of effective practices and process improvement goals that organisations can use to evaluate and improve their processes.