Passionately curious about Data, Databases and Systems Complexity. Data is ubiquitous, the database universe is dichotomous (structured and unstructured), expanding and complex. Find my Database Research at SQLToolkit.co.uk . Microsoft Data Platform MVP

"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein

Friday, 3 December 2021

The Chief Data Officer seat at the table

Microsoft have written a white paper called Microsoft Azure: The Chief Data Officer (CDO) Seat at the Cloud Table It is an interesting paper to read.

The Whitepaper looks at four areas

  1. Journey
  2. Framework
  3. Product
  4. Resources

The document mentions other related capabilities that need to be included in a data governance program:

  • Data Discoverability 
  • Data Quality Management
  • Data Access Management 
  • Data Compliance 
  • Data Lifecycle Management 
  • Data Health Scorecards  

Microsoft’s data governance approach is listed as

  • Set the scope of data governance for your organization
  • Set enterprise data governance requirements through policies and standards
  • Set ownership and accountability for data governance
  • Start with a unified, metadata-driven vision with automation
  • Iterate, not big bang
  • Educate and enable change
  • Monitor and revise

The Data Management Capabilities Model Framework (DCAM) core capabilities closely align the Microsoft Framework.  DCAM establishes the data strategy, position the business case, implement the operating model, ensure funding and supportive organizational collaboration.

The holistic list of capabilities highlighted in the CDMC from the EDM Council is:

  • Data Cataloguing and Discovery 
  • Data Classification 
  • Data Ownership 
  • Data Security 
  • Data Sovereignty and Cross-Border Data Sharing 
  • Data Quality
  • Data Lifecycle Management 
  • Data Entitlements and Access Tracking  
  • Data Lineage  
  • Data Privacy 
  • Trusted Source Management and Data Contracts 
  • Ethical Use and Purpose 
  • Master Data Management


Sunday, 28 November 2021

Data Toboggan 2022

We are pleased to announce that Data Toboggan 2022 is back on 29 January 2022. 7:45 AM to 7:59 PM GMT

Call for Speakers https://bit.ly/DT22CFS 

Register https://bit.ly/DT22Register


Join us for our THIRD all-day event specializing in Azure Synapse Analytics !

Azure Synapse Analytics is a practically limitless analytics service that brings together data integration, enterprise data warehousing and big data analytics. Let's spend a day exploring and showcasing these capabilities.

It is this analytical power that will help enable any organisation to transform from being reactive into being truly proactive, generating actionable insights that enable both business flow and timely decision-support.

This is a virtual event and free to attend.

As it's our third event, there will be 3 session types:

  • Standard Sessions : 45 minutes long.
  • Live Short Talks : 5 to 10 minutes.
  • Recorded Short Talks : 5 to 10 minutes.

We'll have most awesome content we can find, with a wide range of speakers and experience. Check out our CFS page (https://sessionize.com/data-toboggan-2022/) if you're thinking of submitting !

Friday, 26 November 2021

Event Synopsis for Azure World Synapse Day

We were really excited to have run a new type of event, Azure World Synapse Day. The event crossed 3 time zones APAC, EMEA and AMER.

The aim of the unconference was to have a lighter format that allowed people to share their personal stories, to share things about Azure Synapse technology, provide demos etc. Our interpretation of the unconference was 

Our session planning run list looked like

Followed by an intermission

Followed by an intermission

We had fun, we learnt a lot about running this type of event and wanted to thank our amazing speakers and attendees for sharing the event with us. 

We will be running the same type of event next year. We learnt a lot about this type of event. It will be coming further under the Data Toboggan brand as Data Toboggan - Alpine Coaster. So coasting through those shorter sessions.

Tuesday, 16 November 2021

T-SQL Tuesday #144 – Data Governance reimagination - Wrap up

This month’s T-SQL Tuesday attracted some great responses! Thank you to everyone who participated!

My invitation for this month’s #tsql2sday was 3 fold on sharing your experiences on data governance

  • The current cost of data governance versus its benefits
  • The amazing things data governance has enabled you to achieve or will enable you to achieve in the future
  • The potential uses for Azure Purview within your estates and the automated deployment options for that

Rob Farley published a post in reply


Rob raises some key points 

  • But the checks that we do are more about things that the database can allow, but are business scenarios that should never happen.
  • You need to discover which situations cause people not to trust the data.
  • Data quality can lead to the trust, but only when it has been demonstrated repeatedly over time. Trust must be earned

Deborah Melkin published a post in reply


Deborah Melkin talks about the switch to implement data governance.

  • It is about understanding your data from both the micro and macro level
  • It’s understanding where our data lives (data assets) and how data flows through data sources (data lineage) as well as how it’s consumed and used (data catalogs and data profiling). More importantly, this is knowledge that can be shared to make data even more valuable.
  • When you start expanding the number of databases and the complexity of how your systems work, the job of governance becomes a lot harder
  • Getting started with data governance seems like a very daunting task.

Data Governance is a broad topic with many different areas which can be be seen from the replies. There is plenty for us to get started with and I'm looking forward to using Azure Purview to help with this. 

Thank you for taking the time to post insightful posts. That is the wrap up. If I’ve missed anyone please let me know and I’ll update the post.

Wednesday, 10 November 2021

Bridge to a new universe: the end-to-end Azure Data Platform


Exciting to watch the Day 1 Keynote a Bridge to a new universe: the end-to-end Azure Data Platform delivered by Rohan Kumar and many other people. 

A journey to a new universe is just waiting to inspire innovation, to tap into limitless possibilities and potential. It covered how to shape your data so you can harness its power to find a new galaxy of insights, answers, and predictions.  Some amazing slides and discussion to set you on a new path. 

Three universes bridged together unmatched analytics and insights, limitless cloud data services and unified data governance.

Rohan shared a great quote "If you want to go fast, go alone. If you want to go far, go together. " The SQL Server community has always worked together to achieve some amazing goals. 
Three main data communities were discussed SQLSaturday, Data Saturdays and the Azure Data Community
SQL Server 2022 preview brings with it many new features.  

There are to be two interlinking services Azure Synapse Link and Azure Purview. 

More details were discussed in the keynote but I will share those separately. As ever an inspiring future for us in the data community. 

Read More

Microsoft Ignite book of news

 Announcing SQL Server 2022 preview: Azure-enabled with continued performance and security innovation

Data Toboggan Azure World Synapse Day: Speakers

Data Toboggan have an amazing line up of speakers. We would love you to join us and support our amazing speakers who have given up their time to speak. 

Register now https://bit.ly/RegisterDTWSD21 


Lakehouse in a nutshell: Serverless SQL pool + Aggs + PowerBI - Armando Lacerda

DW Automation for EDW in Synapse  - Demo Only - Bob Duffy

Manage Packages on Synapse Spark - Dustin Vannoy

Migrating a Data Warehouse to Synapse Analytics - Andy Cutler

Patterns with Synapse Notebooks - Damien O'Connor

From Housekeeping to Data Engineer - My journey to find my passion -Jean Joseph

Secrets of SQL Dedicated Pool - Dennes Torres

Spreading the word about Azure Synapse Analytics - Sidney Cirqueira

Synapse and Power BI - Intro to a great data mix - Gaston Cruz

dbt & Synapse: have you seen SQL do this before? - Anders Swanson

Distributed Data in Dedicated SQL Pool - Rob Farley

Tuesday, 9 November 2021

Data Toboggan extravaganza

There nothing so exciting as a surprise. Data Toboggan is trying something different. Take an international journey with us through 3 time zones: 

APAC - 08:00 - 09.00 GMT; 

EMEA - 12:00 - 13.20 GMT; 

AMER - 17:00 - 18.20 GMT

Bit size sessions to share how Azure Synapse Analytics inspired you, empowers you, and how it accelerates your business analytics. Register now: https://bit.ly/RegisterDTWSD21 

The Session Titles

The Abstracts

The Speakers

Jean Joseph, Dennes Torres, Anders Swanson, Rob Farley, Bob Duffy, Andy Cutler, Damien O'Connor, Sidney Cirqueira, Armando Lacerda, Gaston Cruz, Dustin Vannoy