Secure: We will employ security controls to ensure data collaboration is operationally secure where it is desired.
Chaos, complexity, curiosity and database systems. A place where research meets industry
Welcome
"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein
Wednesday, 22 December 2021
Open Data Campaign
Secure: We will employ security controls to ensure data collaboration is operationally secure where it is desired.
Sunday, 19 December 2021
Data Toboggan - our year 2021
We have had a very busy first year and enjoyed it all. Thank you to all who made our conferences possible, organisers, speakers, attendees, our logo designer and to those who helped share our event.
We are back 29 January 2022.
Call for Speakers https://bit.ly/DT22CFS
Register https://bit.ly/DT22Register
Wednesday, 15 December 2021
Azure Purview Dataset provisioning by data owner for Azure Storage (preview)
Wednesday, 8 December 2021
Put Responsible AI into Practice
I attended a digital event, 7 December, where Microsoft launched the Ten Guidelines for Product Leaders to Implement AI Responsibly following their own journey. This is a really useful document and has been collated with diverse perspectives, lived and possessional skills sets. It is where technology meets society and business and research have been working together to enhance the output.
Microsoft shared their path to a responsible AI governance model.
- AETHER - AI & Ethics in Engineering & Research
- ORA - Office of Responsible AI
- RAISE - Responsible AI Strategy in Engineering
The AI guidelines process has 3 stages:
- Assess & prepare
- Design, build, & document
- Validate & support
The report explains the actionable steps
There is a Responsible AI dashboard which is helpful for actionable insights. The responsible AI dashboard includes: Error Analysis Model Statistics, Data Explorer, Aggregate feature importance, What-if counterfactuals, Causal analysis.
There is a Responsible AI Toolbox to get started with
Friday, 3 December 2021
The Chief Data Officer seat at the table
Microsoft have written a white paper called Microsoft Azure: The Chief Data Officer (CDO) Seat at the Cloud Table It is an interesting paper to read.
The Whitepaper looks at four areas
- Journey
- Framework
- Product
- Resources
The document mentions other related capabilities that need to be included in a data governance program:
- Data Discoverability
- Data Quality Management
- Data Access Management
- Data Compliance
- Data Lifecycle Management
- Data Health Scorecards
Microsoft’s data governance approach is listed as
- Set the scope of data governance for your organization
- Set enterprise data governance requirements through policies and standards
- Set ownership and accountability for data governance
- Start with a unified, metadata-driven vision with automation
- Iterate, not big bang
- Educate and enable change
- Monitor and revise
The Data Management Capabilities Model Framework (DCAM) core capabilities closely align the Microsoft Framework. DCAM establishes the data strategy, position the business case, implement the operating model, ensure funding and supportive organizational collaboration.
The holistic list of capabilities highlighted in the CDMC from the EDM Council is:
- Data Cataloguing and Discovery
- Data Classification
- Data Ownership
- Data Security
- Data Sovereignty and Cross-Border Data Sharing
- Data Quality
- Data Lifecycle Management
- Data Entitlements and Access Tracking
- Data Lineage
- Data Privacy
- Trusted Source Management and Data Contracts
- Ethical Use and Purpose
- Master Data Management
Monday, 29 November 2021
PASS Summit Key note Unified Data Governance with Azure Purview
Raghu Ramakrishnan CTO Data, Technical Fellow, Microsoft spoke at PASS Community Summit in November and explained the next part of the vision, policy, for data governance. Microsoft are seeing data governance as the emerging data pillar. Operational databases, unified analytics platform, and unified automated data governance. The unified part is the important element going forward, a unified single pane to extend governance across the entire data estate. Automated data classification to remove the PII headache of missing personal data and pushing the control up the stack to knowledge workers. Microsoft intend to have dynamic data providence that is fully integrated with the 6 responsible AI principles. Azure Purview will operate a Central RBAC control and is the governing permission future state for SQL Server with full propagation. With AI integrated the policy feature will be human readable. The link to watch the session .
- self-service search and browse
- curated and standardized business glossaries
- interactive lineage visualization
- simplified data curation and stewardship
- data asset distribution
- business glossary
- data classification and labelling
- data location and movement (in progress)
Still looking through the lens of GDPR Compliance data classification is an important feature
Sunday, 28 November 2021
Data Toboggan 2022
We are pleased to announce that Data Toboggan 2022 is back on 29 January 2022. 7:45 AM to 7:59 PM GMT
Call for Speakers https://bit.ly/DT22CFS
Register https://bit.ly/DT22Register
Details
Join us for our THIRD all-day event specializing in Azure Synapse Analytics !
Azure Synapse Analytics is a practically limitless analytics service that brings together data integration, enterprise data warehousing and big data analytics. Let's spend a day exploring and showcasing these capabilities.
It is this analytical power that will help enable any organisation to transform from being reactive into being truly proactive, generating actionable insights that enable both business flow and timely decision-support.
This is a virtual event and free to attend.
As it's our third event, there will be 3 session types:
- Standard Sessions : 45 minutes long.
- Live Short Talks : 5 to 10 minutes.
- Recorded Short Talks : 5 to 10 minutes.
We'll have most awesome content we can find, with a wide range of speakers and experience. Check out our CFS page (https://sessionize.com/data-toboggan-2022/) if you're thinking of submitting !
Friday, 26 November 2021
Event Synopsis for Azure World Synapse Day
We were really excited to have run a new type of event, Azure World Synapse Day. The event crossed 3 time zones APAC, EMEA and AMER.
The aim of the unconference was to have a lighter format that allowed people to share their personal stories, to share things about Azure Synapse technology, provide demos etc. Our interpretation of the unconference was
Our session planning run list looked like
We had fun, we learnt a lot about running this type of event and wanted to thank our amazing speakers and attendees for sharing the event with us.
We will be running the same type of event next year. We learnt a lot about this type of event. It will be coming further under the Data Toboggan brand as Data Toboggan - Alpine Coaster. So coasting through those shorter sessions.
Tuesday, 16 November 2021
T-SQL Tuesday #144 – Data Governance reimagination - Wrap up
This month’s T-SQL Tuesday attracted some great responses! Thank you to everyone who participated!
My
invitation for this month’s #tsql2sday
was 3 fold on sharing your experiences on data governance
- The current cost of data
governance versus its benefits
- The amazing things data
governance has enabled you to achieve or will enable you to achieve in the
future
- The potential uses for Azure
Purview within your estates and the automated deployment options for that
Rob Farley published a post in reply
http://blogs.lobsterpot.com.au/2021/11/09/being-sure-of-your-data/
Rob raises some key points
- But the checks that we do are more about things that the database can allow, but are business scenarios that should never happen.
- You need to discover which situations cause people not to trust the data.
- Data quality can lead to the trust, but only when it has been demonstrated repeatedly over time. Trust must be earned
Deborah Melkin published a post in reply
https://debthedba.wordpress.com/2021/11/09/t-sql-tuesday-144-data-governance/
Deborah Melkin talks about the switch to implement data governance.
- It is about understanding your data from both the micro and macro level
- It’s understanding where our data lives (data assets) and how data flows through data sources (data lineage) as well as how it’s consumed and used (data catalogs and data profiling). More importantly, this is knowledge that can be shared to make data even more valuable.
- When you start expanding the number of databases and the complexity of how your systems work, the job of governance becomes a lot harder
- Getting started with data governance seems like a very daunting task.
Data Governance is a broad topic with many different areas which can be be seen from the replies. There is plenty for us to get started with and I'm looking forward to using Azure Purview to help with this.
Thank you for taking the time to post insightful posts. That is the wrap up. If I’ve missed anyone please let me know and I’ll update the post.
Thursday, 11 November 2021
Drive a data culture to power a new class of data first applications
The PASS Data Community summit session keynote contained a section presented by Arun Ulag, Corporate Vice President of the Intelligence Platform at Microsoft.
From data to intelligence for everyone and for every decision at any scale. He talked about data integration, analytics and business intelligence. The 3 messages were:
Empower every individual with AI capabilities such as the automatic report insights in Power BI with descriptive and diagnostic insights and insights on the move.
Wednesday, 10 November 2021
Bridge to a new universe: the end-to-end Azure Data Platform
Exciting to watch the Day 1 Keynote a Bridge to a new universe: the end-to-end Azure Data Platform delivered by Rohan Kumar and many other people.
A journey to a new universe is just waiting to inspire innovation, to tap into limitless possibilities and potential. It covered how to shape your data so you can harness its power to find a new galaxy of insights, answers, and predictions. Some amazing slides and discussion to set you on a new path.
There are to be two interlinking services Azure Synapse Link and Azure Purview.
More details were discussed in the keynote but I will share those separately. As ever an inspiring future for us in the data community.
Read More
Announcing SQL Server 2022 preview: Azure-enabled with continued performance and security innovation
Data Toboggan Azure World Synapse Day: Speakers
Data Toboggan have an amazing line up of speakers. We would love you to join us and support our amazing speakers who have given up their time to speak.
Register now https://bit.ly/RegisterDTWSD21
Speakers
Lakehouse in a nutshell: Serverless SQL pool + Aggs + PowerBI - Armando Lacerda
DW Automation for EDW in Synapse - Demo Only - Bob Duffy
Manage Packages on Synapse Spark - Dustin Vannoy
Migrating a Data Warehouse to Synapse Analytics - Andy Cutler
Patterns with Synapse Notebooks - Damien O'Connor
From Housekeeping to Data Engineer - My journey to find my passion -Jean Joseph
Secrets of SQL Dedicated Pool - Dennes Torres
Spreading the word about Azure Synapse Analytics - Sidney Cirqueira
Synapse and Power BI - Intro to a great data mix - Gaston Cruz
dbt & Synapse: have you seen SQL do this before? - Anders Swanson
Distributed Data in Dedicated SQL Pool - Rob Farley
Tuesday, 9 November 2021
Data Toboggan extravaganza
There nothing so exciting as a surprise. Data Toboggan is trying something different. Take an international journey with us through 3 time zones:
APAC - 08:00 - 09.00 GMT;
EMEA - 12:00 - 13.20 GMT;
AMER - 17:00 - 18.20 GMT
Bit size sessions to share how Azure Synapse Analytics inspired you, empowers you, and how it accelerates your business analytics. Register now: https://bit.ly/RegisterDTWSD21
The Session Titles
The Abstracts
The Speakers
Jean Joseph, Dennes Torres, Anders Swanson, Rob Farley, Bob Duffy, Andy Cutler, Damien O'Connor, Sidney Cirqueira, Armando Lacerda, Gaston Cruz, Dustin Vannoy
Tuesday, 2 November 2021
Ignite Innovate Anywhere From Multicloud to Edge
- Hybrid and multicloud
- End-to-end data platform
- Cloud native development
- Developer velocity
- Deeper integration with VMware vSphere and Azure Stack HCI
- Azure Virtual Desktop on Azure Stack HCI
- Azure Arc enabled data services updates
- Extension of Microsoft Defender to AWS
- Azure Data Explorer
- Synapse Link SQL Server 2022
- Synapse Link Dataverse (GA)
Microsoft Ignite November 2021
Microsoft Ignite was opened by Satya Nadella on 2 November 2021. An inspiring session.
The headline for the opening was, our economy and society is undergoing a sea change of digitization. Satya talked about emerging technology trends and innovations across the Microsoft Cloud that will transform every business and industry going forward.
We are a moment of real structural change. The case for digital transformation has been never so urgent. What will happen and what we need to do to support our business is core with the transition of mobile to a cloud era to ubiquitous computing and ambient intelligence.
There are four key trends that he mentioned
- Hybrid work - when and where we work
- The trend for a hyper connected business with omnichannel reach with freely flowing data and intelligence
- Every business is a digital business - multi cloud multi edge
- The need to protect everything end to end, with security being the biggest risk
There is also a need for business to meet sustainability goals and track our own carbon footprint.
Microsoft Loop was announced a new collaborative canvas.
There were some other great transformational announcements.