Chaos, complexity, curiosity and database systems. A place where research meets industry
Welcome
"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein
Saturday 22 January 2022
Data Toboggan this week
Sunday 16 January 2022
Data Toboggan Piste Map
The Data Toboggan agenda is out. Take a journey with us down the slope that enables predictive analytics on 29 January. We have lots of amazing speakers for a full 12 hours of free content specialising on Azure Synapse. We look forward to seeing you there.
Register: https://bit.ly/DT22Register
Agenda: https://bit.ly/DT2022Agenda
Wednesday 12 January 2022
Data Governance with Azure Purview
SQLBits 8-12 March 2022 is approaching soon. I am really excited to be a part of a session with Erwin de Kreuk and Wolfgang Strasser. The session: Data Governance with Azure Purview - Ask the Experts
To submit your questions in advance for the session we have a Microsoft form to complete.
Sunday 2 January 2022
Creating an AI Ethics panel
- Establish governance for data ethics & AI and consider the importance for data collection and sharing.
- Describe how/when fairness happens and how/what biases have been accounted for
- Provide mechanisms for recourse.
Harvard business review has an article Create an Ethics Committee to Keep Your AI Initiative in Check
Accenture have a summary page Building data and AI ethics Committee in your business
Accenture have a full report on building a data ethics committee.
- understand unintended and or negative consequences
- human rights considerations
- justify the benefit
- make user needs and public benefit transparent
- check everyone understand user need and how to use the data
- ensure diversity within your team
- involve external stakeholders
- effective governance structures with experts
- transparency
- compliance with GDPR and DPA 2018
- data protection by design
- accountability
- transparency
- project complainant with the equality act 2010
- ensure effective governance of your data
- data source being used
- meta data understood
- processes to maintain integrity
- is synthetic data appropriate for the project evidence based caveats
- bias in data to train the model
- determine proportionality
- data anonymisation
- robust practices - demonstrated reproducibility, quality of the model
- make data open and shareable whenever possible
- think about transparency of sensitive models
- explainability
- repeatability
- project influences
- accountability structures
- skills, training and maintenance for longevity of the project
- share learnings
Another model is the Data Ethics Decision Aid. DEDA is a tool-kit facilitating initial brainstorming sessions to map ethical issues in data projects, documenting the deliberation process and furthering accountability towards the various stakeholders and the public.
Saturday 1 January 2022
Data as an asset
I just read an interesting article about the recipe for success handling data assets. It talks about data as an essential factor for business agility and that it enables competitive advantage. Data is an asset in its own right and organizations must change how data is viewed at a strategic level. Gartner and Accenture talk about data as the essential focus and ingredient. This assets become valuable once actionable insight can be derived. The article sets out 15 mantras for implementing data as an asset
- Define your Data Strategy with tangible measurable metrics linked to business outcomes with a data architecture blueprint and executable roadmap.
- Disrupt business models with AI
- Establish the right Data Culture and Architecture with accountability, data curation and data quality competency, frictionless trusted data supply with embedded data fluency across the business and a data taxonomy and dictionary.
- Implement DataOps to infuse life into your data with data acquisition and management connecting data creators with data consumers.
- Establish Tech Intensity initiatives for Data-Fluency enablement by setting baselines for data literacy skills resulting in data fluency
- Establish Data Signals and Patterns Repository
- Establish Data Marketplace – for Data sharing and sourcing across ecosystems reviewing the data supply chain and data monetization strategy
- Use AI and ML Algorithms
- Democratize Data – Secure Data Access and the correct type of BI and BI tools and make data visualization more transparent, intuitive and contextualised.
- Data Governance to produce trusted data with data lineage, managed data quality, business meta data and data profiling with risk and privacy policies for compliance.
- Establish Data Ethics principles covering things such as transparency, traceability and explainability
- Data Observability being the understanding of the health of the data in the system. The data observability pillars freshness and velocity, distribution, volume, schemes, lineage and data security & compliance.
- Define Data security and compliance controls
- Hire the right Data Engineering and AI talent
- Establish a Chief Data Officer and office of CDO
The article finishes stating Data Fluency and empowerment will be the determining success factors in a data-literate world.