Welcome

Passionately curious about Data, Databases and Systems Complexity. Data is ubiquitous, the database universe is dichotomous (structured and unstructured), expanding and complex. Find my Database Research at SQLToolkit.co.uk . Microsoft Data Platform MVP

"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein



Thursday, 16 May 2019

West Women Awards Ceremony

Proud to be listed in the top 100 most inspiring women in the region. The awards ceremony tonight in Bristol helps promotes diversity and inclusion in the workplace which is fundamental to providing an environment for innovation and to enable companies to lead. It also creates a culture that can enable everyone to aspire to follow their dreams.

Thursday, 9 May 2019

Monday, 6 May 2019

Google AI training data set

Google has released an AI training data set with 5 million images and 200,000 landmarks. The open-sourced Google-Landmarks-v2 contains a larger landmark recognition corpus. Google has also launched two new challenges Landmark Recognition 2019 and Landmark Retrieval 2019 on Kaggle.


Tuesday, 30 April 2019

Azure Open Datasets

Azure Open Datasets are curated public datasets that can be used to add scenario-specific features to machine learning solutions for more accurate models. Open Datasets are on Microsoft Azure and are available to Azure Databricks, Machine Learning service, and Machine Learning Studio. Access to the datasets is through the APIs and other products, such as Power BI and Azure Data Factory.



Sunday, 28 April 2019

Microsoft Build is coming






















It is that time of year again and I am looking forward to see what announcements are going to be made at MSBuild 2019. MSBuild explores the latest developer tools and technologies.

Thursday, 25 April 2019

Spark+AI Summit 2019


The SparkAI Summit shared a lot of  announcements. The open source announcements were






Koalas - a more complete Pandas API

The open sourcing of Databricks Delta as Delta Lake. Delta dramatically simplifies building reliable data lakes on HDFS and cloud storage with ACID transactions, indexes and scalable metadata handling.


Microsoft is joining the MLflow project and adding MLflow APIs in Azure ML.

Rohan Kumar  of Microsoft announced .NET for Apache Spark, making Apache Spark accessible to .NET developers - Git Hub


Spark 3.0 expected later in the year



The keynote videos are all online now and other session videos will be there in about 2 weeks.