Passionately curious about Data, Databases and Systems Complexity. Data is ubiquitous, the database universe is dichotomous (structured and unstructured), expanding and complex. Find my Database Research at SQLToolkit.co.uk . Microsoft Data Platform MVP

"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein

Saturday, 27 January 2018

Data Scientist Skills

There are an avalanche of skills required to become a data scientist. I came across this useful diagram.

The hierarchy of needs for data science could help you be more effective with AI and machine learning.

Monday, 22 January 2018

Migration from the Relational world to Graph

I came across this useful blog SQL2Gremlin which translates the Northwind dataset. This was used as a sample database in older versions of SQL Server. The blog post explains the Apache TinkerPop's Gremlin graph traversal language using typical patterns found when querying data with SQL. The SQL examples make use of the T-SQL syntax.

This blog was helpful when looking at Azure Cosmos DB (Microsoft’s globally distributed multi-model database service). The  Gremlin console on the Azure portal is explained in the documentation, Azure Cosmos DB: create, query, and traverse a graph in the Gremlin console. The tutorial creates and queries vertices and edges, updates a vertex property, queries vertices, traverses the graph, and drops a vertex.

The Gremlin console runs on Linux, Mac, and Windows. It can be downloaded from the Apache TinkerPop site.

Apache TinkerPop is a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP).

Wednesday, 17 January 2018

Field Guide to Data Science

I read this really useful  guide about data science. The Field Guide to Data Science was created to help organizations of all types and missions understand how to make use of data as a resource

More details about understanding the DNA of data can be found here.