Passionately curious about Data, Databases and Systems Complexity. Data is ubiquitous, the database universe is dichotomous (structured and unstructured), expanding and complex. Find my Database Research at SQLToolkit.co.uk

"The important thing is not to stop questioning. Curiosity has its own reason for existing" Einstein

Saturday, 16 January 2016

Resources for R and SQL Server 2016

R contains libraries to create a range of visualizations. This cheat sheet contains examples for representing data using Bar Chart, Histogram, Box Plot, Scatter Plot, Hexbin Binning, Mosaic Plot, Heat Map, Map Visualization, 3D Graphs, Correlogram (GUIs).

I came across an article on how to Transition from Excel to R. This Intro to R for Microsoft Excel Users is https://t.co/xpKk84VrdH

To start with you will need

The R Project for Statistical Computing

RStudio IDE is a powerful and productive user interface for R. It’s free and open source

A couple of useful libraries
ggplot2 - to creating charts and graphs
dplyr - to help with manipulating the data

R is SQL Server 2016 CTP3 release has native in-database support for open source R and multi-threaded RevoScaleR functions. Scenarios for using R Services in SQL Server 2016:

Read more at 

Thursday, 14 January 2016

RStudio Cheat Sheets

Some great cheat sheets to help you navigate and use RStudio from https://www.rstudio.com/resources/cheatsheets/

RStudio IDE is the most popular integrated development environment for R.

Data Visualization using the ggplot2 package lets you make beautiful and customizable plots of your data


There are many more cheat sheets.

Tuesday, 12 January 2016

Free ebook: Introducing Microsoft SQL Server 2016

There is a new free ebook Introducing Microsoft SQL Server 2016: Mission-Critical Applications, Deeper Insights, Hyperscale Cloud, Preview Edition.

The chapters released so far are

Better Security
Improved database engine
Better reporting

Sunday, 6 December 2015

SQL Server in the Cloud Options

There are many SQL Server choices available to us as database architects. This diagram shows the Microsoft database stack available choices and the business drivers that affect the choice of cost versus administrative overhead.

An article SQL Server in Azure: Compare PaaS (SQLDB) andIaaS (Virtual Machine)
explains in more depth about the cloud options of SQL Server in Azure (IaaS) and Azure SQL Database. Igor Pagliai included this useful table.

  SQLDB (PaaS)
  Server VM (IaaS)
Features Less features than box Full box product features

Performances Max 1750 DTU in Premium Tier Depends on VM SKU/Storage

DB Size Max 1TB in Premium Tier (P11) 64TB on G-SERIES
Workload Sizing by average usage Sizing based on peaks
High-Availability Built-in by platform   

Manual configuration by AlwaysOn AG
Fault-Handling Necessary fault-handling &
Recommended fault-handling &
Locality No co-location with application Co-located by VMs and VNETs
Segregation Internet exposed endpoint

Internal private endpoint

Versioning No control on upgrades 

Full control over DB upgrade
TCO Very low, almost self-managed     High (as on-premise)    

Administration No full-time DBA required       Full staffed DBA required

Management Easy to manage many DBs

Complex to manage many DBs/VMs
Scale-Out Tools & Frameworks available No easy scale-out       

Configuration No setup customization 

Full access to OS and SQL

Authentication Only SQL standard authentication SQL standard and integrated

Security No Fixed IP available  

Fixed IP possible at VM level
Backup Backup files not accessible Full control of backup files

Thursday, 3 December 2015

SQL Server 2005 End of Life

SQL Server 2005 end of extended support finishes 12 April 2016. This means no further hotfixes and support from Microsoft when things fail.

There is an excellent article highlighting some of the new features that can be utilized if you upgrade.  Enhanced Security features are a must to consider.

The article is entitled:

What You Are Missing by Sticking with SQL Server 2005: Data Architects Version