Now available on Azure Marketplace – Ubuntu Data Science Virtual Machine


Ubuntu Linux Data Science Virtual Machine is now available is now released on the Azure marketplace.

 

What is the Data Science Virtual Machine

 

clip_image001

The Data Science team will continue to support CentOS and Windows Data Science Virtual Machine ‘DSVM’.

The team have made some major enhancement to the DSVM offering with the Ubuntu version.  We have found from feedback Ubuntu is overwhelmingly the most popular Linux distro among data scientists and academics.

So the team are very excited and happy to now offer Ubuntu as a core Linux DSVM platform with CPU and GPU Images available on the Azure MarketPlace.

The Data Science Virtual Machine family of VM images on Azure includes the DSVM for Windows, a CentOS-based DSVM for Linux, and an Ubuntu-based DSVM for Linux. These images come with popular data science and machine learning tools, including Microsoft R Server Developer Edition, Microsoft R Open, Anaconda Python, Julia, Jupyter notebooks, Visual Studio Code, RStudio, xgboost, and many more. A full list of tools for all editions of the DSVM is available here.

The DSVM has proven popular with many Academic data scientists as it helps them focus on teaching, learning and research and avoid delay due to IT Support and mundane steps around tool installation and configuration. The use of the DSVM has additionally helped many Institutions IT teams as it reduce the amount of support required in lab setup and preparation.

image 

In addition to all the data science tools you love, you now have a choice of deep learning tools  (CNTK, Tensorflow, MxNet, Caffe/Caffe2, Torch, Theano, Keras, NVidia Digits) on the Ubuntu version.

GPU builds

Are now available so you can deploy the same DSVM on a GPU VM (Azure NC-Series) or a CPU-Only VM. You just fall back to using the CPU when running the deep learning tools on CPU-only hardware. NVidia Drivers, CUDA etc are all on the VM image by default. So you can just get started with deep learning in matter of minutes. No need to download the framework source, fight with compiling those tools and installing the dependencies.

Here is a blog post on the new release

https://blogs.technet.microsoft.com/machinelearning/2017/04/18/deep-learning-on-the-new-ubuntu-based-data-science-virtual-machine-for-linux/

The Data Science team have also been working with with Facebook to be among the five partners featured in their announcement of open sourcing their latest deep learning framework called Caffe2 (a rewrite of Caffe) at their F8 conference.

For more details on Caffe2 see http://caffe2.ai/blog/2017/04/18/caffe2-open-source-announcement.html

And more details on the Official Microsoft Machine Learning Blog https://blogs.technet.microsoft.com/machinelearning/2017/04/18/deep-learning-with-caffe2-on-the-azure-data-science-virtual-machine/

Getting Started

To spin up a Ubuntu DSVM just go to its product page at: http://aka.ms/dsvm/ubuntu  and click “Get it now” button.

You can login in to it with SSH client like Putty, SSH command line OR use X2GO for graphical interface. Jupyter/Jupyterhub, RStudio Server are available.

A side by side comparison of tools preinstalled on different editions Windows and Linux of the DSVM can be found here.

So I really look forward to how academica are using this and please do share your usage, support and feedback to help the Data Science team keep improving the DSVM and make it the best analytics development environment anywhere on the cloud. Please send any questions to or feedback to the DSVM forum on MSDN.

Comments (4)

  1. Lee Stott says:

    When setting up VMs for GPU please use NC series GPU which are available in the South Central US region and ensure you use SSD not SSD see https://blogs.msdn.microsoft.com/uk_faculty_connection/2017/03/27/azure-gpu-tensorflow-step-by-step-setup/ for step by step instructions.

  2. davidmakovoz says:

    I’m using Data Science Virtual Machine for Linux (Ubuntu). It does have TensorFlow, however it is version 1.0.1, which is badly outdated. I don’t believe I have permissions to update it, do I?
    Is it possible to have it updated, or should I use a different type of Machine, or what are my options?

    Thank you

    1. Lee Stott says:

      Please use the Data Science Forum for queries http://aka.ms/dsvm/forum but as Gopi has stated you can update this at present, also Stack overflow for queries relating to the DSVM if search key as “dsvm” on Stackoverflow.

  3. Gopi says:

    We are releasing a new refresh later this month that will have latest. You should be able to do pip or conda update to refresh packages. You will need to be in sudoers group since the default conda environments(root and py35) are global in a central location and available for all users.

Skip to main content