Carl's Blog

Carl Nolan's ramblings on development and data processing

Pushing Hadoop Cluster Configuration Changes using PowerShell

In my previous post I talked about Implementing and Deploying Rack Awareness using PowerShell....

Author: Carl Nolan Date: 04/02/2014

Deploying Hadoop Rack Awareness with PowerShell

In a previous post I talked about Implementing Hadoop Rack Awareness with PowerShell. One thing I...

Author: Carl Nolan Date: 03/19/2014

Implementing Hadoop Rack Awareness with PowerShell

This post walks through building a PowerShell script for enabling Rack Awareness in Hadoop. While...

Author: Carl Nolan Date: 03/14/2014

Managing Your HDInsight Cluster using PowerShell – Update

Since writing my last post Managing Your HDInsight Cluster and .Net Job Submissions using...

Author: Carl Nolan Date: 12/16/2013

Managing Your HDInsight Cluster and .Net Job Submissions using PowerShell

This post explains how best to manage an HDInsight cluster using a management console and Windows...

Author: Carl Nolan Date: 12/02/2013

Implementing LOB Storage in Memory Optimized Tables

Memory optimized tables do not have off-row or large object (LOB) storage, and the row size is...

Author: Carl Nolan Date: 09/16/2013

Managing Your HDInsight Cluster with PowerShell

An updated version of this post can be found here. This blog post provides a mechanism for managing...

Author: Carl Nolan Date: 06/07/2013

Hadoop .Net HDFS File Access

Provided with the Microsoft Distribution of Hadoop, HDInsight, is a C library for HDFS file access....

Author: Carl Nolan Date: 02/08/2013

Submitting Hadoop MapReduce Jobs using PowerShell

As always here is a link to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”...

Author: Carl Nolan Date: 02/05/2013

Try F# 3.0 Released

From the F# Team Blog: If you're familiar with the site Try F#, which lets you run F# right in your...

Author: Carl Nolan Date: 01/23/2013

Hive and XML File Processing

When I put together the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code one...

Author: Carl Nolan Date: 12/13/2012

Co-occurrence Approach to an Item Based Recommender Update

In a previous post I talked about a Co-occurrence Approach to an Item Based Recommender, that...

Author: Carl Nolan Date: 12/05/2012

Implementing a MapReduce Join with Hadoop and the .Net Framework

I have often been asked how one implements a Join whilst writing MapReduce code. As such, I...

Author: Carl Nolan Date: 11/13/2012

Framework for .Net Hadoop MapReduce Job Submission V1.0 Release

It has been a few months since I have made a change to the “Generics based Framework for .Net...

Author: Carl Nolan Date: 10/10/2012

Execution Time Based Heuristic Custom Task Scheduler

If you follow the samples for Parallel Programming with the .Net Framework, you may have come across...

Author: Carl Nolan Date: 10/02/2012

Framework for .Net Hadoop MapReduce Job Submission TextOutput Type

Some recent changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”...

Author: Carl Nolan Date: 07/10/2012

C# MapReduce Based Co-occurrence Item Based Recommender

As promised, to conclude the Co-occurrence Approach to an Item Based Recommender posts I wanted to...

Author: Carl Nolan Date: 07/09/2012

MapReduce Based Co-occurrence Approach to an Item Based Recommender

In a previous post I covered the basics for a Co-occurrence Approach to an Item Based Recommender....

Author: Carl Nolan Date: 07/07/2012

Framework for .Net Hadoop MapReduce Job Submission Json Serialization

A while back one of the changes made to the “Generics based Framework for .Net Hadoop...

Author: Carl Nolan Date: 07/01/2012

Framework for .Net Hadoop MapReduce Job Submission configuration update

To better support configuring the Stream environment whilst running .Net Streaming jobs I have made...

Author: Carl Nolan Date: 06/25/2012

Co-occurrence Approach to an Item Based Recommender

For a while I thought I would tackle the problem of creating an item-based recommender. Firstly I...

Author: Carl Nolan Date: 06/23/2012

Framework for .Net Hadoop MapReduce Job Submission Binary Output

To end the week I decided to make a minor change to the “Generics based Framework for .Net...

Author: Carl Nolan Date: 06/15/2012

Framework for .Net Hadoop MapReduce Job Submission libjars update

If you have been using the “Generics based Framework for .Net Hadoop MapReduce Job...

Author: Carl Nolan Date: 06/11/2012

Hadoop .Net HDFS File Access (Revisited Archived)

Updated post can be found here:...

Author: Carl Nolan Date: 06/02/2012

.Net Implementation of a Priority Queue (aka Heap)

I thought I would take a break for a while from Hadoop and put together an F# .Net implementation of...

Author: Carl Nolan Date: 05/03/2012

Generics based Framework for .Net Hadoop MapReduce Job Submission

Over the past month I have been working on a framework to allow composition and submission of...

Author: Carl Nolan Date: 04/29/2012

.Net Hadoop MapReduce Job Framework - Revisited (Archived)

An updated version of this post can be found at:...

Author: Carl Nolan Date: 04/25/2012

Framework for Composing and Submitting .Net Hadoop MapReduce Jobs (Archived)

An updated version of this post can be found at:...

Author: Carl Nolan Date: 04/10/2012

Hadoop .Net HDFS File Access (Archived)

Updated post can be found here:...

Author: Carl Nolan Date: 02/29/2012

Hadoop Streaming in F# and MapReduce (summary)

With all my recent posts around Hadoop Streaming I thought it would be useful to summarize them into...

Author: Carl Nolan Date: 02/22/2012

FSharpChart for Valentines Day

As I have not posted anything about FSharpChart for a while I thought I would do a quick post, one...

Author: Carl Nolan Date: 02/14/2012

Hadoop XML Streaming and F# MapReduce

So, to round out the Hadoop Streaming samples I thought I would put together an XML Streaming...

Author: Carl Nolan Date: 01/21/2012

Hadoop Streaming and Windows Azure Blob Storage

One of the cool features of the Microsoft Distribution of Hadoop (MDH) is the native support for...

Author: Carl Nolan Date: 01/07/2012

Hadoop Streaming and Reporting

If, like me, you are a .Net developer and have written some Streaming jobs it is not immediately...

Author: Carl Nolan Date: 01/06/2012

A lazy evaluation of F# Seq.groupBy for sorted sequences

In doing some recent work with Hadoop I needed to process a sequence which was grouped by a...

Author: Carl Nolan Date: 01/02/2012

Hadoop Binary Streaming and PDF File Inclusion

In a previous post I talked about Hadoop Binary Streaming for the processing of Microsoft Office...

Author: Carl Nolan Date: 01/01/2012

Hadoop Binary Streaming and F# MapReduce

As mentioned in my previous post Hadoop Streaming not only supports text streaming, but it also...

Author: Carl Nolan Date: 12/30/2011

MapReduce Tester: A Quick Word

In my previous post I talked a little about testing the Hadoop Streaming F# MapReduce code; but it...

Author: Carl Nolan Date: 12/29/2011

Using Hadoop on Azure JS Console for Data Visualizations

In my last post I demonstrated an F# MapReduce program based on Hadoop Streaming. One thing that...

Author: Carl Nolan Date: 12/18/2011

Hadoop Streaming and F# MapReduce

And now for something completely different. As you may know Microsoft has recently announced plans...

Author: Carl Nolan Date: 12/16/2011

F# Project and Item Templates (revised)

Over the past few months I have posted up a few Project and Item Templates for F#. Here is a quick...

Author: Carl Nolan Date: 12/11/2011

F# WPF Project Template

This template is for a basic F# Windows Presentation Foundation (WPF) project. This template can be...

Author: Carl Nolan Date: 12/11/2011

SQL Server Query Performance Analysis using DMVs

Continuing the TSQL theme I thought it would be worthwhile sharing some TSQL scripts that I have...

Author: Carl Nolan Date: 11/23/2011

Adventures in TSQL: Using CTEs when performing DELETE/UPDATE operations

It seems in quite a few applications of late there has been the requirement to batch process, in a...

Author: Carl Nolan Date: 11/15/2011

Adventures in TSQL: Adding date and time values

With the addition of the SQL Server date and time types, I have often found myself needing to create...

Author: Carl Nolan Date: 11/15/2011

Adventures in TSQL: Comma separated string from column values

It seems that several times now I have had the requirement to create a comma separated string from...

Author: Carl Nolan Date: 11/15/2011

Creating a Partitioned View in the BAM Archiving Database

When you run the BAM data maintenance package (BAM_DM_<activity name>) BAM copies each...

Author: Carl Nolan Date: 11/15/2011

F# Dynamic Lookup Operator and Regex

In a recent post I talked about using String.forall for string matching over the use of Regex:...

Author: Carl Nolan Date: 11/11/2011

F# Parallel Execution and Future Tasks

In a previous post I talked about some options around performing parallel executions by providing...

Author: Carl Nolan Date: 11/07/2011
