Slides and Video from My Talk at PDC 2019

I got a chance to speak to Hadoop folks at this years Pune Data Conference held in Pune, India.

My talk is titled Admins: Smoke Test Your Hadoop Cluster! This is the abstract:

Software smoke testing is a preliminary level of testing. It makes certain that all of the primary components of a system are functioning correctly. For example, when installing a new secured Hadoop cluster, running a series of quick tests to make sure that things like HDFS and MapReduce are operational can save a lot of headache before enabling Kerberos. Smoke tests can also save you time and embarrassment by making sure that things work before you turn the cluster over to your customer.


In this talk, Michael Arnold will explain the utility of testing Hadoop components after cluster builds and software upgrades. Michael will present code examples that you can use to confirm functionality of Spark, Kudu, HBase, Kafka, MapReduce, etc on your cluster.

This is the link to the slide presentation and video.

Slides from My Puppet Camp Phoenix 2015 Presentation

I got a chance to speak to some local Puppet Users at this years Puppet Camp Phoenix held in Chandler, Arizona.

My talk is titled Managing Files via Puppet: Let Me Count The Ways. This is the abstract:

Configuration files abound. They are the primary way to get our systems to do the things we want them to do. Puppet has a number of ways for us to manage these files. From static, whole file control through template-based file deployment to separate, line-based configuration, Puppet can get the bits we need into the correct file with little trouble. In addition to managing the file, this talk will also cover patterns (APIs) that a module can follow to allow other modules to control the contents of it’s configuration files.

This is a link to my slide presentation and sourcecode.

Video of my PuppetConf 2012 Presentation

The PuppetConf 2012 videos are up! Here is a link to mine: Automated Configuration of Firmware – Michael Arnold. And here are the Slides from my PuppetConf 2012 Presentation.

Slides from my PuppetConf 2012 Presentation

This is a link to my slide presentation and screencast.

You can also see all the links that I referred to in my presentation by searching for the firmware tag.

I will be speaking at PuppetConf 2012

I will be speaking this year at the greatest operations and infrastructure automation conference of the year: PuppetConf 2012. Here is the abstract that I submitted:

Automated Configuration of Firmware

Everyone hears about automation of OS installs and configuration of services on top of the OS yet no one talks about automating the configuration of firmware. There are a number of venders who provide tools that allow the intrepid systems administrator to never again have to arrow through BIOS menus to toggle some less known but needed setting. Gone are the days of manually entering disk configurations into a RAID controller. This session is for SysAdmins or hardware geeks that want to free up their time to enjoy the finer things in life. You will be presented with the knowledge gained from two years of experience managing firmware configuration as a part of hardware discovery and provisioning.

Video of my Hadoop Summit 2012 Presentation

It looks like the video of my presentation at the 2012 Hadoop Summit made it online. It is too bad the Summit website doese not link to it. This is a link to my presentation as well.

I will be speaking at Hadoop Summit 2012

After an initial “on-hold” status, it looks like I will get to speak at the Hadoop Summit 2012 in San Jose, CA. This will be my first time speaking at a conference and I am really looking forward to it. Below is the abstract I submitted:

Hadoop Operations: Starting Out Small / So Your Cluster Isn`t Yahoo-sized (yet)

Everyone hears about large clusters with thousands of machines and petabytes of storage yet not everyone starts their first Hadoop deployment with dozens of cabinets of equipment. What do you do when you don`t have quite as large of a deployment? What decisions should you make now and which should you postpone for later? This session is for SysAdmins that have not yet or just recently jumped into the Hadoop fray. You will be presented with the knowledge gained from two years of operational experience at a (currently) small Hadoop site. We will discuss things that are initially important for a small (10-100 node) cluster and what happens when you outgrow your first deployment.