Thomas Henson is a known Data Engineering Advocate who is known for helping teams solve complex problems with Big Data. Thomas is a Software Engineer at heart and Big Data Analytics Evangelist by trade; where he specializes in solving real world problems with Scaled-out Computing Solutions (Hadoop, Spark, Flink, Redshift, Kafka, etc.). He is proud Alumni of the University of North Alabama; where he received both his undergraduate and graduate degree. Thomas has been seen at many conferences events like Hadoop Summit, Future of Data Roadshow and Fed Forum. You can always check him out at thomashenson.com or on twitter at @henson_tm.
Publications
Data Engineering Courses
Pig Latin: Getting Started – Course designed to get developers familiar with the Pig Latin language fundamentals. Learn how to write your first MapReduce job without Java.
Getting Started with HDFS – Learning to work with Hadoop Distributed File System (HDFS) is a baseline skill for anyone administering or developing in the Hadoop ecosystem. In this course, you will learn how to work with HDFS, Hive, Pig, Sqoop and HBase from the command line.
Analyzing Machine Data with Splunk – Splunk is one of the most used applications for analyzing unstructured data in the data center. This course will teach you the basics of setting up Splunk, writing Splunk queries, and running Splunk with Hadoop.
Getting Started with Hortonworks Data Platform – Hortonworks Data Platform is one of the leading Hadoop distributed platform. In this course learn how to build and deploy a HDP cluster in your environment.
Enteprise Skills in Hortonworks Data Platform – Data Engineers are in high demand mostly because Enterprises are adopting Hadoop at hyperscale. In my newest Pluralsight course I cover those skills Data Engineers must have to be successful in the Enterprise.
Implementing Neural Networks with TFLearn – Tensorflow course taking a Data Engineers approach to learning how to get started in Deep Learning. In this course we walk through how to writing layers using Tensorflow Python APIs. This course is targeted at those just getting started with Machine Learning.
Installing and Configuring Splunk – Learn how to start building out Splunk environments for analyzing machine generated data. In this course we explore the basic building blocks of Splunk Architect.
Performing Basic Splunk Searches – In this course we focus on the building blocks of search in Splunk. Learn how to create Splunk searches and use the Search Processing Language in Splunk.
Building Reports, Dashboards, and Alerts in Splunk – Follow up to the 2 above Splunk courses where we dig into Splunk visualizations. Learn how to use the data from the previous tutorials to build reports, dashboards, and alert in Splunk.
Articles
Pig vs Java MapReduce: What to know
Hadoop: Ultimate List of Frameworks
Explaining the Time Value of Data
Andrew Ng’s Machine Learning Course
The Rise of Deep Learning in the Enterprise
Ultimate List of Tensorflow Resources for Machine Learning Engineers
Where Were You When Artificial Intelligence Transformed The Enterprise
Podcast Interviews
Data Engineering Podcast 71: Deep Learning for Data Engineers
Developer On Fire 278: The Role of the Data Engineer
Big Data Beard Podcast 02: The Hot Seat Seat in the Summer
Roaring Elephant 49: Thomas Henson on IoT Architectures
Get Up And Code 093: All About Running With Thomas Henson
My Life for the Code 02: Big Data Niche, Pluralsight, Family, and more with Thomas Henson
Webinars
How Nav Canada gets it digital game off the ground
Digital Transformation Worth its Weight in Gold
Conference Talks
O’Reilly AI Conference – Data Fueling AI of the Future
Dell Technologies World 2018 – Harness The Intersection Of Applications, Data & Analytics For IoT
Hortonworks Future of Data Roadshow Atlanta
Hortonworks Future of Data Roadshow Charlotte
Dataworks Summit San Jose – Future Architecutres of Streaming Analytics