I honestly think developing real-time analytics is one of the hardest feats for developers to take on! I'll admit I'm for sure biased, but that doesn't make me wrong. My first project in the Hadoop eco-system was a real-time application when … [Continue reading]
Big Data Big Questions: Big Data Kappa Architecture Explained
Learning how to develop streaming architectures can be tricky and difficult. In Big Data the Kappa Architecture has become the powerful streaming architecture because of the growing need to analyze streaming data. For the past few years the Lambda … [Continue reading]
Big Data Lambda Architecture Explained
What is Lambda Architecture? Since the Spark, Storm, and other streaming processing engines entered the Hadoop ecosystem the Lambda Architecture has been the defacto architecture for Big Data with a real-time processing requirement. In this episode … [Continue reading]
Is Hadoop Killing the EDW?
Is Hadoop Killing the EDW? Fair question since in it's 11th year Hadoop is known as the innovative kid on the block for analyzing large data sets. If the Hadoop ecosystem can analyze large data sets will it kill the EDW? The Enterprise Data … [Continue reading]
DataWorks Summit 2017 Recap
All Things Data Just coming off an amazing week with a ton of information in the Hadoop Ecosystem. It's been a 2 years since I've been to this conference. Somethings have changed like the name from Hadoop Summit to DataWorks Summit. Other things … [Continue reading]
Isilon Quick Tips: Compare Snapshots in OneFS
How to Compare Snapshots in OneFS At least once every Isilon Administrator will need to compare snapshots in OneFS. It might be a situation where a user has upload files to the wrong directory or you need to roll back to a different version of a … [Continue reading]
DataWorks Summit: Future Architecture of Streaming Analytics
Ready to learn about the Future Architecture of Streaming Analytics? Next week I will be heading to the DataWorks Summit in San Jose (formerly Hadoop Summit). The DataWorks summit is one of the top conferences for the Hadoop Ecosystem. Last year … [Continue reading]
Big Data Big Questions: Do I need to know Java to become a Big Data Developer?
Today there are so many applications and frameworks in the Hadoop ecosystem, most of which are written in Java. So does this mean anyone wanting to become a Hadoop developer or Big Data Developer must learn Java? Should you go through hours and weeks … [Continue reading]
Complete Guide to Splunk Add-Ons
Splunk is a popular application for analyzing machine data in the data center. What happens when Splunk Administrators want to add new data sources to their Splunk environment outside the default list? The Administrators have two options: … [Continue reading]
Isilon Quick Tips: Deep Dive FTP
Deep Dive into FTP on OneFS (Part 2 of my Isilon Quick Tips on FTP and talk at the Huntsville Isilon User Working Group talk) The FTP protocol is one of the most overlooked protocols in OneFS. On the surface there doesn't appear to be more than … [Continue reading]
7 Commands for Copying Data in HDFS
What happens when you need a duplicate file in two different locations? It's not a trivial problem you just need to copy that file to the new location. In Hadoop and HDFS you can copy files easily. You just have to understand how you want to copy … [Continue reading]
Ultimate Big Data Battle: Batch Processing vs. Streaming Processing
Today developers are analyzing Terabytes and Petabytes of data in the Hadoop Ecosystem. There are many projects that are helping to accelerate and speed up this innovation. All of these projects rely on batch and streaming processing, but what is the … [Continue reading]
- « Previous Page
- 1
- …
- 7
- 8
- 9
- 10
- 11
- …
- 16
- Next Page »