Is there anyway to partition or containerize workloads in Isilon? 100% yes! Isilon's OneFS offers Access Zones to divide different workflows/users/AD servers/ GroupNets/etc. in the same Isilon Cluster. Learn to setup Access Zones in Isilon's … [Continue reading]
How to Execute HBase Script from the Command Line
In the past few post we've been working with creating tables and interacting with data in HBase. What happens when we have bulk data to upload? Do we want to enter each row from the HBase Shell? Wow that doesn't sound fun! Let's speed up … [Continue reading]
Big Data Big Questions: Learning to Become a Data Engineer?
Data Scientist for the past few years has been named the sexiest job in IT. However the Data Engineer is a huge part of the Big Data movement. The Data Engineer is one the top paying jobs in IT. On average the Data Engineer can make anywhere from 90K … [Continue reading]
Using HBase Scan From the HBase Shell
Continued Post in the HBase series. In this post we will continue from the example created in the Creating a Table in HBase. Now that we have our Asteroid Warning System table created in HBase let's learn how to use the HBase Scan table to quickly … [Continue reading]
HBase Error Solved – Error: Can’t get master address from ZooKeeper
The inspiration for this post on the HBase Error "Error: Can't get master address from ZooKeeper" came from my work on the HBase Creating a Table blog post. Here is the quick and easy way to check for this error. Why Does This Always … [Continue reading]
How to Create a Table in HBase
HBase is one the hottest Non-Relational Databases on Hadoop right now! HBase is a NoSQL built to work on top the Hadoop Distrubuted File System (HDFS). HDFS is built on the concept of Schema-on-Read where a schema is applied to the data on read. … [Continue reading]
Bound vs. Unbound Data in Real Time Analytics
Breaking The World of Processing Streaming and Real-Time analytics are pushing the boundaries of our analytic architecture patterns. In the big data community we now break down analytics processing into batch or streaming. If you glance at the top … [Continue reading]
Big Data Big Questions: Kappa Architecture for Real-Time
Should I Use Kappa Architecture For Real-Time Analytics? Analytics architectures are challenging to design. If you follow the latest trends in Big Data, you'll see a lot different architecture patterns to chose from. Architects have a … [Continue reading]
16 Hadoop fs Commands Every Data Engineer Must Know
Commands in Hadoop The Hadoop shell is the CLI for the Hadoop cluster. Most of the time Hadoop Administrators will find themselves using the Hadoop CLI just as much as the HDP, Ambari, or CDH management interface. Learning how to navigate and run … [Continue reading]
13 Step By Step Apache Hive Data Types
Hive is one of the leading SQL engine running on Hadoop. Hive has had a long relationship with Hadoop from the start to support SQL like syntax. Even though Hive supports SQL like syntax there are some differences the in the Hive data types vs. SQL … [Continue reading]
What is a Data Lake
Explaining the Data Lake The Enterprise space is notorious for throwing around jargon. Take Data Lake for example the term Data lake. Does it mean there is a real lake in my data center because that sounds like a horrible idea. Or is a Data … [Continue reading]
Python Options in Hadoop
New developers in the Hadoop ecosystem often struggle to get involved because they think they need to learn Java. Where do Python and non-Java developers turn to when developing in the Hadoop eco-system? What are the Python options in … [Continue reading]
- « Previous Page
- 1
- …
- 6
- 7
- 8
- 9
- 10
- …
- 16
- Next Page »