Stuck trying to manipulate a string in Hadoop and don't want to use Java? No problem: use Pig's built-in string functions. Why Pig for ETL? Using Apache Pig in Hadoop is a must for ETL transactions. Pig allows developers to quickly write a … [Continue reading]
HDFS Getting Started Course
Are you ready to get some Hadoop knowledge dropped on you? Well, here it is, eight long months after my last Pluralsight course: HDFS Getting Started has been launched. I couldn't be more excited to have this course released. HDFS … [Continue reading]
Top 10 Favorite Posts of 2015
So long, 2015. The year is done. I love the New Year because it's always a good time to look back at what you have accomplished. 2015 presented me with new challenges and opportunities. As I was planning out my goals for next year, I wanted to look back … [Continue reading]
How Big Data Impacts Holiday Shopping
Christmas is a magical time of year. I still remember the Christmas when I was 7 years old. After all the gifts had been opened, my parents made me take the trash out to the road. It had been a great Christmas; I was very happy with all my gifts and so … [Continue reading]
Comparing Data with Pig Latin MAX() Function
Last time we tackled how to use the MIN() function in Pig, so this week we are going to learn its opposite, the MAX() function. It's just like the MIN() function, but instead of finding the lowest value in an array/column, it finds the … [Continue reading]
Pig Latin Eval Function MIN
If you were working in Excel, could you easily find the minimum value in a column or row of data? Of course you could. Excel has a built-in function to find the minimum value, along with many other functions. Well, so does Apache Pig; you just have to learn … [Continue reading]
Execute Pig Script from Command Line
Ready to run a Pig script with the Grunt Shell or Pig Editor? The time has come to take the training wheels off and run a Pig script without using the Grunt Shell. At least that is how I felt when I ran my first Pig script from the … [Continue reading]
Apache Pig Eval Functions Series
Ready to master Apache Pig but not sure how to get started? How can I master Apache Pig? The process for mastering a programming language is the same as learning any other skill. Practice, practice, practice. The practice needs to be focused … [Continue reading]
Pig Eval Series: Tokenize
In this Pig Eval tutorial we are going to use the Apache Pig TOKENIZE function. If you're not familiar with the TOKENIZE function, you're probably thinking we are going to do something crazy like turn a field into a game token that can be used at a Chucky … [Continue reading]
Apache Pig Latin Tutorial
Hadoop development is one of the most desired skills in software development. One reason is that Hadoop is early in its product life cycle. It's like getting involved with relational databases back in the early '80s. Huge … [Continue reading]
Learn to Process Data with Apache Pig
Apache Pig is one of the hottest languages in the Hadoop ecosystem. Right now the average salary for a Pig developer is $124,563, according to a report published in InfoWorld. A Pig developer can process both unstructured and semi-structured data in … [Continue reading]
Pig Latin Concatenation Function
Today we are going to talk about how to concatenate fields using Pig Latin. For this week's example we are going to use a different data set than the one we have used so far in the Apache Pig Latin Eval Function series. Our new data set is a sample data set … [Continue reading]