Category Archives: hadoop

Writing an ORC File Using Java

So, I saw this question on Stack Overflow some time ago and I’ve been meaning to answer it ever since.  The question: How to convert a .txt / .csv file to ORC format. Below is a code sample of how to do it … Continue reading

Posted in code example, coding, github, hadoop, java, programming | 10 Comments

Hadoop Streaming Confidential

Warning: This is a short tech article.  To my usual readers who come here to read my cute daddy-daughter stories, please feel free to skip this one. If you are working with Hadoop Streaming in Python and see the following … Continue reading

Posted in coding, hadoop | Leave a comment