Techroz is my personal blog. I write articles regarding my daily interaction with Computer Science. My recent interests are BigData, Cloud Computing and improving my programming skills.
Programming
Hadoop Distributed File System (HDFS)

Hadoop Distributed File System (HDFS)

Apache Hadoop is an open source framework for distributed storage and processing. The distributed storage part of framework is commonly known as Hadoop Distributed Filesystem (HDFS) [1] which is the flagship filesystem for Hadoop. The processing part is known as MapReduce. Hadoop runs on commodity hardware and has the capabilities to store and process very...
Spark vs Hama

Spark vs Hama

Spark is an opensource, in-memory and iterative computing engine that can run on any Hadoop cluster, just like Hama. Recently, it got a lot of attention due to its capabilities to outperform Hadoop in iterative algorithms. Spark provides a clean programming interface and users write distributed programs as if they were doing serial implementation. This...
Apache Hama - General Introduction

Apache Hama – General Introduction

Apache Hama [1] is an opensource distributed computation framework written in Java and based on BSP programming model. It is currently a top level project at Apache Software Foundation (ASF). Hama runs on top of Hadoop [2] and can work seamlessly in any Hadoop environment. It processes data which is stored in HDFS and can...
Bulk Synchronous Model (BSP)

Bulk Synchronous Model (BSP)

Bulk Synchronous Parallel (BSP) programming model is also an analogous bridge but for parallel computation, introduced by Valiant and Leslie G. A BSP computer can be defined by p processors, each with its local memory, connected via some means of point-to-point communication.
Configuring third party JAR with native library in Apache Hama

Configuring third party JAR with native library in Apache Hama

Article about running third party libraries with native source code in Eclipse, terminal/CLI and Apache Hama.
H-Store - In memory row based relational database (A complete overview)

H-Store – In memory row based relational database (A complete overview)

H-Store is an open source, in-memory, row based and relational research database. It is a specialized database to handle only OLTP data. It is completely ACID complaint and runs on cluster of shared nothing machines. Multiple single threaded engines coordinate to provide efficient execution of OLTP transactions. H-Store is written from scratch and does not...
DAMN !!! A bug - A 4th iteration nightmare

DAMN !!! A bug – A 4th iteration nightmare

A story of a horrible bug which left me clueless. I had seen many bugs but this one really made me sleepless. Even though it was difficult but I still had fun solving it. Read it you will enjoy it.
Salesforce and Office 365 integration

Salesforce and Office 365 integration

This articles explains list different integration possibilities of Office 365 and Salesforce. Both of these systems can be integrated with each other using a third party solution or a custom built solution.
Apache Hama Random number example

Apache Hama Random number example

A simple random number generator using Apache Hama. This examples helps in understanding how Sync and send works.