Techroz is my personal blog. I write articles regarding my daily interaction with Computer Science. My recent interests are BigData, Cloud Computing and improving my programming skills.
Monthly archive March, 2016
Hadoop Distributed File System (HDFS)

Hadoop Distributed File System (HDFS)

Apache Hadoop is an open source framework for distributed storage and processing. The distributed storage part of framework is commonly known as Hadoop Distributed Filesystem (HDFS) [1] which is the flagship filesystem for Hadoop. The processing part is known as MapReduce. Hadoop runs on commodity hardware and has the capabilities to store and process very...
Spark vs Hama

Spark vs Hama

Spark is an opensource, in-memory and iterative computing engine that can run on any Hadoop cluster, just like Hama. Recently, it got a lot of attention due to its capabilities to outperform Hadoop in iterative algorithms. Spark provides a clean programming interface and users write distributed programs as if they were doing serial implementation. This...
Apache Hama - General Introduction

Apache Hama – General Introduction

Apache Hama [1] is an opensource distributed computation framework written in Java and based on BSP programming model. It is currently a top level project at Apache Software Foundation (ASF). Hama runs on top of Hadoop [2] and can work seamlessly in any Hadoop environment. It processes data which is stored in HDFS and can...
Bulk Synchronous Model (BSP)

Bulk Synchronous Model (BSP)

Bulk Synchronous Parallel (BSP) programming model is also an analogous bridge but for parallel computation, introduced by Valiant and Leslie G. A BSP computer can be defined by p processors, each with its local memory, connected via some means of point-to-point communication.