Scalable System Design Patterns Looking back after 2.5 years since my previous post on scalable system design techniques , I've observed an emergence of a set of common...

NOSQL Patterns Over the last couple years, we see an emerging data storage mechanism for storing large scale of data. These storage solution differs quite...

MongoDb Architecture NOSQL has become a very heated topic for large web-scale deployment where scalability and semi-structured data driven the DB requirement tow...

Designing algorithms for Map Reduce Since the emerging of Hadoop implementation, I have been trying to morph existing algorithms from various areas into the map/reduce model. ...

Machine Learning in R: Clustering Clustering is a very common technique in unsupervised machine learning to discover groups of data that are "close-by" to each othe...

Predictive Analytics: Overview and Data visualization I plan to start a series of blog post on predictive analytics as there is an increasing demand on applying machine learning technique to ana...