Hadoop’s Rise in Life Sciences

By now the ‘Big Data’ challenge is familiar to the entire life sciences community. Modern high-throughput experimental technologies generate vast data sets that can only be tackled with high performance computing (HPC). Genomics, of course, is the leading example. It is worth noting that a single base pair typically represents about 100 bytes of data once its raw, analyzed, and interpreted forms are counted; at that rate, a human genome of roughly 3 billion base pairs amounts to about 300 GB. The need to manage and analyze massive data sets has spurred many new approaches to HPC infrastructure, and one of them, the Hadoop storage and compute framework, is emerging as a preferred architecture on both cost and performance. Learn why and how Hadoop is ideally suited for life science applications and how EMC Isilon has made working with Hadoop easier.