At some point, every company must scale its infrastructure and, in turn, face the challenge of handling ever-larger amounts of data. In this session, leaders from the Apache Hadoop project will show how to write and run applications that store and process petabytes of data.
Questions Answered:
How can I leverage Hadoop for scaling my applications?
What are Hadoop’s strengths and weaknesses?
How has each company implemented Hadoop?
Why did each company choose Hadoop for this work?
How is Hadoop like Google’s MapReduce and the Google File System (GFS)?
What kind of data sets can you use Hadoop to analyze?
What is the biggest set of data Hadoop can handle?