Want to try out Hadoop on the Microsoft stack and see what capabilities it brings you? Here are some resources that can help.
Microsoft announced a partnership with Hortonworks last year to bring Hadoop to Windows Server and Windows Azure. Microsoft’s vision revolves around making Hadoop and related Big Data tools trivially accessible to the regular IT end user. To this end, it integrates with SQL Server Analysis Services and Reporting Services as well as Excel PowerPivot.
So what can you actually do right now? The links below show what is possible and help you get started:
- How To Connect Excel PowerPivot To Hive on Azure
- Hadoop Basics for SQL Server DBAs
- Hadoop on Azure is currently an invitation-only preview. Roger Jennings has a good walkthrough with a sample project
- Introduction to Hadoop on Windows Azure video on Channel 9
- Generic based Framework for .NET Hadoop MapReduce Job Submission
If you are new to Hadoop and MapReduce, you can also refer to the following:
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. The library itself is designed to detect and handle failures at the application layer, making it possible to deliver a highly-available service on top of a cluster of commodity machines, each of which may be prone to failures.
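To make the "simple programming model" a little more concrete, here is a minimal sketch of the MapReduce idea as a word count. It is plain Python running in a single process, not Hadoop code, and it does not use any Hadoop API. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are made up for illustration. On a real cluster, the framework would run the map and reduce steps on many machines and handle the grouping and failure recovery itself.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, standing in for what the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all the counts emitted for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence on a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, purely for illustration.
    sample = ["big data on Windows", "Big Data on Azure"]
    print(word_count(sample))
```

The point of the split is that each `map_phase` call only needs one line of input and each `reduce_phase` call only needs the values for one key, which is what lets a framework like Hadoop spread the work across many machines.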