So, now that I’ve done my best to explain the origins of Hadoop and where it is today, as well as what it can do for us tomorrow, I’d like to spend a bit of time talking about the potential Cisco sees in Hadoop, and what they’re doing to put that power in their users’ hands.
The first thing any server manufacturer who wants to cash in on the Hadoop hype needs to do is to choose a Hadoop distribution to support. Seeing that Cisco and EMC are already tightly aligned in several areas within the data center, it was a natural fit for Cisco to piggyback onto the EMC strategy and utilize their expertise in the area.
The Cisco Hadoop solution consists of Cisco’s C-series rack servers, built into half rack and full rack combinations (for both performance and for capacity approaches). Cisco then layers on the EMC Greenplum MR stack (which is an OEM version of MapR’s Hadoop distribution) and some added treats to help accelerate performance and add extra value to the MapR stack.
Cisco UCS has shown that it can provide some of the industry’s finest density, performance and management without sacrificing anything along the way, so coupling the platform with a tried and tested Hadoop solution is a natural step in Cisco’s evolution within the server market.
All very fun stuff, but wait….. There’s more!
Cisco has taken a look at where many organizations struggle when it comes to running large Hadoop environments, and has decided that workload scheduling is the major roadblock to enterprise-wide Hadoop adoption. The reason for this is that most Hadoop implementations start really small and are used for a specific problem a company is looking to solve. As the company realizes Hadoop can be used for many different areas, they start to deploy many small Hadoop clusters, each a silo that is focused on a specific area. As the technology gains momentum internally, and as users learn how to apply the Hadoop approach to a broader cross section of questions to which they want answers, their Hadoop number of nodes expand quickly from the 10 to 100 and beyond!
At this point, managing workload scheduling and execution for many applications and hundreds of Hadoop jobs is an unruly task and one that requires extra tools, as well as a centrally-managed approach versus a departmental management approach.
To facilitate this kind of control Cisco is leaning on the acquisition it made of Tidal Software, and more specifically the Tidal Enterprise Scheduler, which is a comprehensive workload scheduler and management system. With the 6.1 release the software can now control workloads and their complex scheduling of up to around 40,000 jobs per month, and is certified to do so on Cloudera, MapR and EMC/Greenplum distributions.
Cisco is obviously leading the Tier-1 charge towards providing a comprehensive and easy to manage solution for clients looking to dive into the exciting world of Hadoop. Not only that, but they have taken it a step further with the ability to manage the Tidal Workload Scheduler from an iPhone. Yes, that’s right: you can now run 1000’s of Hadoop jobs while sipping a cold beer hundreds or thousands of miles away from the physical iron. Now that’s convenience, and with the Enterprise in mind.

