Balas Blog..: Hadoop

Thursday, October 17, 2013

Search Engine For Hadoop

Vertascale Announces SimpleSearch(beta), The Search Engine For Hadoop - See more at: http://www.toolsjournal.com/cloud-articles/item/1449-vertascale-simplesearch#sthash.i44tZugb.dpuf

Big Data software developer Vertascale has announced the opening of its private beta for the company’s inaugural product, SimpleSearch. The SimpleSearch software provides powerful real-time query and summary analysis capability for structured and mix-structured data stored in Amazon S3 or the Hadoop File System (HDFS).

SimpleSearch - The Search Engine For Hadoop is aimed at providing benefit to engineers, data scientists and business analysts looking for faster “time to answer” on Big Data, the company said.

While Hadoop has been successful in addressing the problems of Big Data storage and batch processing, Vertascale’s founders recognized early on that in order to democratize access to Big Data, users needed an intuitive ad-hoc query capability. Vertascale is based in Menlo Park, California, USA.

“Today’s problem is less about storing data and more about being able to actually find what you’re looking for in the data. Everyone working with Big Data is challenged by the ‘I don’t know what I don’t know’ problem, and the prohibitively long iteration cycles. SimpleSearch lets you find, explore and export large data sets quickly and easily in a way that’s scalable and cost effective,” said Vertascale CTO Geoffrey Hendrey.

Added Vertascale President James Ladd, “With the rapid adoption of Hadoop, companies that recognize the value inherent in their data are also looking for simplicity and speed in querying Big Data.”

Vertascale will be demonstrating SimpleSearch, The Search Engine For Hadoop, at the Innovators Pavilion P23 at Strata 2013 in Santa Clara, USA.

- See more at: http://www.toolsjournal.com/cloud-articles/item/1449-vertascale-simplesearch#sthash.i44tZugb.dpuf

Wednesday, October 9, 2013

What are the Parts available in Hadoop 1.0?

Hadoop consists of two pieces

* Map Reduce

* HDFS

*Map Reduce

This is Processing part of data.

for example: Job will run and nodes will be created.

*HDFS: Hadoop Distributed File System.

This is storing part of the data.

Happy coding.

Bala

What is the Apache Hadoop?

=> Solutions for Big Data.
It's deals with complexities of High Volume, Velocity and Variety of data.

=> Set of Open Source Projects.

=>Transforms commodity hardware in to services that

It's stores petabytes of data reliability
Allows huge distributed computations.

=>Key Attributes.

It's redundant and reliable.
Extremely powerful.
Batch processing centric.
Easy to program distributed applications.
It's run on commodity hardware.

Balas Blog..

Thursday, October 17, 2013

Search Engine For Hadoop

Wednesday, October 9, 2013

What are the Parts available in Hadoop 1.0?

What is the Apache Hadoop?

Technology

Start Counter