The Tool For Processing Big Data – Hadoop

In our previous blog we learned that Hadoop is the platform that processes and organizes Big Data. Here we will take a closer look at Hadoop, a core platform for structuring Big Data and making it usable for analytics. Hadoop is an open-source software framework for the distributed storage and distributed processing of Big Data on clusters of commodity hardware.

Main characteristics of Hadoop:

  • Highly scalable (scales out by adding nodes)
  • Runs on commodity hardware
  • Open source, with low acquisition and storage costs

Hadoop is basically divided into two parts: HDFS (the Hadoop Distributed File System) and the MapReduce framework. A Hadoop cluster is specially designed for storing and analyzing huge amounts of unstructured data. The workload is distributed across multiple cluster nodes, which process the data in parallel.
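To make the split between storage and processing concrete, here is a minimal sketch of the classic word-count job written against the Hadoop MapReduce Java API (org.apache.hadoop.mapreduce). The input and output arguments are assumed to be HDFS directory paths supplied on the command line.

```java
import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Mapper: runs in parallel on each input split, emitting (word, 1) pairs.
  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {

    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE);
      }
    }
  }

  // Reducer: receives all counts for a given word and sums them.
  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {

    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class);   // local pre-aggregation on each node
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));    // HDFS input directory
    FileOutputFormat.setOutputPath(job, new Path(args[1]));  // output directory (must not exist)
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

Each mapper processes one input split on whichever node holds that data, and the reducers then aggregate the intermediate (word, count) pairs, which is exactly the parallel, divide-the-work model described above.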

History of Hadoop

Doug Cutting is the brains behind Hadoop, which has its origins in Apache Nutch, an open-source web search engine started in 2002. In 2004 Google published the paper that introduced MapReduce to the world, and by early 2005 the Nutch developers had a working MapReduce implementation inside Nutch. In February 2006 Hadoop was split out of Nutch as an independent project. In January 2008 Hadoop was made its own top-level project at Apache, and by this time major companies like Yahoo and Facebook had started using Hadoop.

HDFS is the storage layer of Hadoop and MapReduce is its processing layer. HDFS has an architecture of its own that governs how data is stored, organized and accessed. To get into the details of HDFS, its architecture, how it works and several other concepts, keep an eye on the blogs that will be published in the coming days.
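As a small preview before those posts: applications talk to HDFS through the org.apache.hadoop.fs.FileSystem Java API, reading and writing files without having to think about blocks or replication. The sketch below assumes a reachable cluster; the hdfs://namenode:8020 address and the /tmp/hello.txt path are placeholder values.

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsHello {
  public static void main(String[] args) throws Exception {
    // Normally picked up from core-site.xml; the URI here is a placeholder NameNode address.
    Configuration conf = new Configuration();
    conf.set("fs.defaultFS", "hdfs://namenode:8020");

    FileSystem fs = FileSystem.get(conf);

    // Write a small file; HDFS splits it into blocks and replicates them
    // across DataNodes behind the scenes.
    Path file = new Path("/tmp/hello.txt");
    try (FSDataOutputStream out = fs.create(file, true)) {
      out.write("Hello, HDFS!\n".getBytes(StandardCharsets.UTF_8));
    }

    // Read it back as a stream, just like a local file.
    try (BufferedReader reader = new BufferedReader(
             new InputStreamReader(fs.open(file), StandardCharsets.UTF_8))) {
      System.out.println(reader.readLine());
    }

    fs.close();
  }
}
```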

Manasa Heggere
Senior Ruby on Rails Developer
