What is ACID function and how it was impacting into Data lake storage environments? –Part5

Apache Hive

  • Tools to enable easy access to data via SQL, thus enabling data warehousing tasks such as extract/transform/load (ETL), reporting, and data analysis.
  • A mechanism to impose structure on a variety of data formats
  • HCatalog is a table and storage management layer for Hadoop that enables users with different data processing tools — including Pig and MapReduce — to more easily read and write data on the grid.
  • WebHCat provides a service that you can use to run Hadoop MapReduce (or YARN), Pig, Hive jobs. You can also perform Hive metadata operations using an HTTP (REST style) interface.

Insert Data

Update Data

Delete Data

Merge Data

Conclusion

--

--

--

I am Big Data Engineer & Solution Architect experience in various Cloud & Big data distribution systems, primarily on Hadoop & AWS Cloud services.

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Build SMS Spam Classification Model using Naive Bayes & Random Forest

Understanding Press Briefing through the Lens of TEXT Analytics: President Trump’s Meetings with…

How to handle Imbalanced Classification Problems

A Solution to Transfer Data from MySQL8 Database to a Data Warehouse

What are the some of the over hyped terms and what do they mean?

First steps, searching for data with elastic

Testing SQL the hard way

Machine Learning Algorithns With Scikit-Learn

Image by Peter Kraayvanger from Pixabay

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Selvam Rangasamy-Big Data Engineer & Solution Arch

Selvam Rangasamy-Big Data Engineer & Solution Arch

I am Big Data Engineer & Solution Architect experience in various Cloud & Big data distribution systems, primarily on Hadoop & AWS Cloud services.

More from Medium

Partitioning a Database — An Unified Reflection — Draft Relas

Attempt to connect to cloudera manager issue resolved.

Join operations in Hadoop

Import/Export Using Apache Sqoop