Click here for our Udemy course on Machine Learning :- Machine Learning Deep Learning Model Deployment
Apache NiFi step by step for all level : Beginner to Expert
Udemy Course Link:- Apache NiFi step by step for all level : Beginner to Expert .
What is DBT (Data Build Tool)? How it is different from normal ETL tools?
What is DBT (Data build tool) and how it is different from existing ETL tools in industry
Apache Nifi Custom Processor and Controller services
1. Generate Nifi Processor Template code using the maven archetype :- 1. Open command prompt and run “mvn archetype:generate” You will get list of Archtypes […]
Installation of JDK and Apache Maven
Installation of Java and Apache Maven
Structured Streaming Data storage in Hive Table
In this post we talk about how you can read data from files using Spark Structured Streaming and store the output in a Hive table […]
Capture bad records while loading csv in spark Dataframe
Loading a csv file and capturing all the bad records is a very common requirement in ETL projects. Most of the relational database loaders like […]
Deployment modes and Job submission in Apache Spark
Spark is a Scheduling Monitoring and Distribution engine, it can also acts as a resource manager for its jobs. When Spark runs job by itself […]
What is an RDD and Why Spark needs it?
Resilient Distributed Data set(RDD) is the core of Apache Spark. It is the fundamental data structure on top of which all the spark components reside. […]
What is Apache NiFi?
Apache NiFi is an open source software to automate and manage the flow of data between different systems. It provides a web-based UI for creating monitoring […]