Menu
About Us
Big Data
Big Data Pipeline Development
Big Data Services
Contact Us
Data Analytics
Data Analytics Services
Data Governance and Compliance
California Consumer Privacy Act (CCPA)
Children’s Online Privacy Protection Act (COPPA)
Data Governance Compliances in USA
Fair Credit Reporting Act (FCRA)
Family Educational Rights and Privacy Act (FERPA)
General Data Protection Regulation (GDPR)
Health Insurance Portability and Accountability Act (HIPAA)
Internal Revenue Code Section 7216 (IRC 7216)
Sarbanes-Oxley Act (SOX)
Data Pipeline Development
Data Services
Home
Nixon Data: Powering AI by moving datasets from various data sources to Data lake
Increasing Kafka Partition
Interview Screening Service
IT Staffing
Knowledge Articles
Amazon Web Services Essentials
AWS EKS Tutorial
Comparing ECS and EKS: A Detailed Look at the Key Differences between Amazon’s Container Services
ETL in AWS
GitHub host a website for free, Steps and example
How to communicate between 2 AWS Accounts?
How to create a Website using React Js, AWS Lambda, AWS S3 ?
How to save cost on AWS S3?
How to secure infrastructure running on AWS ?
Most Important Linux Commands Cheat Sheet
Reasons you always pay more on AWS S3 than your estimates
What are AWS EC2, ECS, and EKS, and their Comparison, advantages, disadvantages, and example
What are the advantages and disadvantages of using AWS Lambda, and how to secure it?
What is Amazon Web Services (AWS) Kinesis, what are its advantages and disadvantages, and how to set it up
What is AWS VPS, and what are its components
What is IAM role, policy, group and assumeRole in AWS
What is the difference between AWS SNS, SQS, Kinesis, MSK
Apache Spark For Experts
How to build a serverless streaming pipeline on AWS
How to calculate cluster configuration in Apache Spark
How to calculate the number of tasks for a job in apache spark
How to create a Big Data Pipeline on AWS cloud infrastructure
How to create Data pipeline on AWS
How to create delta lake pipeline on AWS
How to decide Driver and Executor config in Apache Spark?
How to run Apache Spark on AWS Lambda
How to tune the number of executors, tasks, and memory allocation for a Spark application
Learn How to Setup Apache Spark on AWS EC2
Running Apache Spark on Amazon Web Services, Services, how to run and examples
Spark Functions vs UDF Performance Comparison in Apache Spark
Steps to create a Streaming Data pipeline using AWS Glue
Understanding Apache Spark’s Important Application Properties and Optimization Recommendations
Ways to optimize Spark job – 1
What are the ways to optimize Apache Spark Job
What is Catalyst Optimizer in Apache Spark
What is Compression in Apache Spark, how to use it, advantages and disadvantages
What is Data Skew? How to identify Data Skew from Data Skew? Impact? Fix?
What is Delta Lake ? What is its advantage
What is Tungsten Memory Manager in Apache Spark, how to use, advantage and disadvantage
What is Vacuum in Apache Spark, Its advantages, disadvantages, and how to use Vacuum
What is Z ordering or Morton ordering or Z-curve ordering, what are its advantages and disadvantages, and how to use it
Apache Spark Fundamentals
What is SparkContext? A Comprehensive Guide to Apache Spark’s Execution Engine
accumulators and broadcast variables in spark
Creating Empty Dataframe in Apache Spark
Dealing with SparkPartitionCoalescingException in Apache Spark: Reasons and Solutions
Difference between SparkContext and SQLContext in Apache Spark
List of Important Apache Spark Commands commonly used
Maximizing Big Data Analytics with Spark Session: A Complete Guide
Narrow Vs Wide Transformation
Reasons for Spark Job Failure
Spark.app.name Property in Apache Spark: Understanding and Utilization
Spark.driver.cores Property in Apache Spark: Understanding and Utilization
SQLContext in Apache Spark – Tutorial
Tutorial on User Defined Function (UDF) in Apache Spark
Understanding HiveContext in Apache Spark: A Comprehensive Guide
Understanding Narrow and Wide Transformations in Apache Spark
Understanding Resilient Distributed Datasets (RDDs) in Apache Spark: A Comprehensive Guide
Understanding the Differences between repartition() and coalesce()
What are Actions and Transformations in apache spark
What are Driver and Executor in Apache Spark
What are Jobs, Stage, Task in Apache Spark
What are RDD, Dataframe and Dataset in Apache Spark
What are the Common reasons behind Job Failure in Apache Spark?
What is _delta_log in Delta Lake Table
What is an accumulator in Apache Spark, how to create accumulator, usecase and accumulator variable example
What is Apache Spark ? What are its advantages and where is it being used ?
What is narrow and wide transformation in spark
What is RDD in Apache Spark ? How to create an empty RDD ?
What is SparkDriverExecutionException, reasons and resolution
What is SparkSession in Apache Spark – Full tutorial
What is the best language to write an apache spark application?
What is the Broadcast Variable in Apache spark, how to create a broadcast variable
What is the difference between Apache Spark and Hadoop
What is the difference between Batch and Structured Streaming in Apache Spark
What is the Prerequisites before start learning Apache Spark
What is Wide and Narrow Transformation in Apache Spark
When to use the broadcast variables in Apache Spark
Wide Transformation In Spark
Big data Fundamentals
Apache NiFi
Apache Pinot
Avro Vs Parquet
ETL Tools
Full Guide to Read Data from Facebook Marketing APIs and writing it to a Kafka topic
Full Guide To Read Data from Google Ads APIs and Writing Data to a Kafka Topic
Full Guide to read data from Google Analytics APIs and writing it to a Kafka topic
How to Host a Website for Free on GitHub
Important YARN Commands
List of ETL Tools, their Advantage, Disadvantage and Use cases
List of Top Open Source ETL Tools
Slowly Changing Dimension (SCD)
Top 20 Popular Free/Open-Source ETL Tools for 2023
What are CDC (Change Data Capture) events?
What are the Big Data File formats used
What are the challenges with Big Data?
What is a big data pipeline ?
What is Cloud Event, what are its advantages, and where it is being used?
What is Data Cleansing and Transformation in Big Data?
What is Debezium, How to use it
What is ETL, What are ETL Tools, List of Open Source ETL Tools, ETl Tools available in AWS
What is Garbage Collection
What is Jstat
What is JVisualVm, How to use it to capture garbage collection?
What is OLAP and OLTP, use-cases, comparison, and examples
What is Parquet?
What is Spring Data Flow, and what are its advantages and disadvantages
What is the difference between Avro and Parquet file format
Hadoop Fundamentals
Hadoop Commands Cheat Sheet
What is Hadoop, what are its advantage, and where it is used?
What is HDFS , what are its advantages and where is it being used?
What is YARN (Yet Another Resource Negotiator), uses and advantages
What is the list of Important YARN Commands
Hive Fundamentals
How to Compare 2 Hive Partitions?
How to Compare 2 Hive Tables?
List of most asked Hive Interview Questions and Answers
Tutorial on Types Of Hive tables
What are the components in Apache Hive
What is Hive Metastore (HMS), What are its uses and Steps to create Hive metastore on AWS
What is Hive, its uses, and advantages?
Interview Questions
50+ Data Engineering Interview Questions
Apache Spark Interview Question
HikariCP: A High-Performance JDBC Connection Pool for Java
How to Make Good Reproducible Apache Spark Examples
How to Select the First Row of Each Group in Apache Spark
List of Java Exceptions with reason and Examples
Python Script to Fetch Youtube Subscriber Count
Kafka Fundamentals
All About Exactly Once in Apache Kafka Delivery Guarantee
Apache Camel Kafka: A Comprehensive Guide for Developers
Apache Kafka GoldenGate Adapter: A Guide
Apache Kafka: Delivery Guarantees
At Least Once Delivery in Apache Kafka
At Most Once Delivery in Apache Kafka
AWS MSK
How is data stored in Apache kafka ?
How to create Kafka Connect to read from AWS MySql RDS Instance
java.lang.IllegalStateException: Error processing condition on org.springframework.boot.autoconfigure.kafka.KafkaAutoConfiguration.kafkaProducerListener
Kafka Connect MySQL: A Comprehensive Guide
Kafka connect usecases
Kafka Hash Partitioner: The Ultimate Guide
org.apache.kafka.common.errors.GroupAuthorizationException: Not Authorized to Access Group
org.apache.kafka.common.errors.topicauthorizationexception not authorized to access topics
org.apache.kafka.common.KafkaException: Failed to Construct Kafka Producer
What is a Kafka broker and what role does it play in the Kafka ecosystem? Replication, Load balancing, network partitioning, handle failure scenario
What is a message Broker, what types of message brokers, and list of message brokers available in the market
What is Apache Kafka and what are its common use cases?
What is Debezium Kafka Connector? What are its usecases? How to create Debezium Kafka Connector? What are its Advantages, Disadvantages and limitations?
What is Kafka Connect Used For
What is Kafka Connect, Types,use cases, apache kafka connector list
What is Kafka Partition? What is its internal working? How does the partitioning strategy determine which partition a message is written to? How do partitions enable scalability in Kafka?How do consumers read data from partitions in Kafka?
What is Schema Registry? How to create a schema registry? What is schema evolution?
What is the Difference between Apache Kafka and Apache Flink
Kubernetes Fundamentals
Full Kubernetes Kubectl Cheat Sheet
Kubernetes Tutorial for Beginners: Basics, Features, Architecture
Referral Partnership Program
NixonData
Remote Development Center
Services
Amazon Web Services Essentials