Karthik Sharma – Medium

Karthik Sharma

Snowflake Performance Tuning: Part II (Identifying the join explosion using…

In the previous blog (Snowflake Performance Tuning: Part I), we explored how to identify and resolve join explosions using the query…

Mar 22

Snowflake Performance Tuning: Part II (Identifying the join explosion using…

Mar 22

Snowflake Performance Tuning: Part I (Join Explosion)

Snowflake is a cloud-based data platform that simplifies storing, processing, and analyzing large amounts of data seamlessly across…

Mar 15

Snowflake Performance Tuning: Part I (Join Explosion)

Mar 15

Internals of YARN architecture

Overview

Jul 28, 2021

Internals of YARN architecture

Jul 28, 2021

Different types of failures in Hadoop

One of the major advantage of using Hadoop is its ability to handle failures and allow jobs to complete successfully. In this article we…

Jun 15, 2021

Jun 15, 2021

Understanding different ID’s that are generated during the Map Reduce Application.

In Hadoop 2, Map Reduce jobs are executed using the YARN(Yet Another Resource Negotiator). Let us understand the different id’s that are…

Jun 10, 2021

Understanding different ID’s that are generated during the Map Reduce Application.

Jun 10, 2021

Deep dive into YARN Scheduler options

In real world the clusters are busy and the resources are limited, as a result the applications often need to wait to have some of its…

Jun 9, 2021

Deep dive into YARN Scheduler options

Jun 9, 2021

HDFS Erasure Coding (EC)

Before we start our discussion on what exactly is Erasure coding, let us understand the below two terms and see how HDFS achieve them.

Jun 4, 2021

HDFS Erasure Coding (EC)

Jun 4, 2021

Understanding HDFS commands with examples

Hadoop Distributed File System (HDFS) is file system of Hadoop designed for storing very large files running on clusters of commodity…

Jun 1, 2021

Understanding HDFS commands with examples

Jun 1, 2021

Integrating Kafka with PySpark

In this blog we are going to discuss about how to integrate Apache Kafka with Spark using Python and its required configuration.

Jan 16, 2021

Integrating Kafka with PySpark

Jan 16, 2021

Understanding Parquet and its Optimization opportunities

Introduction to Parquet

Dec 10, 2020

Understanding Parquet and its Optimization opportunities

Dec 10, 2020

Karthik Sharma

Karthik Sharma

Lead Data Engineer

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech