Karthik Sharma

137 followers

Snowflake Performance Tuning: Part II (Identifying the join explosion using…

In the previous blog (Snowflake Performance Tuning: Part I), we explored how to identify and resolve join explosions using the query…

Mar 22

Snowflake Performance Tuning: Part I (Join Explosion)

Snowflake is a cloud-based data platform that simplifies storing, processing, and analyzing large amounts of data seamlessly across…

Mar 15

Internals of YARN architecture

Overview

Jul 28, 2021

Different types of failures in Hadoop

One of the major advantages of using Hadoop is its ability to handle failures and allow jobs to complete successfully. In this article we…

Jun 15, 2021

Understanding different ID’s that are generated during the Map Reduce Application.

In Hadoop 2, MapReduce jobs are executed using YARN (Yet Another Resource Negotiator). Let us understand the different IDs that are…

Jun 10, 2021

Deep dive into YARN Scheduler options

In the real world, clusters are busy and resources are limited; as a result, applications often need to wait to have some of their…

Jun 9, 2021

HDFS Erasure Coding (EC)

Before we discuss what exactly erasure coding is, let us understand the two terms below and see how HDFS achieves them.

Jun 4, 2021

Understanding HDFS commands with examples

Hadoop Distributed File System (HDFS) is the file system of Hadoop, designed for storing very large files and running on clusters of commodity…

Jun 1, 2021

Integrating Kafka with PySpark

In this blog, we discuss how to integrate Apache Kafka with Spark using Python, along with the required configuration.

Jan 16, 2021

Understanding Parquet and its Optimization opportunities

Introduction to Parquet

Dec 10, 2020

Lead Data Engineer

Following
  • Clark Perucho
  • Barkha Saxena
  • Vetrivel_PS
  • Deniz Parmaksız
  • Halil Ertan
