February 14, 2026

CLoudOps Now!

Month: August 2025

Mohammad Gufran Jahangir August 13, 2025

Databricks COPY INTO — Idempotent Ingestion & Exactly‑Once Design (Deep‑Dive)

Databricks COPY INTO is a lightweight, SQL‑first way to load files from cloud storage (ADLS/S3/GCS) into Delta tables. It’s retryable and idempotent—you can schedule it safely without creating duplicates because…

Mohammad Gufran Jahangir August 11, 2025

Mastering Databricks Utilities (dbutils) — Hierarchy, Examples, and Use Cases

Databricks provides the dbutils module as a built-in set of utilities to interact with Databricks services directly from notebooks. These utilities allow you to: 1. Hierarchy Overview The main structure of…

Mohammad Gufran Jahangir August 11, 2025

Understanding DBFS, ABFSS, and Other Storage Locations in Databricks

When working with Databricks, you will store and access data from various locations: some are Databricks-native (like DBFS), and others are external (like Azure Data Lake Storage via ABFSS). Choosing…

Mohammad Gufran Jahangir August 11, 2025

Liquid Clustering & Deletion Vectors in Delta Tables – A Complete Guide with Examples

1. Introduction As datasets in Delta Lake grow into terabytes or petabytes, performance and storage optimization become critical. Liquid Clustering and Deletion Vectors are two modern techniques in Databricks that solve:…

Mohammad Gufran Jahangir August 11, 2025

Fixing the “cannot import name ‘DSSKey’ from ‘paramiko’” Error in Databricks When Using pysftp

If you’ve been working with SFTP connections in Databricks using the pysftp library, you might have run into this frustrating error: This typically pops up when your Databricks job or…

Mohammad Gufran Jahangir August 9, 2025

Understanding Roles and Permissions in Databricks: Part-4

Databricks provides a role-based access control (RBAC) model to manage permissions across workspaces, data, and administrative functions. Knowing what each role does is essential for governance, security, and efficient platform…

Mohammad Gufran Jahangir August 9, 2025

Databricks High-Level Architecture: Control Plane vs Data Plane: Part-3

Databricks is designed to separate management from data processing for better security, scalability, and compliance. This is achieved through two main components: the Control Plane and the Data Plane. 1.…

Mohammad Gufran Jahangir August 9, 2025

Understanding Databricks Account, Workspaces, and Governance Setup: Part-2

Databricks offers a multi-workspace architecture that can span across multiple cloud providers like AWS, Azure, and GCP. Managing this setup effectively requires understanding accounts, workspaces, user management, and metastore governance.…

Mohammad Gufran Jahangir August 9, 2025

Understanding the Databricks Lakehouse Architecture – Powered by Generative AI: Part-1

The Databricks Lakehouse Platform brings together data engineering, data science, machine learning, and analytics into a single unified platform. The diagram above visually breaks down how different components work together…

Mohammad Gufran Jahangir August 9, 2025

Exploring Advanced Data Ingestion & Sharing Features in Databricks

Databricks is constantly evolving, introducing features that make data ingestion, sharing, and collaboration more seamless. While you may be familiar with the basics of loading data into Databricks, there’s a…

Mohammad Gufran Jahangir August 9, 2025

Ingesting Semi-Structured Data: JSON in Databricks

Semi-structured data is everywhere — from web APIs and IoT devices to log files and application events. One of the most common semi-structured formats is JSON (JavaScript Object Notation). It’s…

Mohammad Gufran Jahangir August 9, 2025

Working with the Rescued Data Column in Databricks

In real-world data ingestion scenarios, it’s common to deal with inconsistent or malformed data. When you’re enforcing a schema in Databricks, these mismatches can cause ingestion failures or loss of…

Mohammad Gufran Jahangir August 9, 2025

Data Ingestion from Cloud Storage in Databricks: CTAS, COPY INTO, and Auto Loader Compared

In today’s data-driven world, the ability to ingest data from cloud storage into Databricks efficiently can make or break the success of your data pipelines. Databricks provides multiple ingestion methods,…

Mohammad Gufran Jahangir August 8, 2025

Understanding Executors in Databricks: A Complete Guide with Debugging Tips

When running workloads in Databricks, one of the most important concepts to understand for performance tuning…

Mohammad Gufran Jahangir August 7, 2025

Basics of Writing PowerShell Scripts

🧩 1. What is PowerShell? 🛠️ 2. Basic Structure of a PowerShell Script Save as myscript.ps1 and run it in PowerShell. 🧱 3. Core Building Blocks ✅ Variables ✅ Strings…

Mohammad Gufran Jahangir August 7, 2025

General Build Pipeline (CI) in Azure DevOps

Let’s walk through a general-purpose Azure DevOps build pipeline template that can be applied to most common application types (web apps, APIs, microservices, etc.) using CI/CD best practices, excluding Azure…

Mohammad Gufran Jahangir August 7, 2025

How to Monitor Long-Running Sessions in Azure Databricks

To monitor and troubleshoot long-running sessions in Azure Databricks, it’s essential to understand the execution hierarchy and how to navigate through the Spark UI. Below is a complete guide with…

Mohammad Gufran Jahangir August 7, 2025

What is Shuffle Read & Shuffle Write?

Shuffle is Spark’s mechanism to redistribute data across partitions, typically during wide transformations like: 🔵 Shuffle Read: 🔴 Shuffle Write: ✅…

Mohammad Gufran Jahangir August 7, 2025

Salting, Repartitioning, and Broadcast Joins in Spark Databricks

Here’s a clear and structured explanation of salting, repartitioning, and broadcast joins in Spark — including how they work and when to use them — with simple examples. 🔹 1.…

Mohammad Gufran Jahangir August 7, 2025

What is spark.sql.shuffle.partitions?

spark.sql.shuffle.partitions is a Spark SQL configuration parameter that controls the number of output partitions created during shuffle operations, such as: 🧠 Why is it important? Shuffling…

