-
PySpark Dataframe: Skipping First Rows and Counting Null Values
This PySpark guide covers skipping the first rows of a file (beyond the header) and counting NULL values in each column of a DataFrame.
-
Master PySpark Functions: Collect_list, Explode, left_anti, Split
The article covers PySpark’s explode, collect_list, and split functions and the left_anti join, with code examples and their outputs.
-
How to Use Databricks Time Travel for Delta Lake Recovery
Databricks’ time travel feature allows users to recover earlier versions of a Delta Lake table, enabling corrections of incorrect data while adhering to retention policies.
-
Python Strings: Tricky Programs on Remove, Sort, and Count
The examples showcase Python operations, including string manipulation, list sorting, and character counting, providing practical techniques for beginners.
-
How to Share Workspace in Databricks: Comprehensive Guide
To share a Databricks workspace, manage access controls and permissions for users and groups.
-
Row Vs. Range SQL Window Functions: Top Differences
The ROWS and RANGE frame clauses in SQL both define a window frame, but ROWS counts physical rows while RANGE groups rows by the value of the ORDER BY column.
-
External Vs. Managed Tables in Databricks: Top Differences
In Databricks, managed tables have both their data and metadata managed by Databricks, while external tables keep their data in an external location and only their metadata is managed.
-
Writing Dataframes into Delta Tables in PySpark: 6 Top Benefits
Writing DataFrames to Delta tables offers data persistence, optimized performance, schema enforcement, transactional consistency, and integration with data systems.
-
A Complete Guide to Databricks Utilities (DBUtils)
Databricks Utilities (DBUtils) provides functionality such as accessing DBFS files, managing clusters, and working with widgets.
-
SQL Query: Extracting Employees Sal > Avg Salary
The SQL query, Pandas, and PySpark code extract employees earning more than the average salary.
-
Using PySpark to Compare Employee Salaries with Their Managers
The PySpark code demonstrates two methods to compare employees’ salaries with their managers’ salaries and retrieve the results.
-
Free E-book: 30 PySpark Interview Questions with Answers
A free e-book with 30 PySpark interview questions and answers for interview preparation.