Skip to content
  • About Srini
  • Jobs
  • Amazon
  • Udemy
  • Contact us

Join 1,898 other subscribers

Srinimf

Srinimf

Blog

https://srinimf.com/

Blog

https://srinimf.com/

Profile

https://srinimf.com/author/moonrobot/

Profile

https://srinimf.com/author/moonrobot/

Homepage

https://srinimf.com

Homepage

https://srinimf.com

Posts & Replies Posts
  • 5 Must-know AWS Glue Interview Questions Beforehand

    Here are the top AWS Glue interview questions on jobs and monitoring. These can be expected in any interview and are helpful to review beforehand. AWS Glue Interview Questions 01. What is an AWS Glue job? An AWS Glue job is a service from Amazon Web Services that helps you create and run scripts to…

  • Infogain: 5 Tricky Data Engineer Interview Questions

    The Infogain interview Q&A covers PySpark aspects like partitioning, bucketing, reading petabyte-size files, Delta Lake, & schema-less files.

  • PySpark Dataframe: Skipping First Rows and Counting Null Values

    This PySpark guide covers skipping rows (beyond header) and counting NULLs for each column of a DataFrame.

  • Master PySpark Functions: Collect_list, Explode, left_anti, Split

    The article covers PySpark’s Explode, Collect_list, and Anti_join functions, providing code examples and their respective outputs.

  • How to Use Databricks Time Travel for Delta Lake Recovery

    Databricks’ time travel feature allows users to recover earlier versions of a Delta Lake table, enabling corrections of incorrect data while adhering to retention policies.

  • Python Strings: Tricky Programs on Remove, Sort, and Count

    The examples showcase Python operations, including string manipulation, list sorting, and character counting, providing practical techniques for beginners.

  • How to Share Workspace in Databricks: Comprehensive Guide

    To share a Databricks workspace, manage access controls and permissions for users and groups in various ways.

  • Row Vs. Range SQL Window Functions: Top Differences

    ROW and RANGE window functions in SQL operate within a window, but differ in row versus value consideration.

  • External Vs. Managed Tables in Databricks: Top Differences

    In Databricks, managed tables store and manages both data and metadata, while external tables store data externally and manage only metadata.

  • Writing Dataframes into Delta Tables in PySpark: 6 Top Benefits

    Writing DataFrames to Delta tables offers data persistence, optimized performance, schema enforcement, transactional consistency, and integration with data systems.

  • A Complete Guide to Databricks Utilities (DBUtils)

    PySparkDatabricks Utilities (DBUtils) provides functionalities like accessing DBFS files, managing clusters, and widgets.

  • SQL Query: Extracting Employees Sal > Avg Salary

    The SQL query, Pandas, and PySpark code extract employees earning more than the average salary.

Previous Page
1 … 16 17 18 19 20 … 177
Next Page

About Srinimf

We share solutions for software developers and interview questions.

2,760,398 hits

Subscribe for DAILY TIPS

Join our mailing list to stay notified about new blog posts. No spam, we guarantee.

  • Tumblr
  • Facebook
  • Instagram
  • WordPress
  • X

Srinimf

Designed with WordPress

Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy
  • Subscribe Subscribed
    • Srinimf
    • Join 266 other subscribers
    • Already have a WordPress.com account? Log in now.
    • Srinimf
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar