Skip to content
  • About Srini
  • Jobs
  • Amazon
  • Udemy
  • Contact us

Join 1,898 other subscribers

Srinimf

Srinimf

Blog

https://srinimf.com/

Blog

https://srinimf.com/

Profile

https://srinimf.com/author/moonrobot/

Profile

https://srinimf.com/author/moonrobot/

Homepage

https://srinimf.com

Homepage

https://srinimf.com

Posts & Replies Posts
  • Parquet vs ORC vs Avro: Top Differences Explained

    This content compares the performance and features of three data formats: Parquet, ORC, and AVRO. Parquet and ORC are columnar formats optimizing storage and query performance, while AVRO is row-oriented, supporting schema evolution for varied workloads. Each format is suited for specific big data applications, emphasizing efficiency and compatibility.

  • AWS Step Functions and AWS Glue Job Workflow Configuration

    Here’s how you can set up an architecture. An Amazon S3 file upload triggers an AWS Lambda function via Amazon EventBridge (formerly known as CloudWatch Events). This function then starts an AWS Step Function workflow. This workflow triggers an AWS Glue job. Step-by-Step Overview Step 1: Configure S3 Bucket to Emit Events Step 2: Configure…

  • AWS: 3 Easy to Write Lambda Functions

    Here are three examples of AWS Lambda functions for different use cases. These include the hello world function, image resizing, and fetching data from DynamoDB. 1. Basic Hello World Function This is a simple AWS Lambda function that returns a “Hello, World!” message. It’s often used in the AWS Lambda to understand the basics. def lambda_handler(event, context): return { ‘statusCode’:…

  • How to Delete Source Object After Glue Job Run Complete

    Deleting S3 objects post-Glue job streamlines data management, frees up space, and maintains a clean dataset for analysis.

  • CSV Column Validation Using PySpark: Step-by-Step Guide

    The Python code demonstrates CSV file validation using PySpark. Validation rules are applied to columns, and the resulting dataframes are written to S3 and PgSQL.

  • 20 Python Pandas Interview Questions and Answers

    Pandas is a data manipulation library for Python, offering Series, DataFrame, CSV, merging, grouping, and visualization capabilities.

  • Group By Vs Partition By: Here’s the Right Answer

    SQL uses GROUP BY to aggregate data into summary rows, while PARTITION BY aids window functions in dividing result sets.

  • How to Create FastAPI in VisualStudioCode

    This content explains creating a FastAPI application with a binary number divisibility check endpoint and its interaction using Swagger.

  • Effective Strategies for Databricks Cluster and Job Optimization

    Optimizing performance in Databricks involves best practices for Spark, cluster config, data management, and code optimization.

  • How to Read Secret Manager Data in AWS Glue

    You can read a secret from AWS Secrets Manager in AWS Glue using boto3 library for Python. Ensure IAM permissions.

  • PySpark Quiz: Crack Your Interview Effortlessly

    PySpark quiz covers main features, distributed computing, DataFrame creation, SparkSession, data manipulation, lazy evaluation, missing values, and data I/O.

  • AWS Logging Best Practices for Effective Monitoring

    AWS provides CloudWatch and AWS CloudTrail for log monitoring, troubleshooting, and auditing your cloud environment.

Previous Page
1 … 14 15 16 17 18 … 177
Next Page

About Srinimf

We share solutions for software developers and interview questions.

2,760,380 hits

Subscribe for DAILY TIPS

Join our mailing list to stay notified about new blog posts. No spam, we guarantee.

  • Tumblr
  • Facebook
  • Instagram
  • WordPress
  • X

Srinimf

Designed with WordPress

Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy
  • Subscribe Subscribed
    • Srinimf
    • Join 266 other subscribers
    • Already have a WordPress.com account? Log in now.
    • Srinimf
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar