69 results
Azure Databricks Learning: Performance Optimization: Spark/Databricks Interview Question Series - II ...
13,929 views
2 years ago
The SQL tab in the Spark UI provides a lot of information for analysing your Spark queries, ranging from the query plan, to all ...
18,166 views
5 years ago
Learn PySpark, an interface for Apache Spark in Python. PySpark is often used for large-scale data processing and machine ...
1,679,693 views
4 years ago
Examples of these cost-based optimizations include choosing the right join type (broadcast-hash-join vs. sort-merge-join), ...
9,533 views
In this video I talk about salting in Spark. Directly connect with me on: https://topmate.io/manish_kumar25 Discord ...
39,404 views
Spark SQL provides a convenient layer of abstraction for users to express their query's intent while letting Spark handle the more ...
6,314 views
In rapidly changing conditions, many companies build ETL pipelines using an ad-hoc strategy. Such an approach makes automated ...
6,715 views
Nowadays, Spark is widely adopted in large enterprises for handling large volumes of data. At PayPal, more and more complex ...
534 views
Learn about RDDs, DataFrames, optimization techniques, and more, with detailed explanations and practical examples tailored to ...
303 views
1 year ago
The Delta Architecture pattern has made the lives of data engineers much simpler, but what about improving query performance ...
8,887 views
You've seen the technical deep dives on Spark's Catalyst query optimizer. You understand how to fix joins, how to find common ...
1,422 views
Over the last year, we have added a series of optimizations in Apache Spark to solve the above problems for Parquet.
1,607 views
In this video tutorial we walk through a time series forecasting example in Python using the machine learning model XGBoost to ...
586,995 views
3 years ago
To this end, we'll discuss several Catalyst optimizations around implementing a hybrid skew join in Spark (that broadcasts ...
2,407 views
To this end, we'll discuss several Catalyst optimizations to automatically rewrite feature injection/reaping queries as a SQL ...
2,818 views
This talk will break down merge in Delta Lake (what is actually happening under the hood) and then explain how you can ...
16,097 views
These file formats also employ a number of optimization techniques to minimize data exchange, permit predicate pushdown, and ...
8,668 views
Over the last year, we have added a series of optimizations in Apache Spark to eliminate the above limitations so that the new ...
7,095 views
Join us for a four part learning series: Introduction to Data Analysis for Aspiring Data Scientists. This is the fourth of four online ...
20,538 views
Streamed 5 years ago
This talk will introduce TeraCache, a new scalable cache for Spark that avoids both garbage collection (GC) and serialization ...
366 views