Databricks Related Exams
Databricks-Certified-Associate-Developer-for-Apache-Spark-3.5 Exam
You have:
DataFrame A: 128 GB of transactions
DataFrame B: 1 GB user lookup table
Which strategy is correct for broadcasting?
21 of 55.
What is the behavior of the function date_sub(start, days) if a negative value is passed into the days parameter?
A data engineer has been asked to produce a Parquet table which is overwritten every day with the latest data. The downstream consumer of this Parquet table has a hard requirement that the data in this table is produced with all records sorted by the market_time field.
Which line of Spark code will produce a Parquet table that meets these requirements?