Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam Certified Data Engineer Professional topic 1 question 172 discussion

Actual exam question from Databricks's Certified Data Engineer Professional
Question #: 172
Topic #: 1
[All Certified Data Engineer Professional Questions]

The data engineer is using Spark's MEMORY_ONLY storage level.

Which indicators should the data engineer look for in the Spark UI's Storage tab to signal that a cached table is not performing optimally?

  • A. On Heap Memory Usage is within 75% of Off Heap Memory Usage
  • B. The RDD Block Name includes the “*” annotation signaling a failure to cache
  • C. Size on Disk is > 0
  • D. The number of Cached Partitions > the number of Spark Partitions
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
03355a2
1 week, 3 days ago
Selected Answer: C
It's simple, if MEMORY_ONLY is used, anything spilled to disk would indicate a problem.
upvoted 1 times
03355a2
1 week, 3 days ago
The RDD answer is incorrect for this question due to the fact that while this indicates a failure to cache, it is more specific to identifying individual blocks that failed to cache rather than providing a general signal of a suboptimal performance for the entire cached table.
upvoted 1 times
...
...
hpkr
3 weeks, 4 days ago
Selected Answer: C
C is correct here
upvoted 1 times
...
Freyr
1 month ago
Selected Answer: B
Correct Answer: B Option B, is the most correct and relevant choice for an indicator that a cached table is not performing optimally in a MEMORY_ONLY scenario. If an RDD block includes a "?" annotation, it strongly suggests issues with caching, which would directly impact the performance and expected behavior of MEMORY_ONLY caching. This indication points to a failure to cache the data entirely in memory, which is what MEMORY_ONLY intends to do. Option C, could also be a relevant indicator in general caching scenarios (e.g., MEMORY_AND_DISK), but it contradicts the MEMORY_ONLY setting directly. Therefore, Option B is chosen based on the specific storage level described.
upvoted 1 times
Freyr
3 weeks, 6 days ago
*THE CORRECT ANSWER IS: C* PLEASE IGNORE MY PREVIOUS ANSWER. Long story short, B is correct in the context of non-functional requirement, but the question is based in functional requirement, and sorry for the confusion.
upvoted 1 times
...
...
imatheushenrique
1 month, 1 week ago
B. This annotation says that some partitions of the cached data have been spilled to disk because there wasn't enough memory to keep them.
upvoted 1 times
...
MDWPartners
1 month, 1 week ago
I would say C
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
ex Want to SAVE BIG on Certification Exam Prep?
close
ex Unlock All Exams with ExamTopics Pro 75% Off
  • arrow Choose From 1000+ Exams
  • arrow Access to 10 Exams per Month
  • arrow PDF Format Available
  • arrow Inline Discussions
  • arrow No Captcha/Robot Checks
Limited Time Offer
Ends in