Interviewed with the Lead Engineer on Spark, although the recruiter said it was on Python. Not a big deal.
Was asked to perform a group by aggregation. I did a groupby and sum, then joined the results. Was then asked if I could do it without a join. I accomplished this with a sum over a partition. They inquired if there was another way. I added a withColumn statement and placed the groupby statement as the second argument. The interviewer mentioned it was starting to look good, although that's not possible in PySpark.
The recruiter later let me know I failed the tech screen.
What is an inner vs. left join?
How would you deduplicate a table?
The following metrics were computed from 3 interview experiences for the Disney Senior Data Engineer role in United States.
Disney's interview process for their Senior Data Engineer roles in the United States is extremely selective, failing the vast majority of engineers.
Candidates reported having very negative feelings for Disney's Senior Data Engineer interview process in United States.