A: "They focused heavily on how I’ve implemented end-to-end data pipelines using Azure Data Factory and Databricks—especially how I handled data ingestion, transformation using PySpark, and loading into a Delta Lake. They wanted to see both my technical depth and how I troubleshoot production issues."
Data Engineer Interview Questions
Data Engineer Interview Questions
Data engineers are IT professionals who are needed in almost every industry. Data engineers monitor data trends to determine best next steps for companies. A critical part of a data engineer job is to process raw data into usable data by creating data pipelines and building data systems.
Top Data Engineer Interview Questions & How To Answer
Question #1: Can you describe in detail your level of expertise with programming languages?
Question #2: Explain data engineering in your own words.
Question #3: Can you describe your experience working with Apache Hadoop and cloud data management environments?
20,193 data engineer interview questions shared by candidates
Why do you use Cloud Functions in GCP?
absolute difference in SQL
Define inheritance and how it is applicable in your java project?
Shared in DescriptionQuestion1) If we have input.csv, we need to find the output. File and desired output are given below. username, mobile user1,999999991:888888882 user3,777777771 user2,777777234:823232351 user5,734452343:943433434:834323434 user1,999999991:9994433777 output user1:3 user2:2 user3:1 Question2) How can we read a csv file into dataframe Question3) Option to modify the encoding while reading a file in Scala Question 4) Optin to modify the timestamp while reading a file Question 5) How to introduce separators like "," while reading a file Question 6) How to infer Schema =============================== Question 7) How have below 2 tables, we need to find out users who visited a bank but didn't make any transactions? -- Visits table: -- +---------+------------+ -- | user_id | visit_date | -- +---------+------------+ -- | 1 | 2020-01-01 | -- | 2 | 2020-01-02 | -- | 12 | 2020-01-01 | -- | 19 | 2020-01-03 | -- | 1 | 2020-01-02 | -- | 2 | 2020-01-03 | -- | 1 | 2020-01-04 | -- | 7 | 2020-01-11 | -- | 9 | 2020-01-25 | -- | 8 | 2020-01-28 | -- +---------+------------+ -- Transactions table: -- +---------+------------------+--------+ -- | user_id | transaction_date | amount | -- +---------+------------------+--------+ -- | 1 | 2020-01-02 | 120 | -- | 2 | 2020-01-03 | 22 | -- | 7 | 2020-01-11 | 232 | -- | 1 | 2020-01-04 | 7 | -- | 9 | 2020-01-25 | 33 | -- | 9 | 2020-01-25 | 66 | -- | 8 | 2020-01-28 | 1 | -- | 9 | 2020-01-25 | 99 | -- +---------+------------------+--------+
Why Deloitte? Why consultancy? Why not other consultancy firms but Deloitte? What kind of projects did you work on before? What do you expect from the job?
Finding Top N within nested Categories
Implement a graph in List Format and perform DFS Traversal
Projects, Sorting Algorithms, database related question( sql and mysql)
Please tell me about your career to date and why are you applying for this position.
Viewing 1211 - 1220 interview questions