Data Interview Questions

132,688 data interview questions shared by candidates

business sense: There are two types of cars A and B. The number of people in US who use A and B are the same. They drive the same distances each month. Now there are two new technologies, X and Y (of equal cost). If apply X, mpg of A would increase from 50 mpg to 75 mpg; If apply Y, mpg of B would increase from 10 mpg to 11 mpg. The goal is to decrease the dependence on foreign oil, or to decrease the consumption of gasoline. Question: which technology would you apply? Follow up question: after applying the technology of your choice, assume there's money available for research on new technology, which car would you choose to conduct research on?
avatar

Data Scientist

Interviewed at Meta

3.6
Jan 7, 2015

business sense: There are two types of cars A and B. The number of people in US who use A and B are the same. They drive the same distances each month. Now there are two new technologies, X and Y (of equal cost). If apply X, mpg of A would increase from 50 mpg to 75 mpg; If apply Y, mpg of B would increase from 10 mpg to 11 mpg. The goal is to decrease the dependence on foreign oil, or to decrease the consumption of gasoline. Question: which technology would you apply? Follow up question: after applying the technology of your choice, assume there's money available for research on new technology, which car would you choose to conduct research on?

In python code, given a json object with nested objects, write a function that flattens all the objects to a single key value dictionary. Do not use the lib that actually performs this function. { a:{b:c,d:e} } becomes {a_b:c, a_d:e} ( not, a:"b:c,d:e" }
avatar

Data Engineer

Interviewed at Amazon

3.5
Apr 29, 2020

In python code, given a json object with nested objects, write a function that flattens all the objects to a single key value dictionary. Do not use the lib that actually performs this function. { a:{b:c,d:e} } becomes {a_b:c, a_d:e} ( not, a:"b:c,d:e" }

Given three tables, user_dimension (user_id and account_id), account_dimension (account_id and paying_customer), and download_facts (date, user_id, and downloads), find the average number of downloads for free vs paying customers broken out by day.
avatar

Data Analyst

Interviewed at Uber

3.7
Jun 2, 2017

Given three tables, user_dimension (user_id and account_id), account_dimension (account_id and paying_customer), and download_facts (date, user_id, and downloads), find the average number of downloads for free vs paying customers broken out by day.

SQL Select the value of a column based on the max of a different column from each grouping of yet a third column. Column A, Column B, Column C. For each group based on Column A, give value of Column B, where Column C is max for that group.
avatar

Data Engineer

Interviewed at Amazon

3.5
Apr 29, 2020

SQL Select the value of a column based on the max of a different column from each grouping of yet a third column. Column A, Column B, Column C. For each group based on Column A, give value of Column B, where Column C is max for that group.

SQL part1 Question # 1 : you have the following table - User_id , action (post, edit, cancel) , date == > what is the post rate Question # 2 : you have also the following table (active users) - User_id , country , active , date == > What is the average post rate for yesterday for all active users by country
avatar

Data Scientist

Interviewed at Meta

3.6
Aug 7, 2019

SQL part1 Question # 1 : you have the following table - User_id , action (post, edit, cancel) , date == > what is the post rate Question # 2 : you have also the following table (active users) - User_id , country , active , date == > What is the average post rate for yesterday for all active users by country

Viewing 81 - 90 interview questions

Glassdoor has 132,688 interview questions and reports from Data interviews. Prepare for your interview. Get hired. Love your job.