recruiter > tech screen on coderpad with advanced sql and some python data manipulation. the onsite was a 5 round loop: data modeling, pipeline design, python coding, advanced sql, and a googleyness/behavioral chat. the pipeline design round is the absolute filter imo - they throw a massive ques at you, like designing a real-time analytics pipeline for ad-click tracking system, and you have to map out the batch vs streaming architecture, idempotency, and how to handle late-arriving data from scratch. the modeling round was all about star schemas, handling slowly changing dimensions (SCDs), and optimizing tables. it felt very practical; it’s more about how you structure data at a massive scale and handle edge cases. for prep, i focused on kimball methodology and practicing streaming architectures. i did a few mocks on Prepfully with google DEs specifically for the pipeline design round and that helped me lot. Skimmed through DataLemur for ques and practice and yeah ofcourse blind/reddit for recent experiences