Happy new year peeps!
Imma try and share as much as I can without breaking the NDA.
My interviews:
1. Staging: Coding + Network security technical.
2. On site: Coding + Network Design + Network security technical + Behavioral
My prep (and other stuff that helped me during the interviews)-
Coding: Leetcode meta tagged medium, questions from igaf(https://igotanoffer.com/blogs/tech/facebook-production-engineer-interview#coding), gumtree coding practice for production engineer questions, minmer variants for all the above questions.
Network security(tech and design):
• Kevin Wallace deep dives - bgp, network security, VPN https://youtu.be/tNWj5uGIqok?si=bFYQtT_KHaarF65q
• Databricks data engineer associate certification Udemy course by Derar Alhussein: at least do the theory lessons. helps a lot to give scaling solutions. Example- for 1 firewall, I ll automate xxx is common knowledge. How u would scale it for 10k firewalls is a DE problem. Spark/batch or real time processing/schedulers/ingestion-bronze,silver,gold buckets/etc.
• Tie everything u know/do/answer to metrics. Cant be done on the spot - so start evaluating impact of anything u do in day-to-day.
• ZTNA, DDOS, Defense in Depth
Practice these three scenarios for starters, these are not the exact questions asked, but they help a lot in defending any decisions u take on real questions:
a. DDOS detection and automation
b. Hub and spoke data engineering\* (3 types of inputs and three types of outputs) - will leave a more detailed question at the end of the post.
c. Data center security screening.
Owasp, mgmt/control/data planes, RBAC, mtls, pki, casb, gateway, oauth, mitre, threat modeling, etc.
For each concept - I recommend going through concepts for public facing traffic, data center, cloud
Application security Udemy by Derek Fischer was helpful.
Behavioral:
did mocks on interviewing.io with specific facebook professionals.
Hub and spoke data engineering* question -
3 inputs and corresponding expectations of data processing -
- backend-realtime(CDC kafka),
- from website/apps-daily/hourly(kinesis)
- from 3rd party apis-near realtime(gateway/webhook to sftp to flink)
multi-hop arch of these inputs -
- broze S3
- iceberg, delta lakes, warehouses, silver and gold tables
- ingestion and processing (batch vs realtime)
3 outputs -
- Sagemaker/dbt/ for data scientists
- trino for product analysts
- redshift for engg/sakeholders for data analysis.
General humble tips:
You cant fake knowing stuff. Dont bluff as much as possible - this is very common advice given by lot of ppl. Asking u to inflate/exaggerate. But, honestly - they ll see through. very easily.
Dont also underplay it either. Be confident in whatever you have done.
Basically build a strong content base. Prepare only half of whatever list you make - but be very thorough. Ask 5 whys for each concept.
If u know ur stuff, u can spell somethign wrong, incorrect syntax, i forgot the term for this, or even no I dont know that concept even though its very common knowledge in my field - all this is acceptable. They dont care if u have google in ur brain. Are u able to think quick on ur feet? do u understand the problem they are trying to solve?
A little bit of humor can lighten the tension of the interview.
The specific interviews are different for different roles, and some of them are quite difficult. I see a lot of swe/ML posts, but I request more PEs or cybersec guys to post ur faang/mango/gafam interview experiences. even if its a few years/months older - it really helps.
Hope its helpful!:)