Staff Scala Engineer I, Distributed Systems
- Türkiye
- Contract
- Full-time
- Be part of a team that builds and maintains foundational ML infrastructure tools for generating point-in-time-accurate training datasets, supporting various high-visibility Machine Learning teams at Etsy, including Search Ranking and Personalized Recommendations.
- Build high-scale Spark & Dataflow jobs that interact with feature systems and model training. This includes designing and architecting large-scale distributed systems and ML systems, and building distributed services with technologies such as Docker, Kubernetes, gRPC, and Spark.
- Develop and maintain distributed infrastructure with a deep understanding of batch distributed computation engines and columnar file storage optimization techniques to speed up feature retrieval and training.
- Coach and train junior team members.
- Actively work with leadership and product teams on the roadmap and new features.
- Lead and coordinate cross-team projects across the organization and actively promote best practices and industry standards to maintain the high quality of the product and services.
- Of course, this is just a sample of the kinds of work this role will require! You should assume that your role will encompass other tasks, too, and that your job duties and responsibilities may change from time to time at Etsy’s discretion, or as otherwise applicable under local law.
- Infrastructure background with 10+ years of industry experience (including at least 1-2 years as a tech lead) working on distributed systems, ML infrastructure, and other high-scale applications.
- You have a background in Computer Science or a related field, equivalent to a B.S. degree or above in Computer Science, Electrical Engineering, Information Technology, or a related engineering field.
- You understand that a core component of the Staff Engineering role is to be a force multiplier, leveling up the contributions and thinking of the engineers around you.
- You have practical experience in designing and architecting large-scale distributed systems and machine learning infrastructure.
- You demonstrate strong engineering and coding skills, with the ability to write high-performance production code. Proficiency in Scala and experience with highly concurrent frameworks (e.g. Cats Effect) are required.
- You possess strong proficiency in creating and optimizing large-scale Spark & Dataflow pipelines, as well as general data infrastructure technologies. This includes the ability to optimize storage formats and reduce job runtime & costs.
- You have experience building distributed services with technologies such as Docker, Kubernetes, gRPC, and Spark, and are familiar with concurrency frameworks and performance tuning for building low-latency applications.
- Experience with streaming data and its application to ML solutions is a plus.