Distributed Control System Specialist
Job Posting: Distributed Systems Specialist – Synthetic Data Infrastructure
Location: Remote (Open to all time zones)
Type: Full-time, Contract (1 year with potential for extension)
Start Date: ASAP
Work Hours: 40 hours/week, Monday–Friday
Travel: None required
About the Role
We are looking for a Distributed Systems Specialist to join our Synthetic Data Infrastructure team. This role is ideal for a generalist engineer with deep expertise in distributed systems who thrives in a fast-paced, collaborative environment. You will be instrumental in designing, developing, and optimizing large-scale data generation pipelines that support machine learning workflows.
- Key Responsibilities
- Design and optimize data processing pipelines using DAGs/Graphs and distributed workers/jobs.
- Develop scalable solutions to handle large fan-out loads.
- Collaborate with cross-functional teams to identify and prioritize system improvements.
- Work closely with data scientists and engineers to integrate and scale solutions.
- Continuously identify and implement performance optimizations to reduce resource utilization.
- Contribute to scheduling systems and support pipeline submissions.
- Participate in on-call rotations and provide support for production systems.
- Required Qualifications
- 5+ years of experience in software development, with a strong focus on distributed systems.
- Proven experience with large fan-out loads and distributed job execution.
- Proficiency in at least one of the following languages:
- Python (preferred)
- Rust
- C++
- Hack (PHP)
- Experience with cloud infrastructure (AWS or GCP).
- Strong problem-solving skills and ability to work independently.
- Excellent communication and collaboration skills.
- Bachelor’s degree in a technical field.
- Preferred Qualifications
- Experience running compute clusters or building data infrastructure.
- Background in companies operating at large data scales.
- Familiarity with Digital Content Creation (DCC) tools like Blender.
- Experience with machine learning frameworks such as TensorFlow or PyTorch.
What Makes This Role Exciting
This is a unique opportunity to work on cutting-edge infrastructure that powers large-scale compute workloads. You’ll take ownership of critical systems, collaborate with a talented team, and make a tangible impact on performance, scalability, and reliability. If you enjoy solving complex problems and optimizing systems at scale, this role is for you.
- Interview Process
- 2 Rounds Total:
- 1 Technical Coding Interview (45 minutes)
- 1 Behavioral Interview (30 minutes)
Apply Job!
Apply to this Job