Technical Lead / Architect – Distributed Infrastructure Platform

Build the System That Challenges the System.
WEKA is building one of the most advanced distributed data platforms in the world - powering the next generation of AI and accelerated compute environments.
As AI infrastructure scales, traditional systems collapse under complexity. WEKA’s NeuralMesh™ architecture was built differently: adaptive, massively scalable, and engineered to thrive under extreme performance demands.
We are now building a next-generation engineering platform designed to simulate, stress, and intelligently challenge distributed systems at scale — long before production does.
This is not a traditional testing role.
This is a deep infrastructure engineering challenge at the intersection of distributed systems, large-scale architectures, reliability engineering, and AI-driven infrastructure.
The Role
We are looking for a senior hands-on technology leader - someone who can operate as a Tech Lead, Architect, or Team Lead - to lead the technical direction of a small elite engineering team building this platform from the ground up.
You should come from a strong backend/infrastructure/distributed systems background and be passionate about understanding how complex systems behave under scale, concurrency, failures, and unpredictable real-world conditions.
You will architect systems that generate massive workloads, simulate production behaviors, inject failures, analyze system bottlenecks, and continuously evolve alongside the core product.
This role is highly technical and highly influential.
You’ll work closely with core architecture and infrastructure teams and help shape how one of the industry’s most advanced storage systems is validated, hardened, and scaled.
What You’ll Do
- Lead the architecture and development of a distributed infrastructure platform operating at massive scale
- Design systems that simulate real-world production environments, workloads, and failure scenarios
- Build intelligent infrastructure for stress testing, fault injection, concurrency analysis, and large-scale system behavior
- Drive technical leadership for a small, high-impact engineering team
- Stay deeply hands-on in system design and development
- Work closely with core storage, cloud, and infrastructure teams to identify architectural gaps and scalability challenges
- Build frameworks and tooling that help engineers understand, reproduce, and resolve complex distributed system behaviors
- Explore AI-driven approaches for infrastructure analysis, validation, and system optimization
What You Bring
- Strong experience building large-scale distributed systems or infrastructure platforms
- Background in storage systems, cloud infrastructure, networking, databases, runtime systems, or scalable backend architectures
- Deep understanding of concurrency, multithreading, distributed architectures, scalability, and system reliability
- Proven experience leading complex technical initiatives and mentoring engineers
- Strong software engineering skills in Python, Go, C++, Rust, or similar
- Ability to operate as a technical leader while remaining highly hands-on
- Passion for solving complex infrastructure and systems problems
Big Advantages
- Experience with storage systems or high-performance distributed environments
- Background in infrastructure reliability, performance engineering, or production-scale platform engineering
- Experience building internal engineering platforms or developer infrastructure
- Familiarity with simulation systems, fault injection, or observability tooling
- Interest in applying AI technologies to large-scale infrastructure systems
Why This Role Is Different
Most engineering roles build products.
This role builds the system that pushes the product to its limits.
You’ll solve some of the hardest infrastructure challenges in the company - working on deep technical problems involving scale, distributed behavior, performance, resilience, and intelligent system analysis.
If you love understanding how systems truly behave under pressure - this role was built for you.
You'll be redirected to
the company's application page