ExodusPoint Capital, founded in 2017 by Michael Gelband, began managing investor capital in 2018. The firm employs a global multi-strategy investment approach, seeking to deliver compelling asymmetric returns by combining complementary liquid strategies managed by experienced investment professionals within a robust risk framework. ExodusPoint brings together an accomplished team with hands-on experience running multi-manager businesses to create an institutional investment management firm.
Job description
ExodusPoint is seeking a motivated individual to join our global Site Reliability Engineering (SRE) team of six, split between the US and UK. As a Junior SRE, you will collaborate with experienced engineers to support, automate, and optimize our infrastructure stack. This role offers an exciting opportunity to work closely with both our development/business user base and our infrastructure teams—making you a key liaison in ensuring reliability and smooth operations across the organization.
Responsibilities
- Infrastructure Liaison & DevOps Support
- Serve as a bridge between development/business teams and infrastructure teams.
- Collect and translate requirements from diverse stakeholders into actionable solutions.
- Assist in designing and building CI/CD pipelines, then hand them over to user teams.
- Platform & Tooling Management
- Contribute to the deployment and maintenance of technologies like Kafka, Kubernetes, GitLab, and Airflow.
- Explore and implement various DevOps tools to streamline and optimize workflows.
- Collaborate on integrating new and existing systems to ensure smooth interoperability.
- Monitoring & Observability
- Support the monitoring infrastructure by setting up, automating, and managing monitoring tools.
- Onboard development teams onto our observability platform, enabling them to easily track and respond to system metrics and alerts.
- Collaborate with teams to fine-tune dashboards and alerting rules for effective and proactive monitoring.
- Automation & Reliability Engineering
- Help drive reliability and scalability by automating repetitive tasks and integrating self-service capabilities for the user base.
- Work with senior engineers to implement best practices for container orchestration (Kubernetes) and data streaming (Kafka).
- Troubleshoot system issues and propose long-term fixes to enhance performance.
- Collaboration & Continuous Improvement
- Interact with a wide array of stakeholders, from highly technical engineers to non-technical business users.
- Participate in team knowledge sharing to increase understanding of SRE best practices.
- Regularly evaluate existing processes and systems, suggesting innovative improvements
Qualifications
- Basic Technical Foundation: Familiarity with Linux environments, containerization concepts (Kubernetes), and DevOps practices.
- Eager to Learn: Strong desire to acquire new skills in areas like Kafka, CI/CD tools (GitLab or similar), and workflow automation (Airflow).
- Problem-Solving Mindset: Curiosity and perseverance in tackling technical challenges, paired with the resourcefulness to find solutions.
- Team Player: Excellent communication and collaboration skills, with a willingness to help—and learn from—others.
- Adaptability & Ownership: A proactive attitude toward stepping into new tasks and taking responsibility for outcomes.
Why join us
- Hands-On Experience: Work on real, production-level infrastructure, gaining invaluable experience in SRE best practices.
- Mentorship & Growth: Be part of a small, supportive team where you’ll receive guidance and have room to grow your technical expertise.
- Cutting-Edge Technologies: Get exposed to a wide range of modern tools—from Kubernetes and Kafka to Airflow—essential in today’s tech landscape.
- Global Collaboration: Engage with colleagues across different time zones and backgrounds, enhancing both technical and soft skills.
- Impactful Work: Help shape our monitoring, DevOps tooling, and reliability processes, directly influencing organizational success.