ExodusPoint Capital, founded in 2017 by Michael Gelband, began managing investor capital in 2018. The firm employs a global multi-strategy investment approach, seeking to deliver compelling asymmetric returns by combining complementary liquid strategies managed by experienced investment professionals within a robust risk framework. ExodusPoint brings together an accomplished team with hands-on experience running multi-manager businesses to create an institutional investment management firm.

    Job description

    ExodusPoint is seeking a motivated individual to join our global Site Reliability Engineering (SRE) team of six, split between the US and UK. As a Junior SRE, you will collaborate with experienced engineers to support, automate, and optimize our infrastructure stack. This role offers an exciting opportunity to work closely with both our development/business user base and our infrastructure teams—making you a key liaison in ensuring reliability and smooth operations across the organization.

    Responsibilities

    • Infrastructure Liaison & DevOps Support
        • Serve as a bridge between development/business teams and infrastructure teams.
        • Collect and translate requirements from diverse stakeholders into actionable solutions.
        • Assist in designing and building CI/CD pipelines, then hand them over to user teams.
    • Platform & Tooling Management
        • Contribute to the deployment and maintenance of technologies like Kafka, Kubernetes, GitLab, and Airflow.
        • Explore and implement various DevOps tools to streamline and optimize workflows.
        • Collaborate on integrating new and existing systems to ensure smooth interoperability.
    • Monitoring & Observability
        • Support the monitoring infrastructure by setting up, automating, and managing monitoring tools.
        • Onboard development teams onto our observability platform, enabling them to easily track and respond to system metrics and alerts.
        • Collaborate with teams to fine-tune dashboards and alerting rules for effective and proactive monitoring.
    • Automation & Reliability Engineering
        • Help drive reliability and scalability by automating repetitive tasks and integrating self-service capabilities for the user base.
        • Work with senior engineers to implement best practices for container orchestration (Kubernetes) and data streaming (Kafka).
        • Troubleshoot system issues and propose long-term fixes to enhance performance.
    • Collaboration & Continuous Improvement
        • Interact with a wide array of stakeholders, from highly technical engineers to non-technical business users.
        • Participate in team knowledge sharing to increase understanding of SRE best practices.
        • Regularly evaluate existing processes and systems, suggesting innovative improvements.

    Qualifications

    • Basic Technical Foundation: Familiarity with Linux environments, containerization concepts (Kubernetes), and DevOps practices.
    • Eager to Learn: Strong desire to acquire new skills in areas like Kafka, CI/CD tools (GitLab or similar), and workflow automation (Airflow).
    • Problem-Solving Mindset: Curiosity and perseverance in tackling technical challenges, paired with the resourcefulness to find solutions.
    • Team Player: Excellent communication and collaboration skills, with a willingness to help—and learn from—others.
    • Adaptability & Ownership: A proactive attitude toward stepping into new tasks and taking responsibility for outcomes.

    Why join us

    • Hands-On Experience: Work on real, production-level infrastructure, gaining invaluable experience in SRE best practices.
    • Mentorship & Growth: Be part of a small, supportive team where you’ll receive guidance and have room to grow your technical expertise.
    • Cutting-Edge Technologies: Gain exposure to a wide range of modern tools, from Kubernetes and Kafka to Airflow, that are essential in today’s tech landscape.
    • Global Collaboration: Engage with colleagues across different time zones and backgrounds, enhancing both technical and soft skills.
    • Impactful Work: Help shape our monitoring, DevOps tooling, and reliability processes, directly influencing organizational success.