What are the responsibilities and job description for the Director of Site Reliability Engineering position at Millennium Advisors, LLC?
Director of Site Reliability Engineering
This is an on-site role in Charlotte, NC.
About Us
Millennium Advisors is a leading technology-driven broker-dealer specializing in providing liquidity in fixed income securities, including corporate and municipal bonds, treasuries, fixed income ETFs and MBS. Millennium Advisors is headquartered in Charlotte, North Carolina, with offices in Chicago, London, and Singapore. We value an office where people like to work and like each other.
Founded in 2009, Millennium is consistently in the top ten dealers by trade count on most major fixed income electronic trading platforms. We have leveraged our superior trading technology and data modeling to create a high-margin, high-productivity business model. Our trading counterparties include institutional investors, asset managers, banks, registered investment advisors, wealth managers, insurance companies and other broker-dealers.
Role
We're looking for a Director of our Site Reliability Engineering practice. In this role, you will be the backbone of our pre-open and release workflows, fostering increased productivity across the code-writing portion of our organization. The mission is to maintain both stability and extensibility of our trading platform, ensure it is 24x7 reliable and incident-ready without adding bureaucracy.
Key Responsibilities
- Release Stewardship: oversee release and testing processes via Source Code Management, Containerization and CI/CD.
- Dev Ops Practice: oversee the Orchestration and Dynamic Container Management platforms.
- Tooling: Ensure developers, Quants and Traders have the right tools to test, taste and release correct code and associated data model changes. (Possibly: Coordinate system meta-data with internal LLMs/RAG AI tooling for end-user discoverability.)
- Facilitate support and regional hand-offs: Ensure support has necessary release notes, monitoring instructions, and can execute rollbacks and escalate via a “run book”.
- Incident Coordination: manage processes for capturing production incidents, determine which require root cause analyses. Leverage metrics and ticketing system to improve prioritization and coordination.
- Test Infrastructure Enhancements: In partnership with quant and dev teams, retrofit regression tests into automated testing. Extend system monitoring to include UAT reliability.
- Advise on platform growth: work with our system and network administrators and senior management to invest in and harden our compute resources.
Typical Day
- Morning: Collect and organize overnight incidents from Singapore and London.
- Daytime: Work with teams to test and validate release candidates. Implement scripts, test data infrastructure and other automation tooling. Provide system engineering advice such as memory usage, I/O bandwidth constraints and logging to empower developers and quants to write supportable and scalable code. Work with our project manager to ensure support and engineering work is accounted for in project planning.
- The end of Charlotte day corresponds to pre-market open for Asia. The day’s code and data changes should be tested and staged. Oversee release and support hand-off, with emphasis on CI automation.
Qualifications
- BS or MS in Computer Science, Math, Engineering or similar discipline.
- 7 years’ experience with supporting delivery of componentized software into a platform with both real-time and batch aspects.
- CI/CD tooling such as git, Jenkins, Jira.
- Dynamic Container Management platforms such as Kubernetes.
- Orchestration platforms such as Airflow.
- Notebook management platforms such as JupyterLab.
- Proficiency with Linux and networking.
- Observability frameworks such as ELK (Elastic, Logstash and Kibana.)
- Working knowledge of a scripting language such as bash, Python, etc…
- Working knowledge of SQL/S3, managing large datasets with some proficiency in tuning queries.
Preferred Skills
- Knowledge of front and middle office trading and risk systems.
- Experience working with quant and other teams deploying into production.
- A technical blog or other relevant publications.
Compensation and Benefits
In return for a long-term commitment, Millennium Advisors offers a highly competitive compensation package that includes base salary commensurate with experience, and incentive compensation, which could include equity in the firm. We have a comprehensive benefits package that includes:
- Daily breakfast (Charlotte) and lunch provided.
- Monthly massages (Charlotte.)
- Paid time off and various leave options.
- Company-sponsored medical and dental plans.
- 401(k) (US) / Pension plan (UK) with company matching contributions.
Culture
At Millennium, you’ll join a team of intelligent, ambitious, and positive professionals in an environment that fosters collaboration and innovation. Our Charlotte office features amenities such as pool and ping pong tables, music, and comfortable spaces for meetings and team activities. We maintain a casual dress code and a strong focus on building meaningful relationships among team members.
We are committed to building an outstanding small company culture, where people like to work and like each other, where they can be challenged intellectually, exercise their creative and entrepreneurial interests, and still have a life outside of work.
If you’re seeking to join a company focused on global expansion and are a forward-thinking individual with a passion for driving impactful innovation, we invite you to join us and shape the future of Millennium Advisors.
How to Apply
Interested candidates should submit their resume and a cover letter via this link: https://millenniumadvisors.recruitee.com/o/director-of-site-reliability-engineering