What are the responsibilities and job description for the Production Support Engineer/SRE position at CTC USA, LLC?
Responsibilities:
- Apply 3-4 years of experience in production engineering and site reliability engineering (SRE) to design, implement, and maintain highly available, scalable, and resilient systems.
- Own end-to-end operational responsibilities, including monitoring, incident response, root cause analysis, capacity planning, and automation, to ensure optimal system performance and reliability in production environments.
- Collaborate cross-functionally with development, QA, and infrastructure teams to streamline CI/CD pipelines, automate deployments, and enforce best practices for security, compliance, and disaster recovery.
- Use a broad set of tools and technologies to proactively detect, troubleshoot, and resolve production issues, minimizing downtime and improving performance against service-level objectives (SLOs) and service-level agreements (SLAs).
Requirements:
Snowflake Developer, Oracle, Python:
- Develop, maintain, and optimize data pipelines and workflows using Snowflake and Oracle databases to ensure reliable data availability in production.
- Write and optimize advanced SQL and PL/SQL queries and stored procedures for efficient data processing and transformation.
- Automate data ingestion, validation, and monitoring tasks using Python scripting and orchestration tools like Apache Airflow or Prefect.
- Monitor database health, query performance, and resource utilization using Snowflake Resource Monitors, Oracle Enterprise Manager, and cloud monitoring tools.
- Troubleshoot and resolve production incidents related to data inconsistencies, pipeline failures, or performance degradation.
- Implement security best practices including role-based access control, data masking, and encryption in Snowflake and Oracle environments.
- Collaborate with DevOps teams to integrate database changes into CI/CD pipelines using Git, Jenkins, or Azure DevOps.
- Perform root cause analysis for recurring issues and implement automation to reduce manual intervention.
- Manage cloud resource costs by tuning Snowflake warehouse sizes and Oracle instance configurations.
- Document operational procedures, runbooks, and system architecture for knowledge sharing and compliance.
Java, JavaScript, Cloud-based Microservices, Spring Boot, AWS:
- Build, deploy, and maintain cloud-native microservices using Java, Spring Boot, and JavaScript frameworks, ensuring high availability and scalability.
- Design and implement RESTful APIs and event-driven architectures using AWS services such as Lambda, ECS/EKS, SQS, and SNS.
- Develop and maintain CI/CD pipelines with Jenkins, GitLab CI, or AWS CodePipeline for automated testing and deployment.
- Monitor application and infrastructure health using AWS CloudWatch, Prometheus, Grafana, and distributed tracing tools like Jaeger or AWS X-Ray.
- Troubleshoot production issues, perform root cause analysis, and implement fixes to improve system reliability.
- Implement security controls including IAM roles, OAuth2, JWT, and encryption for data in transit and at rest.
- Collaborate with cross-functional teams to design fault-tolerant, resilient systems with automated failover and recovery.
- Optimize cloud resource usage and cost through rightsizing and autoscaling configurations.
- Automate operational tasks and incident response using scripting and infrastructure as code (Terraform, CloudFormation).
- Maintain detailed documentation of system architecture, deployment processes, and operational runbooks.
SAP Finance and Accounting Techno-Functional:
- Provide production support and incident management for SAP FI/CO modules, ensuring minimal downtime and business continuity.
- Analyze and troubleshoot system issues related to configuration, custom code (ABAP), and interfaces with external systems.
- Use SAP Solution Manager for incident management, change requests, transport management, and deployments across development, QA, and production landscapes, ensuring smooth coordination and control of SAP system changes.
- Monitor batch jobs, system performance, and error logs using SAP CCMS and the ST22 transaction code.
- Automate routine operational tasks and workflows using SAP Business Workflow and background job scheduling.
- Collaborate with functional teams to validate system changes and support end-user issue resolution.
- Participate in SAP upgrades, patches, and integration projects, ensuring smooth transitions and minimal impact.
- Implement and maintain security roles, authorizations, and compliance controls within SAP.
- Document support procedures, configuration changes, and troubleshooting guides.
- Use monitoring and alerting tools to proactively detect and resolve production issues.
Salary: $55 - $60