What are the responsibilities and job description for the Production Engineer/ Site Reliability Engineer position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, AVA Consulting, is seeking the following. Apply via Dice today!
AVA Consulting is seeking a Production Engineer/Site Reliability Engineer.
Location: Plano, TX
U.S. Citizens and those authorized to work in the U.S. are encouraged to apply. We are unable to sponsor at this time.
Company Background: Our client, a major employer in the area, is looking for a Production Engineer to be part of its team in its North American operations.
Responsibilities:
- Own end-to-end operational responsibilities, including monitoring, incident response, root cause analysis, capacity planning, and automation, to ensure optimal system performance and reliability in production environments.
- Collaborate cross-functionally with development, QA, and infrastructure teams to streamline CI/CD pipelines, automate deployments, and enforce best practices for security, compliance, and disaster recovery.
- Use a broad set of tools and technologies to proactively detect, troubleshoot, and resolve production issues, minimizing downtime and improving adherence to service-level objectives (SLOs) and service-level agreements (SLAs).
- Apply 3-4 years of experience in production engineering and site reliability engineering (SRE) to design, implement, and maintain highly available, scalable, and resilient systems.
- Develop, maintain, and optimize data pipelines and workflows using Snowflake and Oracle databases to ensure reliable data availability in production.
- Write and optimize advanced SQL and PL/SQL queries and stored procedures for efficient data processing and transformation.
- Automate data ingestion, validation, and monitoring tasks using Python scripting and orchestration tools like Apache Airflow or Prefect.
- Monitor database health, query performance, and resource utilization using Snowflake Resource Monitors, Oracle Enterprise Manager, and cloud monitoring tools.
- Implement security best practices including role-based access control, data masking, and encryption in Snowflake and Oracle environments.
- Collaborate with DevOps teams to integrate database changes into CI/CD pipelines using Git, Jenkins, or Azure DevOps.
- Manage cloud resource costs by tuning Snowflake warehouse sizes and Oracle instance configurations.
- Build, deploy, and maintain cloud-native microservices using Java, Spring Boot, and JavaScript frameworks, ensuring high availability and scalability.
- Design and implement RESTful APIs and event-driven architectures using AWS services such as Lambda, ECS/EKS, SQS, and SNS.
- Develop and maintain CI/CD pipelines with Jenkins, GitLab CI, or AWS CodePipeline for automated testing and deployment.
- Monitor application and infrastructure health using AWS CloudWatch, Prometheus, Grafana, and distributed tracing tools like Jaeger or AWS X-Ray.
- Troubleshoot production issues, perform root cause analysis, and implement fixes to improve system reliability.
- Implement security controls including IAM roles, OAuth2, JWT, and encryption for data in transit and at rest.
- Provide production support and incident management for SAP FI/CO modules, ensuring minimal downtime and business continuity.
- Analyze and troubleshoot system issues related to configuration, custom code (ABAP), and interfaces with external systems.
- Use SAP Solution Manager for incident management, change requests, transport management, and deployments across development, QA, and production landscapes, ensuring smooth coordination and control of SAP system changes.
- Monitor batch jobs, system performance, and error logs using SAP CCMS and transaction ST22 (ABAP dump analysis).
- Collaborate with functional teams to validate system changes and support end-user issue resolution.
Ron Tolson
AVA Consulting