The Senior Site Reliability Engineer plays a crucial role in enhancing the performance and reliability of applications within the Disney organization. This position involves planning, architecting solutions, implementing automation, and mentoring junior engineers while utilizing cloud-native and open-source tools. The role requires collaboration across teams to develop scalable and efficient systems, focusing on continuous integration and service delivery improvements.
Job Summary:
Our Performance and Reliability teams lead the improvement, optimization, and availability of applications across the Disney organization and its business units. We take a consultative approach to Reliability Engineering: supporting, educating, mentoring, and delivering automation that fosters performance and resiliency as best practice.
The Senior Site Reliability Engineer is a key member of our Performance and Reliability embedded teams. We focus on planning, scoping, solution architecting, software design, and implementation based on functional and performance capability requirements. We leverage cloud-native, commercial, and open-source tools and frameworks to solve complex business needs. These solutions touch a wide range of functional areas. This role will collaborate with Software Engineers, Product Owners, and others across teams and business areas to influence solutions and platforms across the organization.
Responsibilities:
• Builds solutions, successfully deployed to customers, for problems of sizable scope and complexity.
• Champions Infrastructure as Code (IaC); provides thought leadership; establishes enterprise-level infrastructure patterns.
• Builds and enhances Continuous Integration and Delivery (CI/CD) pipelines.
• Regularly reviews existing systems, policies, and practices, and identifies solutions that improve service delivery efficiency and enhance the current environment.
• Mentors less experienced engineers. Collaborates with product engineering leaders to find innovative solutions for moderately complex problems.
• Writes code that establishes and enhances frameworks, typically for software programs and systems that have little or no precedent.
• Reviews code for design, testability, and usability.
• Develops specifications for assigned components, projects or fixes.
• Builds solutions that scale and perform.
• Participates in project proposals, architecture, and design. Contributes to the architecture design and implementation of assigned projects and may lead the effort.
• Oversees technical maintenance. Performs troubleshooting for systems that tend to be large and highly complex.
• Contributes to design, development, documentation, and/or testing.
• Applies experience to resolve a variety of complex issues.
• Makes decisions and takes actions that regularly have a moderate influence on the work of team members, other teams, or assigned projects.
• Identifies problems and opportunities and recommends the development of solutions.
• Serves as a high-level technical resource and “go-to” person for less experienced software engineers.
Basic Qualifications:
• Bachelor's degree in computer science or related field, or equivalent training or work experience.
• 5+ years of experience in the Reliability Engineering field.
• Well-versed in Reliability Engineering principles, patterns, and best practices.
• Ability to understand the business domain from both a technical and product viewpoint.
• 5+ years of experience working with AWS Cloud Infrastructure and Resources.
• 5+ years of experience designing and implementing automation tools.
• 5+ years of experience running and monitoring large-scale distributed systems.
• Proficiency in Python and/or another programming language.
• Well-versed in modern infrastructure services and concepts such as containerization, distributed systems, and microservices.
• Well-versed in Software Engineering principles and patterns.
• Experience working with globally distributed teams.
• Experience as a coach and mentor within a business environment.
• Experience working within an Agile environment.
#DisneyTECH
Site Reliability Engineer, SRE, Reliability Engineering, Infrastructure as Code, Cloud Infrastructure, CI/CD, Automation, Distributed Systems, Python, Software Development