Site Reliability Engineer (SRE) Job at Tensorwave, Inc., Las Vegas, NV

  • Tensorwave, Inc.
  • Las Vegas, NV

Job Description

About TensorWave Inc.

At TensorWave Inc., we're revolutionizing AI computing by offering the most advanced cloud services, highlighted by our deployment of AMD Instinct MI300X GPUs. Our mission is to accelerate AI innovation by removing hardware limitations and ensuring scalable, efficient solutions for AI workloads. To support our rapid growth, we're seeking a Site Reliability Engineer to join our team.

About the role

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team and play a critical role in ensuring the reliability, performance, and scalability of our corporate infrastructure. As an SRE, you will collaborate with development, operations, and infrastructure teams to design, implement, and maintain systems that are highly available, resilient, and efficient.

What you'll do

  • Incident Response: Lead incident response efforts, perform root cause analysis, and implement solutions to prevent future occurrences.
  • System Reliability: Design, implement, and maintain monitoring and alerting systems to proactively identify and resolve issues.
  • Automation: Develop and automate processes to improve operational efficiency and reduce manual tasks.
  • Capacity Planning: Analyze system performance and capacity, and make recommendations to optimize resource utilization.
  • Security: Collaborate with security teams to identify and mitigate security vulnerabilities.
  • Collaboration: Work closely with development teams to ensure that applications are designed for reliability and performance.
  • On-Call Rotation: Participate in on-call rotations to respond to incidents outside of regular business hours.

Essential Skills and Qualifications

  • Strong understanding of system administration and networking concepts.
  • Proficiency in scripting languages (Python, Bash, etc.) and configuration management tools (Ansible, Puppet, Chef).
  • Experience with cloud platforms (AWS, GCP, Azure).
  • Knowledge of containerization technologies (Docker, Kubernetes).
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack).
  • Strong problem-solving and troubleshooting skills.
  • Excellent communication and collaboration skills.

Preferred Skills and Qualifications

  • Experience with infrastructure as code (Terraform, CloudFormation).
  • Knowledge of CI/CD pipelines and automation tools (Jenkins, GitLab CI/CD).
  • Experience with databases (MySQL, PostgreSQL, MongoDB).
  • Certifications in relevant technologies (AWS, GCP, Kubernetes).

Benefits

We offer a competitive salary and benefits, including:

  • Stock Options
  • 100% paid Medical, Dental and Vision Benefits for employees
  • Life and voluntary supplemental life insurance
  • Short-term disability insurance
  • Flexible Spending Account
  • 401(k)
  • Flexible PTO
  • Paid Holidays
  • Parental Leave
  • Mental Health Benefits through Spring Health
