Site Reliability Engineer Job at Altimetrik, Austin, TX

U01kQWhvVEp0Y0RrNU95YzdVZ2RwbWxVZHc9PQ==
  • Altimetrik
  • Austin, TX

Job Description

Location: Austin, Tx or Fort Mill, SC

Key Responsibilities

  • Design, develop, and maintain ELK Stack solutions to ensure efficient log management, monitoring, and search capabilities.
  • Implement, optimize, and troubleshoot data pipelines for telemetry, analytics, and observability using Logstash, Beats, Kafka, or other ETL tools.
  • Customize Elasticsearch indexing, queries, and storage solutions to enhance system performance and scalability.
  • Develop dashboards, visualizations, and alerting mechanisms in Kibana and other monitoring tools to improve system observability.
  • Integrate ELK solutions with cloud environments (AWS, Azure, or GCP) and implement security best practices for data storage and access.
  • Monitor and optimize system performance, resource utilization, and search efficiency to maintain high availability and reliability.
  • Collaborate with DevOps, Security, and Software Engineering teams to enhance log processing, alerting, and data enrichment strategies.
  • Automate deployments and configurations using tools like Ansible, Terraform, Kubernetes, and CI/CD pipelines.
  • Stay updated with the latest ELK Stack developments and industry trends to implement best practices and new features.

Required Skills and Qualifications

  • 6-10 years of hands-on experience with Elasticsearch, Logstash, Kibana (ELK Stack) in enterprise environments.
  • Strong knowledge of log aggregation, indexing, and data parsing techniques.
  • Proficiency in scripting and automation using Python, Bash, or Groovy .
  • Experience with observability platforms and telemetry solutions, including Dynatrace .
  • Knowledge of distributed systems, clustering, and high-availability architectures .
  • Experience in tuning and scaling Elasticsearch clusters for performance optimization.
  • Hands-on experience with Kafka, Fluentd, Prometheus, Grafana, OpenSearch (preferred).
  • Strong background in Linux systems, networking, and security .
  • Familiarity with CI/CD pipelines, Git, Kubernetes, and containerization (Docker) .
  • Experience with cloud services like AWS OpenSearch, Azure Elastic Stack, or Google Cloud Logging .
  • Strong problem-solving skills and the ability to thrive in fast-paced environments .

Job Tags

Similar Jobs

Dane Street

Diagnostic Radiology- Medical Record Reviewer Job at Dane Street

 ...Job Title: Physician Reviewer/Advisor For Independent Medical Exams (IME) As Physician Reviewer/Advisor for Independent Medical Exams (IME), you will utilize clinical expertise and reviews insurance appeals, and prospective and retrospective claims. The Physician Reviewer... 

Memphis Staffing

Warehouse Worker (Part Time Flex, 2nd Shift) Job at Memphis Staffing

 ...Anticipated hourly range: $20.08 per hour - $21.40 per hour based on experience (includes shift differential). Benefits: Paid time off in...  ...in the facility includes order picker (cherry picker), forklift, reach truck, turret truck, pallet jack, and walkie rider.... 

Providence Health and Services

Telemetry Technician Job at Providence Health and Services

 ...Telemetry Technician monitors and reports ECG cardiac rhythms under nurse supervision, serving as a critical communication link in patient care. The role requires medical terminology knowledge, ECG training, and BLS certification. This position is situated within a respected... 

PwC

Customs & International Trade Tax - Senior Associate Job at PwC

 ...& International Trade Industry/Sector: Not Applicable Time Type: Full time Travel Requirements: Up to 20% At PwC, our people in tax services focus on providing advice and guidance to clients on tax planning, compliance, and strategy. These individuals help... 

ABM Industries

Valet Parking Attendant Job at ABM Industries

 ...basic customer and team member inquiries. Monitor illegal parking, and immediately store vehicles. Interact with guests in a...  ...guest keys Predict and communicate traffic flow peaks and lot issues Identify and arrange for extra support for extended...