#вакансія
Але я навіть хз, тут таких напевно немає.
Walmart, Senior Principal SRE
PLEASE NOTE YOU DO NOT NEED TO HAVE EXPERIENCE IN ALL THE BELOW (BUT THE MORE THE BETTER)
Qualifications:
• Bachelor's Degree or Master’s Degree with 15+ years of experience in technology with at least one implementation experience Cloud based Platform engineering, DevOps/SRE Platform services from Development to Production at scale.
• 5+ years of experience in Messaging and Event Driven Architecture based on Kafka, Cassandra, Mysql – Data Platform
• 5+ years of Experience In multi-cloud (Azure, Google is preferred)
• Experience in End to End Incident Management framework running in prod with integration of Monitoring System, AIOps, Intelligent alerting, Auto remediation and Run book automation
• Experience in shared SRE/DevOps platform Design, Architecture, Implementation and Operations at Scale
• Experience in fully SRE Managed Platform Services Operability, Error Budget, SLO, SLA, SLI and Golden Signals
• Experience in driving cross functional Incident RCAs, Post-mortem, across full stack.
• Experience in Defining Service operations reporting, DORA Metrics
• Experience in Cloud Capacity Planning, Performance at Scale, Reliability modeling - Infra
• Experience in Resiliency/Chaos Engineering practices and tools - Infra
• Proficient in Java, Golang, Node, Scala, ansible, shell, java script and other scripting languages – developer platform – at least 2 languages and 2 scripts
• Familiarity with promQL, splunkQL and graph databases like Neo4j will be a bonus
• Experience in designing, coding, investigating, analysing, and troubleshooting large-scale enterprise systems.
• Methodical and systematic problem-solving approach, combined with a solid awareness of ownership, initiative, and drive.
• Fluency with running services at scale; In depth understanding of Unix systems internals and networking.
• Networking knowledge and in depth understanding of network concepts, such as different protocols (TCP/IP, UDP, ICMP, etc.), MAC addresses, IP packets, DNS, OSI layers, and load balancing).
• Understanding of Unix/Linux systems from kernel to shell and beyond, taking in system libraries, file systems, and client-server protocols along the way. Experience administering Linux systems in a production environment.
• Experience with distributed version control like Git or similar
• Experience with IaaS and PaaS providers such as AWS, AZURE OpenStack, GCP ( at least one cloud provider)
• Experience with containerisation and container platforms. (e.g., Docker, Kubernetes, Docker EE, OpenShift, Mesosphere). – Must for all
• Experience with enterprise monitoring solutions like AppDynamics, New Relic, Prometheus, Graphite, Grafana, Nagios, Sensu and Splunk, Yagger at scale ( Metrics: 200M metrics/day, Logs: 100 TB logs per day, Traces 200 TB traces per day)
• Familiarity with continuous integration/deployment processes and tools such as Jenkins, Maven, Nexus, etc.,
• Must have cloud solution architecture certifications – for X7 and X8
• Must have Excellent verbal, written communication skills