Site Reliability Engineer, Business Services
The Business Services Site Reliability Engineer will assist in maintaining the operational aspects of Limelight Networks' platforms and act as an escalation point for troubleshooting of systems and business services issues.
Working with development and operations teams, this engineer will use their experience in systems and related software to ensure the reliability of platforms under the Business Services umbrella.
This will include developing procedures for deployment and day-to-day maintenance of these platforms, ensuring customer satisfaction, and ensuring that operations is able to support the products.
In this role, the Business Services Site Reliability Engineer will be responsible for developing and reporting on KPIs that prove the reliability of the environment.
They will need to identify opportunities for improvement in the products and track progress toward those improvements. The Business Services SRE is the operational owner of the products they are responsible for.
Develop and enforce procedures for maintaining and changing products.
Develop and report on KPIs that are relevant to the success of products.
Coordinate with development and operations teams to ensure the reliability of products.
Identify gaps in the operation of products and services, and drive enhancements.
Continually evaluate release processes and tools to find areas for improvement.
Contribute to the release and change management process by collaborating with the developers and other operations groups.
Actively participate in development meetings, implement required changes to the operational architecture, standards, processes, or procedures, and ensure they are in place prior to release (e.g., monitoring, documentation, metrics).
Work in a fast-paced, collaborative environment while providing exceptional visibility to management and end-to-end ownership of incidents, projects, and tasks.
Maintain a positive demeanor and a high level of professionalism at all times.
Maintain prescribed levels of security and enforce security policies within the products.
Act as a point of escalation and participate in resolution of issues with the products.
Other duties as assigned by management.
Bachelor’s degree in Computer Science or Information Systems, or equivalent experience typically obtained through five or more years of related work experience.
3 years of experience with Linux systems as a systems engineer.
Familiarity with configuration management and release engineering processes and methodologies.
Experience with version control, shell scripting, and one or more scripting languages such as Python, Perl, Ruby, or PHP.
Experience supporting internally developed as well as third-party, customer-facing products.
Proven self-starter with the ability to document technical data clearly and concisely and to track incidents, projects, and tasks on a daily basis.
Experience mentoring and supporting other team members.
Excellent coordination, planning, and written and verbal communication skills in the English language.
Experience using Hadoop, including workflow management systems and monitoring of job execution.
Experience supporting open-source messaging solutions such as RabbitMQ or ActiveMQ.
Experience with Zabbix, ELK, Grafana and OpenTSDB.
Experience with Java applications, including webapps in Tomcat.
Experience with SaltStack for configuration management.
Physical Demands (Physical, Mental Demands or Exposures)
Typical office environment.
Ability to be available for escalations 24x7.
Travel to the US may be required.