Share this Job

Sr. Site Reliability Engineer (remote)

Date: Feb 18, 2021

Location: remote, MA, US, remote

Company: EBSCO Industries Inc

EBSCO Information Services (EIS) provides a complete and optimized research solution comprised of e-journals, e-books, and research databases — all combined with the most powerful discovery service to support the information needs and maximize the research experience of our end-users. Headquartered in Ipswich, MA, EIS employs more than 3,300 people worldwide. We are the leader in our field due to our cutting-edge technology, forward-thinking philosophy, and top-notch workforce. EIS, a division of EBSCO Industries Inc., based in Birmingham, AL, is ranked in the top 200 of the nation’s largest, privately held corporations according to Forbes magazine. EBSCO is a company that will motivate you, inspire you, and allow you to grow. We are looking for the best. If you are too, we encourage you to explore our unique opportunities.

 

We are looking for experienced Site Reliability and DevOps Engineers to join our Site Reliability Engineering Team.  This person will bring to bear software engineering principals and techniques in solving application and infrastructure operational challenges.  Site Reliability Engineers are always looking to eliminate performance bottlenecks, isolating failures, and automating day to day system operation processes.  Partnering with development teams, this role will establish best practices in operational readiness as well as incident response with the goal of making systems more reliable.

Primary Responsibilities

  • Participate in a rotating on-call schedule to troubleshoot and resolve production issues
  • Monitor and troubleshoot availability and performance issues
  • Automate deployment, alerting, monitoring, management, and incident response
  • Develop and improve operational practices and procedures
  • Partner with development teams by measuring and monitoring service level objectives and indicators
  • Develop software platforms and frameworks for maintenance and review of dashboards, alerts, capacity planning, and operational readiness checklists
  • Be the driving force for correct incident response and blameless postmortems
  • Develop playbooks and tools to streamline processes and shorten problem resolution time
  • Ability to operate in the high-pressure environment and troubleshoot complex issues quickly, while successfully handling multiple priorities
  • Promote a culture of feedback loops, trust, and partnership with our internal community.
  • Drive and champion Site Reliability and DevOps culture

Requirements

  • 5+ years of experience working in operations
  • 5+ years of experiencing with Linux and/or Windows operating systems internals and administration
  • Scripting languages like Python, Bash, PowerShell, Groovy, or JavaScript
  • Experience with testing, automation, continuous integration frameworks and best practices
  • Demonstrated knowledge of software engineering best-practices (e.g., linting, testing)
  • Experience operating/supporting highly available runtime softwre
  • Experience with AWS Services (EC2, VPC, ELB, EFS, CloudFormation, etc.)
  • Experience with monitoring/alerting tools (Opsgenie, AppDynamics, BMC, Nagios)
  • Experience with logging tools (Sumo Logic, CloudWatch, ELK)
  • Systematic problem-solving approach, with effective communication skills and a sense of ownership and drive

Preferred Skills

  • Experience with Systems Operations on AWS
  • Experience with releasing, coordinating, and launching services
  • A sense of urgency, and a strong bias for action
  • Strong communication skills and a natural inclination to collaborate
  • Experience with web-based tools for collaboration and communication
  • Exhibits sound judgement and can make decisions despite ambiguity, identifies root causes, and gets beyond treating symptoms

EBSCO Industries, Inc.is an equal opportunity employer and complies with all applicable federal, state, and local fair employment practices laws.  EBSCO strictly prohibits and does not tolerate discrimination against employees, applicants, or any other covered persons because of race, color, sex (including pregnancy), age, national origin or ancestry, ethnicity, religion, creed, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.  This policy applies to all terms and conditions of employment, including, but not limited to, hiring, training, promotion, discipline, compensation, benefits, and termination of employment.

EBSCO complies with the Americans with Disabilities Act (ADA), as amended by the ADA Amendments Act, and all applicable state or local law.

View EEO PDF


Job Segment: Engineer, Linux, Cloud, Testing, Software Engineer, Engineering, Technology