Job Posting Banner

Site Reliability Engineering (SRE)

Lincoln, UK ● United Kingdom Req #1641
Friday, May 3, 2024
For more than 30 years, ECI Software Solutions has been providing industry-specific, cloud-based business management software and services to small and medium-sized businesses. With divisions focused on manufacturing, wholesale/retail distribution, building and construction, and field service, ECI's solutions integrate into every aspect of a customers' business to help them level the playing field, run day-to-day operations more efficiently, and free them up to focus on what matters most. It’s how business gets done.
 
Who is ECI?
 
At ECI, our mission is to enable the entrepreneurial spirit of small and medium-sized business owners. But ECI doesn’t simply deliver amazing software solutions; we also have an award-winning company culture.
  • We offer competitive benefits focused on employee well-being, including paid volunteer time off!
  • We have been named by Achievers on its prestigious 50 Most Engaged Companies To Work For list for the last five years.
  • We have received international recognition for our high levels of employee engagement through Certification as a Great Place to Work six years in a row.
  • Our culture of creativity, innovation, and leadership has garnered over a dozen International Business Awards (Stevie®).
Come join a worldwide team with a strong culture of inclusion, professional development, and collaboration.
 
To apply for this position, please attach a detailed resume that demonstrates your qualifications and skill set pertaining to this position. Applications without a resume will not be considered.

We are seeking a talented SRE (Site Reliability Engineer) to join our growing team. The ideal candidate should be at a senior level, but we are open to candidates who are close to this level and have relevant experience. As an SRE, you will be responsible for ensuring that our products are reliable, available, and scalable. You will work closely with our development teams to identify and resolve issues before they impact our customers while advocating for the improvements needed to provide a world class cloud experience for our customers. The SRE will be responsible for building systems and tooling to enable and empower the dev teams to work more efficiently while fortifying a cloud-first mentality. As a member of this team, you’ll get exposure to many skill sets spanning all the major cloud providers and technology stacks.

Please note that this is a remote position for candidates located in the United Kingdom.

Responsibilities

  • Implement, operate, and maintain ECI’s cloud data center sites
  • Evangelize and advocate for operational improvements within ECI’s software stack.
  • Empower developers with Continuous Development/Continuous Improvement pipelines.
  • Engage in and improve the application life-cycle.
  • Write code for cloud infrastructure in reusable, composable blocks.
  • Support services before they go live (e.g., project management, system design consulting).
  • Solve problems to prevent recurrence by automating responses.
  • Measure and improve performance and operational characteristics.
  • Implement and automate disaster recovery practices.
  • Apply configuration management tools to manage infrastructure as code.
  • Perform management and application support of multiple tech stacks such as RDS, Web, Serverless, etc.
  • Engage in service capacity planning and demand forecasting.
  • Refine and implement DevSecOps security practices.
  • Architect systems for HA, Disaster Recovery, and Load Balancing decisions.
  • Write playbooks and inform the incident response practices.
  • Participate in an on-call rotation for 24x7 support.

 

Qualities and Skills Required

  • Bachelor's Degree in Computer Science, Engineering, IS or equivalent demonstrated experience
  • 5+ years of experience in a related roles (SRE, DevOps, Infrastructure, Software Engineer, etc.)
  • 5+ years of experience with programing languages such as Python, Terraform, and PowerShell
  • Strong experience with Windows systems, RDS, and Web applications
  • Foundational understanding of Linux environments
  • Foundational understanding of Active Directory and Group Policy
  • Experience with CI/CD tools and implementing modern practices
  • Experience with monitoring and logging tools (e.g. DataDog, ELK stack, Prometheus, Grafana)
  • Foundational understanding of virtualization concepts, containerization, continuous integration, cloud computing, performance tuning/optimization, and troubleshooting.
  • Experience with cloud-based IAAS (Azure, AWS)
  • Foundational understanding of Software Development Lifecycle
  • Foundational understanding of information systems security principles and methods used to ensure data confidentiality, integrity, and availability
  • Deep understanding of backup and disaster recovery procedures
  • Willingness to participate in on-call rotation and occasional after-hours work
  • Strong experience with automation and configuration management tools (e.g. Ansible, Chef, Puppet, ConnectWise) a plus

In addition to our competitive salary and award winning culture, we offer an excellent benefit package. We even offer our employees a day off to serve their community! Our company core values are our “CODE”: Crave Greatness, Own the Outcome, Deliver Awesome and Embrace Community.

Other details

  • Job Family Development
  • Pay Type Salary
Location on Google Maps
  • Lincoln, UK
  • United Kingdom