Hybrid Site Reliability Engineer & Go Developer
If the "Apply Now" button above does not present an email form, it could be related to your browser settings (try this). Or, send us an email at hr@netgate.com. Thank you!
Department: IT Type: Full Time Location: Austin, TX or Remote (US Only)
Job Overview
We are looking for a proactive and experienced Hybrid Site Reliability Engineer & Go Developer to join our team. This role is a unique blend of traditional SRE responsibilities with a strong emphasis on Go language development. You will ensure the performance, availability, and reliability of our websites and services while also building high-performance software solutions. Working closely with development and IT teams, you will help maintain a scalable, robust infrastructure that aligns with the company's business goals. Your responsibilities will include monitoring, troubleshooting, coding, and automating processes to optimize system performance.
Don't meet every single requirement? We encourage you to apply anyway! If you bring relevant skills from a related field or are excited to learn and grow, we want to hear from you. Your unique experience may be a great fit for this role.
Key Responsibilities
- Design and implement highly available and scalable systems, focusing on reliability and performance.
- Develop features and applications using Go to improve system reliability and performance.
- Collaborate with cross-functional teams to define and implement service level objectives (SLOs) and service level agreements (SLAs) for critical systems.
- Monitor systems and applications, identifying and resolving performance bottlenecks and availability issues.
- Create, maintain, and optimize Go-based applications and services to improve infrastructure performance.
- Build and maintain monitoring tools, alerts, and dashboards for system visibility.
- Perform root cause analysis post-incident and implement preventive measures.
- Automate repetitive tasks and processes through Go-based tooling and scripting.
- Create and maintain documentation for infrastructure, application, and system architecture, configuration, troubleshooting procedures, runbooks for common tasks and maintenance, after-action reports, and any other relevant operational guides to ensure smooth and efficient processes.
- Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability standards.
- Stay current with industry best practices, new technologies, and emerging trends in SRE and Go development.
- Participate in an on-call rotation to provide support across all systems and services, responding to critical issues as they arise and ensuring minimal downtime during off-hours.
Requirements
- 3-5 years of experience in a technical role such as SRE, DevOps, or Go development, supporting live services in production.
- Proficiency in Go language development is essential, with familiarity in other languages like Java, Python, or Ruby.
- Strong analytical and problem-solving skills with a focus on automation and continuous improvement.
- Experience troubleshooting complex technical issues in distributed systems.
- Ability to work independently, manage multiple projects, and collaborate across teams.
- Proactive and enthusiastic approach, fostering a collaborative and learning-based environment.
Preferred Skills and Qualifications
- Solid understanding of SRE principles and software development methodologies.
- Experience with agile and iterative development processes.
- Familiarity with CI/CD pipelines (e.g., Jenkins, GitLab).
- Experience with version control systems such as Git, SVN, or Perforce.
- Strong understanding of Linux/Unix systems, command-line tools, and scripting languages (e.g., Bash, Python).
- Experience with configuration management tools (e.g., Salt, Ansible, Puppet, Chef).
- Familiarity with cloud platforms (AWS, Azure, or Google Cloud).
- Strong understanding of networking principles (TCP/IP, HTTP, DNS).
- Expertise with monitoring and logging tools (Prometheus, Grafana, ELK Stack, etc.).
- Knowledge of security best practices in production environments.
Compensation
The U.S. base salary range for this full-time position is $124,000 - $150,000 DOE + bonus + benefits. When available, the salary range posted for this position reflects the projected minimum and maximum hiring range for new hires in U.S. locations. Compensation details reflect the base salary only and do not include bonuses or benefits. Individual pay is determined by the work location and additional factors, including but not limited to skillset, experience, and relevant education, certifications, or training. Applicants may not be eligible for the full salary range based on their U.S. hiring location. The manager can share more details about compensation for the role in your location during the hiring process.