Site Reliability Engineer (Remote)
We are seeking Site Reliability/DevOps Engineer candidates for a direct hire opportunity with a growing & innovative Physical/Digital Security software company. This is a high-impact and new role in the Product Development team that will strengthen Site Reliability/DevOps with an emphasis in AWS/Kubernetes deployments.
This is a remote role for candidates located in EST or CST time zones. United States Work Authorization without sponsorship is required.
- Create, deploy, configure, and manage applications using Kubernetes, with a strong focus on AWS Fargate/EKS and GCP CloudRun. Responsibilities include optimizing deployments for security, stability, and monitoring.
- Manage AWS or GCP-based infrastructure, including responsibility for deploying, scaling, and monitoring infrastructure components to ensure availability, resilience, and performance.
- ???????Implement and maintain security best practices, including encryption in transit and at rest.
- ???????Configure and monitor necessary security components (VPC, IAM, etc.) to ensure data and infrastructure security.
- Work with development teams to optimize containerized applications, specifically Golang, Python, and Node.js, for performance, scalability, and resource efficiency.
- Set up and manage monitoring tools (CloudWatch, CloudTrail, Prometheus, Grafana, and Google Cloud Monitoring tools, etc.) to track performance, identify bottlenecks, and maintain overall system health.
- Experience in running load tests to ensure applications can handle expected traffic. Analyze results and recommend optimizations based on performance metrics.
??????? Qualifications:
- 3+ years of experience in a Site Reliability Engineer, DevOps, or Infrastructure Engineer role.
- Demonstrable experience with Kubernetes in deployment, optimization, security, and monitoring.
- Strong understanding of cloud services, including best practices for deployment, monitoring, and security.
- Deep knowledge of securing cloud environments and data, with hands-on experience configuring encryption mechanisms.
- Experience with Docker and optimizing Golang, Python and Node.js-based containers for performance and resource utilization.
- Bonus: Knowledge of app load testing techniques and tools.
Preferred Skills:
- Automation: Experience with Infrastructure-as-Code (IaC) tools such as Terraform.
- Collaboration: Work closely with development teams to improve CI/CD pipelines and automate deployment processes.
- Problem Solver: Strong troubleshooting skills with a proactive approach to identifying and resolving infrastructure issues.
- Security Mindset: Consistent focus on security across development and operational practices.

