Software Engineer, Site Reliability (SRE)
Software Engineer, Site Reliability (SRE)
at
Sierra Business Solution .
About Us
We are an inperson company based in San Francisco with growing offices in Atlanta, New York, and London, building a platform that helps businesses create better, more human customer experiences with AI.
Our core values are Trust, Customer Obsession, Craftsmanship, Intensity, and Family.
Company founders: Bret Taylor, former Salesforce and Facebook executive; Clay Bavor, former Google Labs leader.
What Youll Do
Own Sierras observability stackmonitoring, alerting, logging, and tracingto give engineers clear visibility into system health and performance.
Partner with product and platform engineers to design reliable, scalable systems from day one.
Design and implement scalable, secure cloud infrastructure (AWS) using Terraform and modern DevOps tooling.
Improve reliability and scalability of LLM deployments, ensuring robust, costeffective operation.
Lead improvements to deployment pipelines, CI/CD tooling, and incidentmanagement processes.
Define the foundation of SRE practices at Sierra, influencing culture, tooling, and best practices.
What Youll Bring
5+ years of handson experience in Site Reliability or infrastructure engineering for complex SaaS or cloudbased systems.
Experience designing for availability, scalability, and reliability at both infrastructure and application layers.
Deep experience with Terraform, AWS services, container orchestration, and cloud networking (IAM, VPC).
Strong background in observability systems (Prometheus, Grafana, Datadog, or similar).
Experience working with enterprise customers and familiarity with compliance and networking needs.
Comfortable working in fastmoving environments and collaborating across teams.
Degree in Computer Science or equivalent professional experience.
Even Better
Experience with LLM infrastructureoptimizing inference, managing finetuned models, or largescale deployment.
Earlystage startup experience defining SRE culture and tooling from scratch.
Familiarity with incidentmanagement automation or selfhealing infrastructure patterns.
Benefits
Unlimited Paid Time Off
Medical, Dental, and Vision benefits
Life Insurance and Disability Benefits
401(k) retirement plan with company match
Parental Leave and fertility benefits via Carrot
Lunch, snacks, coffee, and discretionary stipend
Equity plans per applicable policies
Equality & Diversity
We actively encourage applicants of all backgrounds to apply. We strive to evaluate all applicants consistently without regard to race, color, religion, gender, sexual orientation, age, disability, veteran status, or any other protected characteristic.
#J-18808-Ljbffr
...At accentedge , we recognize the vital role of secure and efficient network architecture in today's digital landscape. We are seeking a talented Palo Alto Infrastructure Engineer to join our team and help design, implement, and manage advanced cybersecurity solutions...
...Job Description Job Description Project Manager Project Manager - Mission Critical/ Data Center - Electrical Focused East Coast... ...Electrical Engineering, or a related field. Minimum of 5 years of experience in project management, preferably in data center or...
...seaboard. We are currently seeking a skilled and dedicated Traveling Drywall Carpenter to join our dynamic team. Job Description: As a... ...to various job sites to perform drywall installation, finishing, and repairs. This role requires a high level of skill, precision...
Remote Recruiter - Unlimited Earning Potential! Company: gpac (Growing People and Companies) Location: 100% Remote (Work from Home) Earning Potential: Commission-based (Top producers earn $200K-$500K+) Who We Are gpac is a family-owned executive search firm with...
...Turing is looking for experienced financial advisors and planners with a strong background in personal finance, investment management, retirement planning, and tax optimization. Role Overview: In this role, you will work on projects that help fine-tune and evaluate...