Punchh is the world leader in innovative digital marketing products for brick and mortar retailers, combining AI and machine learning technologies, mobile-first expertise, and omni-channel communications designed to dramatically increase lifetime customer value.
Punchh is seeking a Director of DevOps based in the San Mateo headquarters office to lead the DevOps definitions and implementations of the company’s production infrastructure. He/she will create a backend support infrastructure that will scale globally, utilizing automation tools to maximize operational efficiency. Reporting directly to the CTO/Co-founder, the Director of DevOps at Punchh will have multiple teams reporting under him/her including SRE, Security, Performance & Scale, Automation and General Administration).
At a high level, the Director of DevOps will be ultimately be responsible and accountable for the following in an environment with company operations and customers in the US and internationally:
*SRE (Site Reliability Engineering) — Making sure that all systems are UP and running all the time and proactively take action if/when systems are down or there are interruptions in service. He/she will put monitors and alerts within the system and take corrective actions based on Runbooks they will build. When things are stable, he/she will need to perform “Chaos Engineering” to prepare for disaster. Additionally, the DevOps Director will define a DR (Disaster Recovery) plan and test it.
*Performance / Scale — Making sure that the application(s) are operating as fast as promised (performance) and that those applications can scale to handle higher loads. He/she will work with developers to find hotspots and work towards re-architecting / re-designing as needed. This team must work closely with the engineering team to provide architectural input as needed.
*Automation — Making sure Punchh can create new stacks for new customers, zero-downtime deploys and ability to automate daily chores. He/she would build our Continuous Delivery and Continuous Integration pipelines
*Security - He/she will not only put processes and systems in place to prevent leakage and breakage and guard and monitor the application but also help teams with security awareness
*Housekeeping (Systems/Operations) / Program Management
*Create, develop and maintain support systems and third-party apps to enable, track and report on the DevOps organization.
*Own the establishment, development, and ongoing nurturing of the Punchh DevOps team.
*Setup, grow and nurture Punchh’s Site Reliability Engineering team.
*Work with the Product/Program Management and Core Engineering teams to give the appropriate input for Sprints and Story planning. Ensure the assignment of tasks to individuals maximizes the team’s overall expertise.
*Collate information from the delivery platforms and influence the engineering roadmap based scale/performance/security feedback from production systems.
*Establish, monitor and report on the growth of the SaaS infrastructure. Analyze, review, and report on the effectiveness of new initiatives.
*Automate deployment pipeline and new stack creation.
*Manage escalations to minimize any client downtime, ensuring adherence to any contractual commitments.
*Ensure trends in DevOps tools and applications are constantly evaluated and assessed to improve Punchh’s production efficiency.
*Manage the on-going education, development and morale of the DevOps team to minimize attrition.
*Represent DevOps within Engineering scrums, bug scrubs, and cross-function group meetings.
*Reduce DevOps costs by implementing appropriate resources.
*Develop and manage a 24x7 support infrastructure and ability to work off-hours or on-call as needed within the SRE team.
*Some of the technologies in active use include: Kubernetes, AWS ECS, Docker Containers, CloudFormation, Jenkins, AWS Lambdas, New Relic, Grafana, ELK and the Director of DevOps is expected to have direct first hand, in depth experience with these.
*5+ years in DevOps with at least 2 years as a DevOps Lead or Manager with proven technical chops and leadership experience OR significant, deep experience in at least two of the following sub functions (SRE, Performance/Scale, Automation & Security) and desire to move to a leadership role.
*Experience in high-scale, large enterprise SaaS companies that operate systems supporting high/peak traffic
*An in-depth knowledge and understanding of DevOps infrastructure platforms and their deployments.
*Significant expertise on AWS ecosystem of services and products is a must.
*Solid foundation in Linux / Unix
*Strong professional experience programming in Python or a similar language.
*Exceptional ability to communicate complex information and ideas in written and verbal format. Problem-solving, consultative approach, creative and diplomatic interpersonal skills.
*Team player, with the confidence to take the lead and guide other departments when necessary.
*Exceptional command of and understanding of technical infrastructure, with the ability to pick up new solutions quickly.
*Understanding of and embracing the principles of good “customer service”, whether internal or external.
*BS in Computer Science/MIS or equivalent work-related experience.