What we're looking for
We are searching for a seasoned DevOps Engineer with a proven track record in optimizing and managing large-scale production systems, who is passionate about driving developer productivity and product reliability.
Responsibilities
- Lead the architectural migration of services to Kubernetes on GKE, focusing on efficiency and scalability.
- Enhance operational aspects of infrastructure, including deployment, logging, monitoring, and alerting systems.
- Develop best practices for production infrastructure management: provisioning, scaling, configuration, and monitoring.
- Establish robust CI/CD pipelines and development processes to support engineering workflows.
- Collaborate with the engineering team to address long-term platform requirements and operational guidelines, prioritizing reliability.
- Advance our engineering excellence by implementing best coding, testing, and deployment practices.
- Create and maintain comprehensive documentation related to processes and workflows.
- Manage and maintain local servers dedicated to blockchain node operations, ensuring high availability and performance.
Qualifications
- 6+ years as a DevOps or Site Reliability Engineer.
- Deep experience designing and operating large-scale, multi-region, multi-cloud production systems.
- Proficient with GCP and/or other cloud infrastructures.
- Skilled in containerization and orchestration with Docker and Kubernetes.
- Familiar with service mesh technologies like Istio or Linkerd.
- Adept at building CI/CD pipelines using tools like CircleCI and Spinnaker.
- Knowledgeable in real-time telemetry and tracing with tools such as Prometheus, Stackdriver, and DataDog.
- Strong experience with Infrastructure-as-Code (Terraform, Ansible, CloudFormation, etc).
- Competent in networking and VPC management.
- Informed about security best practices.
- Experience with streaming infrastructures like Kinesis or Kafka.
- A passion for startups, blockchain technologies, and Web3.