台北市中山區4年以上大學
Overview
As a Product Development Solution Architect (PDSA) at TrueWatch, you will play a key role in integrating diverse monitoring and observability solutions into our platform. This role combines strong DevOps/SRE expertise with hands-on integration and solution design, working directly with international clients and internal teams to deliver scalable, reliable, and insightful observability implementations.
Responsibilities
• End-to-End Observability Solutions: Design and implement complete observability solutions tailored to client requirements, from data collection to visualization and alerting.
• Integration Development: Build and maintain integrations with third-party monitoring tools such as Prometheus, Grafana, Datadog, OpenTelemetry, and others.
• Monitoring & Alerting: Define and configure meaningful monitoring dashboards and alert rules that provide actionable insights for system reliability.
• Infrastructure Expertise: Work across environments including VMs, Kubernetes, and serverless platforms to design robust solutions.
• Cloud Platforms: Architect and implement solutions leveraging Google Cloud Platform (GCP), Amazon Web Services (AWS), and Microsoft Azure.
• Cloud Networking & Security Awareness: Apply knowledge of CDN, WAF (e.g., Cloudflare), and general cloud networking practices. Maintain a solid understanding of security best practices to ensure secure deployments.
• Automation & Scripting: Write scripts and tooling (Python, Bash, etc.) to automate deployment, configuration, and integration workflows.
• Containerization & CI/CD: Apply knowledge of Docker and container ecosystems to streamline development and deployment pipelines.
• Collaboration: Engage with international clients and internal teams, understanding their needs and delivering tailored observability solutions from multiple perspectives.
Required Skills & Qualifications
• Education: Bachelor’s degree or above in Computer Science, Engineering, or related field.
• Experience: Minimum of 4 years in DevOps, SRE, Solution Architecture, or related technical roles.
• Strong background in DevOps and SRE practices, with a focus on monitoring, alerting, and reliability.
• Hands-on experience with Prometheus, Grafana, Datadog, OpenTelemetry (at least two of them in depth).
• Solid knowledge of infrastructure platforms: VMs, Kubernetes, and serverless computing.
• Proficiency with major cloud platforms (GCP, AWS, Azure).
• Familiarity with CDN, WAF (Cloudflare or similar), and cloud networking concepts.
• Demonstrated security awareness and understanding of best practices in secure system design and operations.
• Proficiency in Python and Bash scripting for automation.
• Understanding of Docker and containerization principles.
• Confident verbal and written communication skills in English, with the ability to effectively collaborate with international clients and internal teams across different technical and cultural backgrounds.
• Strong problem-solving skills, capable of proposing creative and practical solutions.
What We Offer
• Exposure to global clients and real-world observability challenges.
• Opportunities to learn from diverse perspectives and explore different solution approaches.
• A dynamic environment where innovation and collaboration are encouraged.