Key Responsibilities:
Manage the System Center System(MECM、SCOM、SCVMM...) infrastructure , including design, configuration, maintenance, and optimization, to ensure system performance and stability.
Administer System Center System Server, including configuration, maintenance, and upgrades.
Manage the Antivirus(TrendMicro Apex One) system infrastructure , including design, configuration, maintenance, and optimization, to ensure system performance and stability.
Administer Apex One Server, including policy configuration, maintenance, and upgrades.
Monitor system performance, promptly resolve system issues, and minimize downtime.
Collaborate with teams to implement new systems or upgrades, including requirements analysis, planning, and execution.
Prepare and update system documentation, including system settings and operational manuals.
Conduct regular system backups and disaster recovery planning.
Synology's product developers design cutting-edge products used by millions of users worldwide. Our extensive portfolio, including data storage, networking, and surveillance products, is powered by our own software solutions, giving us a distinct edge in the competitive market. We're actively seeking engineers capable of making a significant impact, bringing fresh and diverse ideas to further strengthen our team.
Data plays a pivotal role in driving modern businesses, economies, and societal advancement. Every second, an immeasurable amount of data is generated, with only a fraction of it being captured. At Synology, we specialize in designing data management solutions specifically crafted to empower businesses in their pursuit of innovation. As a Product Developer within one of Synology's core software teams, you will have the opportunity to work on the fundamental functionalities that underpin our storage solutions. Your contributions will be instrumental in shaping the future of our products.
You will:
- Developing robust and high-performance subsystems for Synology DSM (DiskStation Manager)
- Creating efficient system monitoring solutions
- Profiling and optimizing the performance of file-level protocols and block-level protocols
Cloud Infra Automation:
Design and deploy infrastructure on bare metal or cloud using Terraform, Ansible, or Helm. Automate workflows with Python or Go.
Platform Reliability:
Maintain and scale GPU clusters, Kubernetes, and AI-optimized storage (Ceph, BeeGFS, Weka) to ensure stability and performance.
Monitoring & Alerting:
Use Prometheus, Grafana, ELK, etc., to monitor system health and trigger alerts on anomalies.
Capacity Planning:
Analyze usage patterns and forecast infrastructure needs for AI workloads.
Incident Management:
Lead root cause analysis and manage SLOs/SLIs/SLAs to maintain high availability.
CI/CD Integration:
Work with DevOps/MLOps teams on CI/CD pipelines using GitLab, ArgoCD, or similar tools.
Security & Compliance:
Secure Linux systems, manage certificates, and enforce access controls (RBAC, LDAP SSO, TLS, segmentation).
Documentation & Playbooks:
Maintain architecture diagrams, runbooks, and incident playbooks to support knowledge sharing and onboarding.