Job Description

Cloud Infra Automation: Design and deploy infrastructure on bare metal or cloud using Terraform, Ansible, or Helm. Automate workflows with Python or Go.
Platform Reliability: Maintain and scale GPU clusters, Kubernetes, and AI-optimized storage (Ceph, BeeGFS, Weka) to ensure stability and performance.
Monitoring & Alerting: Use Prometheus, Grafana, ELK, etc., to monitor system health and trigger alerts on anomalies.
Capacity Planning: Analyze usage patterns and forecast infrastructure needs for AI workloads.
Incident Management: Lead root cause analysis and manage SLOs/SLIs/SLAs to maintain high availability.
CI/CD Integration: Work with DevOps/MLOps teams on CI/CD pipelines using GitLab, ArgoCD, or similar tools.
Security & Compliance: Secure Linux systems, manage certificates, and enforce access controls (RBAC, LDAP SSO, TLS, segmentation).
Documentation & Playbooks: Maintain architecture diagrams, runbooks, and incident playbooks to support knowledge sharing and onboarding.

Job Category

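Salary

Negotiable

(Regular monthly salary of NT$40,000 or more)

Employment Type

Full-time

Work Location

No. 1899, Xingfeng Rd., Bade Dist., Taoyuan City

Remote Work

Management Responsibility

No management responsibility

Business Travel and Overseas Assignment

No business travel or overseas assignment required

Working Hours

Day shift, 08:30 to 17:30

Days Off

Two days off per week

Available Start Date

Within one month

Number of Openings

1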

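Requirements

Work Experience

3 years or more

Education

Bachelor's or master's degree

Field of Study

Engineering; natural sciences

Language Requirements

English: listening, intermediate; speaking, intermediate; reading, intermediate; writing, intermediate

Tools

No specific requirement

Skills

No specific requirement

Other Requirements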

Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience; 3-7 years of experience in the areas below is preferred.
Proficiency in Linux (Ubuntu, RHEL/CentOS), containers (Docker, Podman), and orchestration (Kubernetes).
Experience managing GPU compute clusters (NVIDIA/CUDA, AMD/ROCm).
Hands-on experience with observability tools (Prometheus, Grafana, Loki, ELK, etc.).
Strong scripting and coding skills (Bash, Python, or Go).
Exposure to secure multi-tenant environments and zero-trust architectures.
Familiarity with network protocols such as DNS, DHCP, BGP, and RoCEv2, and with InfiniBand or high-throughput Ethernet fabrics.
Excellent collaboration and communication skills for cross-team, partner, and customer initiatives.

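Company Photos (6)

Supermicro Zhonghe headquarters: 3F, No. 150, Jianyi Rd., Zhonghe Dist., New Taipei City

Supermicro Asia Pacific Science and Technology Park: No. 1899, Xingfeng Rd., Bade Dist., Taoyuan City

Supermicro Malaysia plant: Johor Bahru, Johor, Malaysia

Family Day event

Year-end party: year-end Christmas dinner

Super Micro Computer, Inc. (美超微電腦股份有限公司) corporate image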

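Benefits

Statutory Benefits

Lactation room, labor insurance, national health insurance, annual leave, employee health checkups

Other Benefits

◆ Compensation
1. Performance bonuses paid twice a year
2. Gift vouchers for the Mid-Autumn Festival, the Dragon Boat Festival, and birthdays
◆ Insurance
1. Labor insurance
2. National health insurance
3. Free employee group insurance
4. Discounted self-paid insurance for dependents
◆ Leave
1. Two days off per week
2. Annual leave exceeding the Labor Standards Act requirements
◆ Allowances
1. Wedding gift money
2. Childbirth gift money
3. Hospitalization and funeral condolence payments
4. Scholarships for employees' children
5. Sports and fitness subsidy
◆ Other
1. Company anniversary celebration and year-end party
2. Regular movie screenings and movie ticket giveaways
3. Regular department dinners
4. Club activity subsidies
5. Free lunch (Bade)
6. Free motorcycle parking (Bade)
Step onto the international stage and build an exceptional career: opportunities for business trips and training in Silicon Valley, California, and the Netherlands, working alongside world-class R&D engineers.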

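Contact Information

Contact Person

HR

Application Response

Suitable candidates will be contacted within 1 business day; unsuccessful applicants will not be notified.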

Site Reliability Engineer-AI Cloud_TC26896
