• Familiar with the day-to-day operational support for Cluster, Storage, HPC, AI, Data Center and Cloud infrastructures.
• Builds Cluster, Storage, HPC, AI, Data Center and Cloud infrastructures in-house and onsite testing, deployment, and platforms accordingly to meet customer's requirement.
• Troubleshoot hardware and software issues in rack cabinet. Provide fixes in a timely manner.
• Documents complex test procedures and troubleshooting procedures related to servers/networks/clusters software and hardware.
• Familiar with Intel/AMD/NVIDIA development toolkits like CUDA, oneAPI, ROCm.
• Conduct tests and benchmarks against server hardware, storage, network, applications, HPC and AI/ML/DL workflows.
• Programming experience with web applications, including frontend or backend.
• Collect, visualize, and analyze test and benchmark results.
• Programming experience with Python, Ansible and Linux shell scripting.
• Write technical documentation including test reports and standard operating procedure (SOP).
1. Linux embedded firmware architecture design capability.
2. Linux embedded platforms porting experience.
3. Experience with programming language such as C,Perl,Script, Python.
4. Fundamental Linux platform knowledge
5. Familiar with networking knowledge
你是個對於各種電腦及網路設備都感興趣的人嗎?
喜歡玩新技術,喜歡學新東西,在現實中又可以靠這些新技術來解決使用者的問題,會讓你感覺到很有成就感嗎?
你認為玩技術不單只是會玩單品項的產品、只滿足於會安裝、懂基本設定,你還期望能綜合不同的技術,來兜出足以解決現實問題的方案嗎?
我們團隊專門為國外客戶開發及維護跨國大型網站,我們在乎任何形式的技術解決方案,「既使是速度快一秒,可靠度高 0.1%,也對我們的營運有極大的幫助」,所以我們有足夠的營運動機,可以讓你發揮你對技術上的各種熱情。
加入團隊,你將會知道什麼叫跨國大型網站的維運,你不再是外商公司放在台灣幫忙看機器的機器人,不是見樹不見林的螺絲釘,只要你願意自我學習、向上提升,你就是我們的主角。
歡迎有【熱忱】、【追求極致】、【樂於團隊合作】的夥伴一同打造一流的服務!
【工作內容】
1. Plan/Design/Maintain application servers on Physical servers, Private Cloud, Public Cloud to provide worldwide 24x7 services.
2. Cooperate with software developers to deploy / performance monitoring / troubleshooting Cloud applications.
3. Maintain infrastructure of application environment, such as GCP, K8s and Containers, AD, DNS, SMTP, Proxy, Load balancer, NTP, Linux Server, IaC tools, automation tools.
4. To provide telephone/remote and on-site 2nd level technical support for service continuity.
【工作技能】
- Linux Server
- Windows Server
- Kubernetes (K8s)
- Nginx
- HAproxy
- Prometheus, Grafana
- Terraform
- Hypervisor
- Ansible, Jenkins
o Support office IT operations, including personal computers, office networking, software installations, and user access management.
o Manage and maintain IT systems including Windows Server, Linux, GCP, AWS, etc.
o Configure, manage, and troubleshoot network devices including routers, switches, firewalls, and wireless access points (APs).
o Provide first line (L1) and second line (L2) support, including on-call rotations.