• Provide Hands & Feet support services which are tracked through Service Now tickets
• Equipment Rack and Stack
• Cable, terminate, and dress fiber or copper network cabling
• Assist with the diagnosis of hardware and software
• Power cycling of customer equipment
• Perform cabinet/cage audits
• Blade/Card, Memory, Hard Drive, and Transceiver installations or removal
• Testing and troubleshooting of copper and fiber optic circuits
• Perform migrations of equipment, power, or networking devices
• Shipping and receiving of packages
• Oversee and maintain the Access Control System for the facility
• Maintain the DVR and CCTV cameras for the facility
• Help Security administer badges for internal and external customers
• Receive, install, and test customer and internal cross connects of copper and fiber optic interconnections
• Provide timely customer and internal updates using a computer-based ticketing system
• Receive, log, and store internal packages
• Perform and document inventory verification/counts
• Perform infrastructure builds
• Perform DC power installations per Facility Engineering design specifications
• Monitor and report on Mechanical HVAC systems
• Monitor and report on Critical Power Systems
• Maintain overall Data Center cleanliness and appearance
• Work flexible shift schedules
• Provide rotational on-call coverage
• Report to the facility within a 1-hour average for emergency outage assistance
Job Description:
• Resources will assist client with rack, stack, and installation of key data center equipment.
• They will help troubleshoot common issues, work through various tickets, and act as a level 1 administrator for any data center issues.
• Do mechanical assembly/disassembly and install heavy GPU trays into systems
• Build up prototype open board systems on the benchtop and debug issues reported by users.
• Triage and debug systems and get systems to pass baseline diagnostics.
***NOTE: The candidate will be required to travel 50% of the time to other locations in the region (Japan, Hong Kong, Singapore, etc)***
Requirements:
• Strong data center experience with rack and stack,
• Minimum of 3 years on-prem large scale data center experience supporting the Data Center equipment and/or infrastructure (Power, Space, Cooling, Equipment)
• Strong cabling skills, cable management, and dressing
• Strong NW troubleshooting experience, ideally with Supermicro Servers, Infiniband Fabric, and Mellanox, Arista Hardware
• Strong attention to detail, independent, and out of box thinking
• Solid skills in UNIX (Ubuntu or RedHat) administration and knowledge of commands, including logging into the server’s to monitor for disk failures, debug systems, etc
• Break fix server expertise, including mechanical assembly/disassembly, installing heavy GPU trays, changing system config,s and building up prototypes.
* System Architecture Design: Design and modify system architecture based on existing functionalities, ensuring it meets business requirements and offers high scalability and reliability.
* Software Development: Develop backend services using Java 11+ and Spring Boot 2.x, implementing microservices architecture, and utilizing Kubernetes for container orchestration.
* Documentation: Update existing system documentation, including architecture diagrams, deployment guides, and user manuals, and create new documents such as API documentation, user guides, and operational manuals.
* Production Environment Issue Resolution: Monitor the production environment to identify and resolve potential issues promptly, and analyze system logs and performance metrics to enhance system stability and efficiency.
1. Software Sustaining and continuously improve current machine software
2. Responsible for software feature development and enhancement base on customer requirement
3. Provide supporting and troubleshooting to on-site software issue
4. Review and update the user requirement documents and operation procedures
5. Collaborate with mechanical, electrical, application team to implement new hardware feature or software feature to equipment
6. Develop and enhance software utilities to improve equipment setup and easy operation abilities