[Job Title]: Data Scientist - LLM(Large Language Model) Applications [About Us]: Founded by ex-googlers and started at Stanford’s StartX program, PowerArena now has offices in the States, Taiwan, Hong Kong and China. We are a fast-growing, research-driven company building AI solutions that helps manufacturing corporations and factories assembly operations overcome the challenges they face in productions every day. We are a fast-growing, research-driven company building AI solutions that helps corporate and factories overcome the challenges they face every day. Using novel machine learning techniques, we are revolutionizing the industry and have a track record of building things that others have ruled out as impossible. Our team is our best asset. We work with smart and talented individuals, who all enjoy a high degree of responsibility and independence in structuring their work. The team are looking for passionate data scientist who is keen to work with like-minded individuals in a rapidly evolving environment. =====[Job Description]===== [Position Overview]: We are looking for a Data Scientist specializing in LLM to drive our LLM appliactions. As a Data Scientist at PowerArena, you will work on projects about smart factory and smart city, and tackle a diverse range of challenges. You will play a pivotal role in developing and implementing LLM applications using machine learning, deep learning, , context engineering, prompting engineering and knowledge base techniques. [Key Responsibilities]: - Collaborate with cross-functional teams to understand project requirements and objectives. - Design and develop end-to-end solutions using LLMs for applications in smart factory and smart city domains. - Integrate LLM applications with structured and unstructured knowledge bases to enhance reasoning and retrieval capabilities. - Apply prompt engineering and context engineering to optimize LLM outputs and align them with business needs. - Conduct fine-tuning, evaluation, and benchmarking of LLM-based models for target use cases. - Stay up-to-date with state-of-the-art research in natural language processing, generative AI, and multi-modal learning. - Evaluate and benchmark different models to identify the best solutions for specific applications. - Share technical knowledge with team members. [Qualifications]: - Master's or Ph.D. in Computer Science, Machine Learning, Deep Learning, or a related field. - Proven experience with large language models (e.g., GPT, LLaMA, Gemini, QWen) and related frameworks. - Strong proficiency in programming languages such as Python, C/C++ and familiarity with relevant libraries (PyTorch, TensorFlow, etc.). - Expertise in prompt engineering, context engineering, and knowledge base integration (e.g., RAG pipelines, vector databases). - Excellent problem-solving skills and the ability to work independently and as part of a team. [Bonus Qualifications]: - Familiarity with VLM. - Familiarity with AI agent framework. - Familiarity with MLOps, model monitoring, and pipeline automation. - Background in smart factory, smart city, or industrial AI use cases. - Contributions to open-source LLM or NLP projects. - Familiarity with Industry 4.0 applications. - Familiarity with vision related work. (Object detection, Segmentation, Recognition...) [What We Offer]: - Flexible working environment - Competitive salary and benefits package. - Opportunity to work on cutting-edge projects with real-world impact. - Collaborative and innovative work environment. - Professional development and training opportunities.
月薪70,000元以上
(固定或變動薪資因個人資歷或績效而異)未填寫
- 組織扁平,任何人都能發表自己的想法,和創始人共同工作 - 完善年度之評分與職涯規劃,分享與公司成長的回報 - 優於勞基法給予特休假勤 - 海內外員工Team Building 活動 - 能於香港總部及其他地區辦公室國際化團隊合作,發揮影響力讓世界都看見