台北市中山區經歷不拘大學以上
[Job Title]: Data Scientist - LLM(Large Language Model) Applications
[About Us]:
Founded by ex-googlers and started at Stanford’s StartX program, PowerArena now has offices in the States, Taiwan, Hong Kong and China. We are a fast-growing, research-driven company building AI solutions that helps manufacturing corporations and factories assembly operations overcome the challenges they face in productions every day.
We are a fast-growing, research-driven company building AI solutions that helps corporate and factories overcome the challenges they face every day. Using novel machine learning techniques, we are revolutionizing the industry and have a track record of building things that others have ruled out as impossible. Our team is our best asset. We work with smart and talented individuals, who all enjoy a high degree of responsibility and independence in structuring their work. The team are looking for passionate data scientist who is keen to work with like-minded individuals in a rapidly evolving environment.
=====[Job Description]=====
[Position Overview]:
We are looking for a Data Scientist specializing in LLM to drive our LLM appliactions. As a Data Scientist at PowerArena, you will work on projects about smart factory and smart city, and tackle a diverse range of challenges. You will play a pivotal role in developing and implementing LLM applications using machine learning, deep learning, , context engineering, prompting engineering and knowledge base techniques.
[Key Responsibilities]:
- Collaborate with cross-functional teams to understand project requirements and objectives.
- Design and develop end-to-end solutions using LLMs for applications in smart factory and smart city domains.
- Integrate LLM applications with structured and unstructured knowledge bases to enhance reasoning and retrieval capabilities.
- Apply prompt engineering and context engineering to optimize LLM outputs and align them with business needs.
- Conduct fine-tuning, evaluation, and benchmarking of LLM-based models for target use cases.
- Stay up-to-date with state-of-the-art research in natural language processing, generative AI, and multi-modal learning.
- Evaluate and benchmark different models to identify the best solutions for specific applications.
- Share technical knowledge with team members.
[Qualifications]:
- Master's or Ph.D. in Computer Science, Machine Learning, Deep Learning, or a related field.
- Proven experience with large language models (e.g., GPT, LLaMA, Gemini, QWen) and related frameworks.
- Strong proficiency in programming languages such as Python, C/C++ and familiarity with relevant libraries (PyTorch, TensorFlow, etc.).
- Expertise in prompt engineering, context engineering, and knowledge base integration (e.g., RAG pipelines, vector databases).
- Excellent problem-solving skills and the ability to work independently and as part of a team.
[Bonus Qualifications]:
- Familiarity with VLM.
- Familiarity with AI agent framework.
- Familiarity with MLOps, model monitoring, and pipeline automation.
- Background in smart factory, smart city, or industrial AI use cases.
- Contributions to open-source LLM or NLP projects.
- Familiarity with Industry 4.0 applications.
- Familiarity with vision related work. (Object detection, Segmentation, Recognition...)
[What We Offer]:
- Flexible working environment
- Competitive salary and benefits package.
- Opportunity to work on cutting-edge projects with real-world impact.
- Collaborative and innovative work environment.
- Professional development and training opportunities.