Job Description:
Responsibilities:
1、Design and develop scalable frameworks and pipelines for building machine learning models that support a broad range of business applications across domains.
2、Build models and systems that support knowledge acquisition, representation, and retrieval, including integration with enterprise knowledge bases and retrieval-augmented generation (RAG).
3、Apply cutting-edge techniques in large language models (LLMs) and LLM agents, with a strong understanding of transformer architectures, fine-tuning strategies, and evaluation methods.
4、Experiment with and apply advanced training methodologies, including reinforcement learning, supervised fine-tuning, and model distillation, to improve model performance on business applications.
5、Stay at the forefront of research and tooling in AI/ML, evaluating new libraries, models, and approaches for real-world adoption.
Requirements:
1、Master’s degree or above in Computer Science, Machine Learning, Artificial Intelligence, or a related field, with 3+ years of hands-on experience in developing and deploying ML models.
2、Strong foundation in machine learning algorithms and deep learning architectures, with the ability to design and adapt solutions across a variety of problem spaces.
3、Proficient in Python and well-versed in modern ML libraries such as PyTorch, JAX, and HuggingFace; familiarity with agent-oriented tooling such as Pydantic AI.
4、Experience working with knowledge systems, semantic search, and retrieval-based techniques.
5、Deep understanding of LLM development, including pretraining concepts, fine-tuning methods, inference optimizations, and LLM-based agents.
6、Demonstrated experience applying techniques such as reinforcement learning (e.g., RLHF), model distillation, and instruction tuning.
7、Curious, self-driven, and comfortable working across disciplines; strong problem-solving skills and effective communication in English.