Join us in pioneering breakthroughs in healthcare. For everyone. Everywhere. Sustainably. Our inspiring and caring environment forms a global community that celebrates diversity and individuality. We encourage you to step beyond your comfort zone, offering resources
Company Description Do you want beneficial technologies being shaped by your ideas? Whether in the areas of mobility solutions, consumer goods, industrial technology or energy and building technology - with us, you will have the chance
【26届校招】大语言模型预训练算法工程师上海、深圳正式智能制造 / 工业互联网 / 工业自动化 - 研发职位描述职位描述我们正在寻找对大语言模型(LLM)的底层原理、性能优化和高效预训练充满热情的顶级算法工程师。您将加入我们的核心研发团队,主要负责LLM预训练阶段的算法设计、优化与实现,包括模型架构的探索、训练稳定性的提升、大规模分布式训练的优化等。我们的目标是基于业务需求,设计并训练对硬件计算友好的语言模型,从根本上突破模型的性能和训练效率极限,加速LLM在人形机器人、自动驾驶、多模态等前沿领域的落地。工作职责:1. LLM预训练算法研发与实现: 主导1~7B参数级别的Dense以及MoE Transformer模型以及其他前沿架构在预训练阶段的设计、实验与优化,以提升模型的基础能力和效率。2. 基准测试与性能优化: 负责模型训练过程中的关键性能指标监测与优化,特别是MMLU, GSM8K, MATH等常见标准化测试的表现。通过算法迭代,持续提高模型在理解、推理和泛化能力方面的分数。3. 训练稳定性与效率提升: 负责分析和解决超大规模训练过程中的数值不稳定、梯度爆炸/消失等问题,引入和实现如混合精度训练、梯度裁剪、学习率调度等优化策略。4. 前沿技术追踪与转化: 紧密追踪全球LLM预训练、Scaling Law、新型优化器(如AdamW、Lion)等最新研究进展,评估并将业界顶尖的算法创新快速转化为我们的核心竞争力。5. 跨团队协作: 与数据工程师紧密合作,分析数据对预训练效果的影响,并与系统/硬件工程师协作,共同调优底层计算资源以实现最高训练吞吐。职位要求1. 教育背景: 计算机、人工智能、数学、物理等相关专业硕士及以上学位,有顶级会议(如NeurIPS, ICML, ICLR, AAAI等)论文发表经验者优先。2. 核心算法理解: 深入理解Transformer、GPT、LLaMA、Qwen等主流模型架构的底层数学原理与训练细节,对Linear attention、RMSNorm、DynamicTanh (DyT)、Mixture of Experts (MoE)等关键模块有独到见解。3. 分布式训练实战经验: 具备主导或深度参与LLM预训练的实际经验,熟悉PyTorch、DeepSpeed、Megatron-LM等分布式训练框架。4. 专业素养: 具备严谨的实验设计和结果分析能力,能够主动发现并解决训练过程中的复杂算法问题。5.
Company Description Do you want beneficial technologies being shaped by your ideas? Whether in the areas of mobility solutions, consumer goods, industrial technology or energy and building technology - with us, you will have the chance
Company IntroductionPolymer Capital Management is a market-neutral, multi-manager investment platform focused on Asia. Our goal is to discover and nurture the regions top investment talent by combining established institutional support with extensive local financial market knowledge.
Job Posting Title:Broadcast Marketing Executive (Asset Library, 1-year contract) Req ID:10148934 Job Description: We are seeking a detail-oriented and organized professional to manage our Broadcast Marketing digital asset library and support post‑production activities. This role works
Job Posting Title:Broadcast Marketing Executive (Production, 1-year contract) Req ID:10148726 Job Description: We are looking for a highly organized and collaborative Broadcast Marketing Executive to support the planning and execution of video, photo, and media productions
职务目标 完成预定产量,保证生产任务及时有效的完成,达到最佳质量标准。 任职资格/技术水平要求 初中及以上文化程度,有机械背景者优先。 具有一定的生产或装配经验。 能够并愿意同其他操作工合作在装配线工作。 具有一般生产常识。 接受能力强。 工作职责 严格按照作业指导书进行正确装配,并保证组装质量。 按时完成产量并不断提高劳动效率。 按照作业指导书操作机器/装置,并对所使用的机器、设备进行日常的清理和维护。 操作前应对物料进行首件确认,操作中进行自检和互检。 保证本工位物料存放整齐,不合格的部件随时标识好并放置于工位红筐中。对于纸片垃圾随时放置于工位垃圾箱中。 以公司规定制度、员工手册、安全条例规范自己的行为。 We Don’t Just Build The World, We Build Innovative Technology Too. Joining the Stanley Black & Decker team means working in
Position Summary: Stanley Black & Decker’s Global Strategic Sourcing organization has an opportunity for Lead Category Management role. This position will be responsible for managing the commercial strategic direction for the BLACK+DECKER® Home Product and Vacuum/Dust
PLAY, GROW and WIN To be a part of Virtuos means to be a creator. At Virtuos, we harness the latest technologies to make games better and more immersive than ever before. That is why we
Company description Ferrero is a family-owned company with a truly progressive and global outlook and iconic brands such as Nutella®, Tic Tac®, Ferrero Rocher®, Raffaello®, Kinder Bueno® and Kinder Surprise®. As the love for our brands
LLM基座模型算法实习生 深圳、上海 实习 互联网 / 电子 / 网游 职位描述 职位描述1、负责大模型(LLM: Large Language Model)在人形机器人中的算法设计与开发,将LLM应用于人形机器人的对话、环境感知与人机交互任务;2、深度参与大语言模型从预训练到后训练的全流程工作,并通过数据合成等技术打造高质量的训练数据集;3、与机器人平台团队、硬件团队紧密协作,实现模型在实际机器人系统中的高效运行;4、跟踪前沿研究,探索学术边界,在充足的资源支持下开展技术预研与实验,推动新技术在产品中的落地应用。 职位要求 职位要求1、计算机、人工智能、自动化等相关专业硕士及以上学历;2、具备扎实的深度学习基础,熟悉Transformer、BERT、ViT、CLIP等主流视觉-语言模型架构;3、有大语言模型(LLM)的训练/微调/推理优化经验,熟悉其在下游任务中的应用;4、熟练使用Pytorch深度学习框架,具备良好的工程能力和代码实现能力;5、良好的团队协作与沟通能力,具备快速学习和解决问题的能力。加分项:1、有参与实际LLM预训练或后训练(SFT、DPO/RLHF)的相关项目经验;2、有使用大语言模型进行数据合成(Data synthesis)的相关经验;3、对大语言模型的预训练/后训练数据集有一定了解,有处理Trillion级别数据集的相关经验;4、熟悉开源大模型生态(如Qwen, Llama等),对大模型的模型结构优化(如linear attention,Mixture-of-Expert等)有一定了解。 投递...
Software Engineer Intern - Generalist 费利蒙 实习 职位描述 DescriptionFounded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at
Research Intern- Deep Learning 费利蒙 实习 职位描述 DescriptionFounded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a
(Senior) Software Engineer, Deep Learning 费利蒙 全职 职位描述 DescriptionFounded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at
LLM基座模型算法实习生 深圳、上海 实习 互联网 / 电子 / 网游 27届AI人才专项计划(实习生专场) 职位描述 职位描述1、负责大模型(LLM: Large Language Model)在人形机器人中的算法设计与开发,将LLM应用于人形机器人的对话、环境感知与人机交互任务;2、深度参与大语言模型从预训练到后训练的全流程工作,并通过数据合成等技术打造高质量的训练数据集;3、与机器人平台团队、硬件团队紧密协作,实现模型在实际机器人系统中的高效运行;4、跟踪前沿研究,探索学术边界,在充足的资源支持下开展技术预研与实验,推动新技术在产品中的落地应用。 职位要求 职位要求1、计算机、人工智能、自动化等相关专业硕士及以上学历;2、具备扎实的深度学习基础,熟悉Transformer、BERT、ViT、CLIP等主流视觉-语言模型架构;3、有大语言模型(LLM)的训练/微调/推理优化经验,熟悉其在下游任务中的应用;4、熟练使用Pytorch深度学习框架,具备良好的工程能力和代码实现能力;5、良好的团队协作与沟通能力,具备快速学习和解决问题的能力。加分项:1、有参与实际LLM预训练或后训练(SFT、DPO/RLHF)的相关项目经验;2、有使用大语言模型进行数据合成(Data synthesis)的相关经验;3、对大语言模型的预训练/后训练数据集有一定了解,有处理Trillion级别数据集的相关经验;4、熟悉开源大模型生态(如Qwen, Llama等),对大模型的模型结构优化(如linear attention,Mixture-of-Expert等)有一定了解。 投递...
Build a career with confidence Carrier Global Corporation, global leader in intelligent climate and energy solutions is committed to creating solutions that matter for people and our planet for generations to come. From the beginning, weve
Astera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizations to unlock the full potential of modern AI. Astera Labs’ Intelligent Connectivity Platform
缺补 We Don’t Just Build The World, We Build Innovative Technology Too. Joining the Stanley Black & Decker team means working in an innovative, tech-driven and highly collaborative team environment supported by over 43,500 professionals in
Job Group Capability Process Engineering & Products Job Group Process Engineering Job Role Description The Intermediates & Synthetics (I&S) Principal Engineer combines strategic leadership with deep expertise in their technology disciplines in areas such as linear alpha