注册一亩三分地论坛,查看更多干货!
您需要 登录 才可以下载或查看附件。没有帐号?注册账号
x
Looking for E3 engineers, position is based in Sunnyvale.
. The Ads ML Serving team is in the early phases of building a PyTorch 2.0 based model inference stack that unifies support for GPU, CPU and MTIA hardware targets, delivers state-of-art inference performance and supports the newly emerging complex ranking, GenAI and content understanding model architectures for Ads. On this team you would have the opportunity to bring complex AI Models to production by working vertically across the model transformation, compiler and runtime stack. Some of the things you can expect to work on: - Develop ML runtime features and capabilities for improving model inference performance for multi-core CPUs, GPUs and MTIA. - Dive deep into CPU and GPU backends of PyTorch 2.0 Inductor compiler to develop new capabilities, fix performance issues and optimize model inference latency and throughput - Develop specialized custom CPU and GPU kernels for key performance critical operations in the model - Work on a novel PyTorch python based model inference stack for accelerated production model deployments - Develop model graph transformation passed for optimizing model inference performance.
Responsibilities:
- Assist in the collaboration with ML researchers and applied scientists to help in the productionization of model training or inference..1point3acres
- Participate in the design, planning, and execution of technical projects with guidance.. Χ
- Contribute to the development of software for AI acceleration (GPU, ASICs, CPU) and optimizations under supervision..google и
- Support efforts to ensure quality and engineering excellence in all deliverables.. Χ
- Communicate effectively, both verbally and in writing, with team members.
Requirements:
- Familiarity with AI/ML frameworks (PyTorch, TensorFlow, JAX, etc.), compilers, runtimes, or tooling.. Χ
- Experience or demonstrated interest in collaborating with ML researchers and applied scientists to support the productionization of model training or inference.
- Basic understanding or experience in AI acceleration (GPU, ASICs, CPU) software development and optimizations.. Χ
- Willingness to learn and contribute to the design, planning, and execution of technical projects.
- Commitment to quality and engineering excellence.
Good written and verbal communication skills.
有意者请将简历和一段第三人称自我介绍发到 您好! 本帖隐藏的内容需要积分高于 10 才可浏览 您当前积分为 0。 使用VIP即刻解锁阅读权限或查看其他获取积分的方式 游客,您好! 本帖隐藏的内容需要积分高于 10 才可浏览 您当前积分为 0。 VIP即刻解锁阅读权限 或 查看其他获取积分的方式 , 并在贴子下回复并加米。
补充内容 (2024-02-20 09:04 +08:00):
. From 1point 3acres bbs
另外在邮件里写一下有几年工作经验(new grad也可,仅refer需要)
补充内容 (2024-02-20 09:11 +08:00):
更正下,不是自我介绍,而是why meta should hire you,in 3rd person |