Job Description

Midea is a leading Global Fortune 500 high technology company. Committed to bring innovation to life, Midea Group aims to develop solutions for our international customers that not only meet the requirements of the present, but also the challenges of tomorrow and the day after tomorrow. Midea develops, produces, and sells innovative products in five competence areas: Smart Home, Industrial Technologies, Building Technologies, Robotics & Automation, and Digital Innovation. We operate in more than 195 countries with over 150,000 employees. Midea embraces talents of characters of aiming high, customer first, transformation and innovation, inclusion and partnership to join the mission together to integrate with the world, and to inspire your future!

AI empowers Midea by driving innovation across its diversified applications, enhancing efficiency, intelligence, and user experience. From industrial robotics and automation, smart home appliances, intelligent manufacturing, health care sector and energy solutions, AI enables Midea to optimize operations, improve product performance, and deliver personalized experiences. Through cutting-edge machine learning, computer vision, and IoT integration, Midea leverages AI to stay at the forefront of technological advancement, creating a smarter and more connected world.

We are searching for innovative AI Engineers to join our AI Research Center in Shanghai, China , and help drive Midea’s AI journey to the next level. While the role is based in Shanghai, we’ve posted the opportunity in the U.S. as well to engage with top AI talent.

Job Description

1. Efficient Implementation of Reinforcement Learning Training and Development of a Unified and High-Performance Reinforcement Learning Training Framework.

2.Develop reinforcement learning algorithms for large language models to enhance training efficiency during the reinforcement learning phase and improve reasoning capabilities in natural sciences such as mathematics and coding.

3. Develop reward and evaluation models, including fine-grained process supervision and reward modeling, covering tasks such as complex reasoning and instruction following.

4. Participate in Scaling Law research during post-training and inference stages, including reward model training, reinforcement learning training, and inference phase Scaling Law analysis.

Job Requirements

1. Master’s degree or PHD in Computer Science or a related field.

2. Research experience in large language models, with hands-on training experience in post-training, and familiarity with reward model modeling and mainstream reinforcement learning algorithms such as PPO, REINFORCE, and RLOO.

3. Strong algorithm engineering skills, proficiency in Python programming, and experience with the PyTorch deep learning framework. Familiarity with mainstream distributed training frameworks such as DeepSpeed and Megatron.

4. Strong analytical and problem-solving abilities, excellent engineering practices, and the ability to think independently and solve real-world problems.

5. Strong teamwork and communication skills, with the ability to collaborate closely with engineering, business, product, and technical teams.

Preferred Qualifications

1. Research or practical experience in large language models and machine learning, with high-quality publications in top international conferences/journals.

2. Research or practical experience in big data processing, large-scale distributed computing, and distributed training.

3.Working Proficiency level Chinese is highly preferred.

Midea Corp, is an equal opportunity employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Job Tags

Similar Jobs

Venture Smarter

Network Administrator Job at Venture Smarter

...response, you are welcome to re-apply after six months for another suitable position within our company. ****_ Venture Smarter has been featured in media outlets such as CBS News, Digital Journal, and Go Banking Rates. Check us out youll know were the place to be....

Airport and Group Transportation

Class B CDL Driver with passenger endorsement P2. Start today Job Job at Airport and Group Transportation

Class B CDL Driver with passenger endorsement P2. Start today JobWe manage large corporate groups that come to Colorado for meetings. We also handle weddings and school functions.Most trips are to and from airport. Need help immediately.We are looking for professional...

Guetterman Financial Group, LLC

Looking for Licensed life insurance Agents - Remote position Job at Guetterman Financial Group, LLC

...Are you an agent who has yet to master virtual sales? Or perhaps a great sales professional... ...The Wells Agency offers agents a turnkey insurance sales method. Why Work with The Wells... .... We specialize in UL's, Term, Whole Life, Annuities with a heavy emphasis on using...

Celltrio

Software Engineer Job at Celltrio

...time-to-market and costs while improving precision and quality. Celltrio's range of stand-alone manual cell storage, culturing, and harvesting machines can easily be automated for more efficient lab operations. Role Description This is a full-time on-site...

Spectrum Brands

Senior Developer, EDI Job at Spectrum Brands

Job TitleSenior Developer, EDI Job #US19636 Requisition TypeRegular Function State/ProvinceUnited States Remote CityMiddleton, WI Region US Posting Start DateJun-11-2025 Division InformationSpectrum Brands global...

LLM Algorithm Engineer/Senior Engineer/Principle Engineer Job at Midea Group, San Jose, CA

THRGanRxUmRBR2oyeDEwRHNWM3d1dFdOTEE9PQ==