Software Engineer Large Model System Graduate (Machine Learning Sys-US) - 2024 Start (BS/MS)
Thegradcafe•Jackson, MS
8 days ago
Qualifications
- Successful candidates must be able to commit to a start date before the end of 2024
- Please state your availability and graduation date clearly in your resume
- Minimum Requirements: Graduate with a background in Computer Science, a related technical field, or equivalent industry experience
- Strong command of distributed and parallel computing principles; knowledge of recent advances in computing, storage, networking, and hardware technologies
- Familiar with state-of-the-art machine learning algorithms and mainstream platforms (e.g., TensorFlow, PyTorch, MXNet)
- Proficient in at least one programming language in a Linux environment, such as C/C++, Go, or Python
- Familiar with NLP and CV algorithms and technologies, with experience in large model training
- Experience in GPU-based high-performance computing
- Demonstrated relevant technical experience through previous internships, work experience, coding competitions, or publications
- Curiosity about new technologies and entrepreneurship
- High levels of creativity and quick problem-solving capabilities
Benefits
- As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas, and explore limitless growth
Responsibilities
- The team is responsible for the research and development of hardware acceleration technologies for cloud computing, drawing on distributed systems, compilers, HPC, and RDMA networking
- Develop machine learning systems for the company's large-scale models, and research new applications and solutions for related technologies in areas such as search, recommendation, advertising, content creation, conversation, and customer service, to meet users' growing demand for intelligent interaction and improve how they live and communicate
- Design and develop the architecture of large-scale machine learning systems, addressing technical challenges such as high concurrency, high reliability, and high scalability
- Work across the sub-areas of machine learning systems, including resource scheduling, model training, model inference, data management, and workflow
- Iterate on and develop the system based on customer-driven scenarios
Related Internships
Software Engineer Intern
Powin Corporation•Jackson, MS
7 days ago
Software Engineer Intern - Global Industries
Oracle•Jackson, MS
2025 Software Engineer Intern - Ocean Springs MS
Northrop Grumman•Ocean Springs, MS
4 days ago