High Performance Computing Systems Engineer

📁
Information Technology
💼
NE-NERSC
📅
82480 Requisition #

Berkeley Lab is Bringing Science Solutions to the World, and YOU can be a part of it!

In the world of science, Lawrence Berkeley National Laboratory (LBNL) is synonymous with "excellence." That's why we hire the best - whether in research, science or operations. This is a great opportunity to bring your top-notch skills to bear in support of world-class scientific research that addresses national and global challenges!


Position Summary:

The National Energy Research Scientific Computing Center (NERSC) at Berkeley Lab is looking for exceptional and motivated individuals to join our Computational Systems Group.

NERSC provides world-class supercomputing, high performance, scalable data systems, and services to 6,000+ users worldwide who conduct ground-breaking research.  We support the U.S. Department of Energy Office of Science mission to deliver scientific discoveries that transform our understanding of nature, advance the energy, economic, and national security of the United States. We seek out individuals to be part of a team that helps architect, deploy, configure, and maintain scalable and distributed systems of high complexity. As a member of our team you will contribute to the development of innovative solutions for supporting science at scale on the next generation of supercomputers.

You will also be called upon to diagnose and resolve difficult problems in the configuration, tuning, and management of very large distributed and tightly coupled systems. You will be working in a challenging environment, defined by scale and complexity, in a dynamic field being revolutionized by low-power, many-core advanced computer architectures. Using your unique set of knowledge, skills and experience, you will help research, evaluate, and develop new technologies to support NERSC’s mission of accelerating scientific discovery through high performance computing and data analysis into the next decade.  Furthermore, your contributions will enable NERSC to architect, procure, deploy and operate some of the largest computing and storage systems in the world.  


NERSC is distinguished by its success in creating an environment that makes these resources effective for scientific research. Because of our team, NERSC systems are reliable and secure, and provide a state­ of­ the ­art scientific development environment with the tools needed by the diverse community of NERSC users. We offer scientists technology and services that empower them to be more effective researchers.

This position will be filled at a level 3 or 4, dependent on experience.


As a High Performance Computing Systems Engineer (Level 3), you will:

  • Lead (or collaborate on) systems programming projects to maintain and enhance system functionality, in areas such as large systems monitoring, systems and workload management and file systems and I/O subsystems.

  • Develop and use tools to manage task automation on the computational systems.

  • Work independently and as part of the Systems Group to diagnose and fix system problems, help analyze system issues and develop and implement workarounds and/or patches for software bugs.

  • Install, test, maintain and manage the NERSC computational systems.

  • Assist with technology evaluation of systems and system architecture to provide input for HPC system procurements and DOE technology roadmaps out past the next decade.

  • Work with vendors to prioritize, develop and enhance their technologies in order to better meet the needs of our users.

  • Be part of a team that provides 24x7 systems support.

In addition to the above, the High Performance Computing Systems Engineer (Level 4) will:

  • Provide leadership and technical guidance to group members, members of other groups at NERSC.

  • Recommend and lead implementation and deployment efforts for system improvements that enhance reliability, stability, usability, performance and security.

  • Identify and evaluate emerging HPC technologies and explore new features that would create new capabilities and enhance system performance and usability.

  • Participate in working/user/advocacy groups and represent NERSC and its interests to the broader HPC community.


High Performance Computing Systems Engineer (Level 3) - Required Qualifications:

  • Bachelor of Science degree in Computer Science, Computational Science or equivalent experience and/or a minimum of 8 years of UNIX or Linux experience.

  • Minimum of 4 years of experience with systems programming or management of large­-scale UNIX based systems in a high-­performance computing (HPC) or warehouse-­scale computing (WSC) environment.

  • Strong technical and collaboration skills needed to create and deploy innovative ways of allowing our diverse user base to effectively utilize the unique resources that NERSC provides.

  • Experience with C and shell/PERL/Python systems programming as well as with processor, interconnect, and storage technologies for High Performance Computing systems.

  • Familiarity with UNIX/Linux internals.

  • Experience with installation, configuration, monitoring, and tuning of workload management systems such as SLURM or Torque.

  • Demonstrated ability to work independently as well as  collaboratively in large projects, and contribute to an active intellectual environment.

  • Excellent oral and written communication skills.

  • Understand user needs and show initiative, tact and good judgment in developing solutions to problems.

In addition to the above the High Performance Computing Systems Engineer (Level 4) will have:

  • Minimum of 12 years UNIX or Linux experience with a minimum of 6 years experience with the management of large-­scale UNIX based systems in a HPC or WSC environment.

  • Demonstrated excellent systems programming skills and strong knowledge of UNIX/Linux internals.

  • Demonstrated ability to successfully lead complex projects.


The posting shall remain open until the position is filled.

NOTES:  This is a career appointment. This position requires completion of a background check.


Classification will depend upon the applicant's level of skills, knowledge, and abilities.

Salary is commensurate with experience. There are 2 openings for this position.


Berkeley Lab addresses the world’s most urgent scientific challenges by advancing sustainable energy, protecting human health, creating new materials, and revealing the origin and fate of the universe. Founded in 1931, Berkeley Lab’s scientific expertise has been recognized with 13 Nobel prizes. The University of California manages Berkeley Lab for the U.S. Department of Energy’s Office of Science.


Equal Employment Opportunity: Berkeley Lab is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or protected veteran status. Berkeley Lab is in compliance with the Pay Transparency Nondiscrimination Provision under 41 CFR 60-1.4.  Click here to view the poster and supplement: "Equal Employment Opportunity is the Law."

Previous Job Searches

My Profile

Create and manage profiles for future opportunities.

Go to Profile

My Submissions

Track your opportunities.

My Submissions

Similar Listings

HR-Human Resources

Bay Area, California

📁 Information Technology

Requisition #: 91561

EG-Engineering

Bay Area, California

📁 Information Technology

Requisition #: 82271

EB-Environ Genom & Sys Bio Div

Bay Area, California

📁 Information Technology

Requisition #: 83384

Equal Employment Opportunity: Berkeley Lab is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or protected veteran status. Berkeley Lab is in compliance with the Pay Transparency Nondiscrimination Provision under 41 CFR 60-1.4. Click here to view the poster and supplement: "Equal Employment Opportunity is the Law."

 

Privacy & Security Notice | LBNL is an E-verify Employer | Contact Us


The Lawrence Berkeley National Laboratory provides accommodation to otherwise qualified internal and external applicants who are disabled or become disabled and need assistance with the application process. Internal and external applicants that need such assistance may contact the Lawrence Berkeley National Laboratory to request accommodation by telephone at 510-486-7635, by email to accommodation@lbl.gov or by U.S. mail at EEO/AA Office, One Cyclotron Road, MS90R-2121, Berkeley, CA 94720. These methods of contact have been put in place ONLY to be used by those internal and external applicants requesting accommodation.