IBBR’s IT environment provides a complex array of services in support of our research mission. Most critically, we support a medium-sized high-performance computing (HPC) cluster that also serves as a virtual Linux desktop for visualizing biomolecules. We have two data centers on our campus in Rockville, MD, which provide highly available and secure hosting for over 1 PB of research data, websites, and databases. We are also responsible for data center networking, wireless, IP telephony, Active Directory, and many file servers. This position will be responsible for the maintenance of all infrastructure systems and will help architect our future growth alongside the Director of Information Technology, to whom this position reports. Responsibilities will span a wide range of projects, including day-to-day management of the HPC cluster, planning for its future growth and improvement as needed, managing the virtual server infrastructure and the block and file storage arrays, and supporting many of the virtual machines provided to users and IT staff. This position will work alongside the desktop support team and the software development team in the management of IBBR’s IT department. This position will be heavily involved in planning, developing project plans, and documentation. As an exempt position, the Systems Administrator may occasionally be required to work off-hours or weekends during roll-outs and repairs.
Bachelor’s degree in computer science, engineering, information technology, or a related field, OR an equivalent combination of education and relevant work experience. Demonstrated experience with Linux, command-line operations, and either Bash or another scripting language. Some experience with Windows Server maintenance. Solid understanding of TCP/IP networking. The demonstrated ability to research new technical projects, gather requirements, and work with a high degree of independence is required. Must be able to communicate complex topics to non-technical audiences. Must be able to plan and schedule own work and to set and meet deadlines. Must be able to continually learn and acquire new skills. A strong ability to ‘engineer’ solutions by asking questions and determining the root needs of a project, while evaluating best practices and long-term support for the deployed project, is required. The ability to develop projects that fit into the existing portfolio while meeting customer needs is required. Demonstrated experience documenting work and creating documentation for users.
- Experience with any of the following: ZFS, BeeGFS, DRBD, Apache/Nginx, Slurm, Red Hat Linux, Microsoft Server, Nagios, Puppet;
- Experience with HPC clusters, especially Slurm;
- Experience with Cisco products such as Nexus switches, CallManager, Unity, and WLC wireless.
Predominantly operates in an office environment. Some periods of time may be spent in computer rooms and communications equipment spaces for installation, testing, and troubleshooting. Some work (less than 5%) requires moving and lifting heavy networking equipment.
BEST CONSIDERATION DATE
HOW TO APPLY
Please visit https://ejobs.umd.edu/postings/81162