Position:- Linux Monitoring Engineer
Location : Pleasonton, CA
Duration : 12 mths+
Monitoring Engineer
Responsible for maintaining and improving a high-volume monitoring and metrics infrastructure based
on Nagios for monitoring, and collectd, collection3 and Graphite for metrics. Typical tasks involve
maintaining and adding to Nagios configurations, creating custom monitoring scripts and building
out bleeding-edge open-source tools such as mod_gearman and Thruk. Work also involves system
performance analysis and communicating regularly with project and support teams to develop new
monitors and keep the monitoring system in line with code updates and site developments.
Required skills:
• Strong unix scripting. Bash and Perl are must-haves, any skill with Ruby is a big plus.
• Strong linux system administration (Redhat). Solid ability to troubleshoot a variety of performance
issues and other problems, from slow disk response to running out of file descriptors.
• Strong understanding of init scripts, creating new filesystems, /proc and other sysadmin essentials.
• Experience with monitoring. Nagios experience is a huge plus, but experience with another major
monitoring system will suffice.
• Must know the in and outs of creating monitors, setting thresholds and sending notifications.
• Experience with any metrics application, such as RRDTool, Cacti or MRTG.
• Chef config experience with Nagios will be plus.
• Able to troubleshoot network problems from the system perspective, such as identifying dropped
packets or high latency that traces back to a particular network device. Ability to capture and
analyze network traffic across a particular interface is a big plus.
• Familiarity with SNMP. Understanding of how to use snmpget and snmpwalk and work with the output.
• Ability to work closely with development and support groups to define new monitoring needs
• Strong unix scripting. Bash and Perl are must-haves, any skill with Ruby is a big plus.
• Hands on experience with Chef. Building Nagios configs using Chef is a plus.
Good-to-have skills:
• Ability to create a custom rpm (big plus)
• Experience with RRDs.
• Apache administration experience. Able to define a new virtual host and handle other basic
configuration updates.
• Experience with subversion. Able to check in/out, check logs, perform merges, etc.
• Familiarity with SQL. Able to run remote queries against a database and incorporate such queries into a
script.
• An understanding of yum repositories.
You received this message because you are subscribed to the Google Groups "CVMSCRM" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cvmscrm+unsubscribe@googlegroups.com.
To post to this group, send email to cvmscrm@googlegroups.com.
Visit this group at http://groups.google.com/group/cvmscrm.
For more options, visit https://groups.google.com/d/optout.
No comments:
Post a Comment