Enterprise Systems Management Engineer
The Enterprise Systems Management (ESM) Engineer works as a member of the team responsible for managing and maintaining the enterprise monitoring tools and processes for a large-scale, high-availability, 24x7 production environment. The engineer works closely with the Network Operations Center (NOC), operational support, and development teams to implement and maintain active monitoring and performance metrics collection across the enterprise. The primary role is to administer, develop, test, document, and deploy monitoring solutions throughout the enterprise.

RESPONSIBILITIES:
Configure, manage, and maintain Enterprise Systems Management tools including Remedy ARS, Micromuse Netcool, Concord eHealth, HP OpenView Network Node Manager, and Mercury SiteScope.
Work with operational support teams (systems engineering, network engineering, DBAs, application support) and application development teams to identify and document monitoring and measurement requirements.
Work with support and development teams to design and implement monitoring and measurement solutions.
Work with the NOC to identify and document monitoring requirements and processes.
Design and implement solutions to support NOC requirements.
Deploy integrations between Remedy, Netcool, HP Network Node Manager, and Crystal Reports to facilitate better event handling, operational process flow, and reporting.
Collect and analyze monitoring and performance metrics.
Create and generate reports from monitoring and metric data for use in managing and improving operations.
Develop, test, and document operations procedures including installation, maintenance, restart/recovery, monitoring, and troubleshooting.
Perform ongoing revision and testing of established procedures.
Investigate, analyze, and resolve technical issues, and actively pursue mechanisms for preventing recurrences or automating the response to them.
Follow a structured methodology for implementing system changes, configuration modifications, active monitoring, and collection of performance metrics.
Communicate standards, methodologies, and processes to the NOC, development teams, and operational support teams.
QUALIFICATIONS:
Bachelor's degree in a related field or equivalent experience.
1+ years of UNIX operations/administration experience, ideally in a Sun Solaris and/or Linux environment.
Experience with multiple core technologies, including Oracle RDBMS, IP networking, and Internet technologies.
Experience with monitoring tools, automation of tasks, and root cause problem resolution required.
Significant shell/Perl script development experience required.
Experience with one or more of the following ESM tools:
  Micromuse Netcool (Omnibus, Impact, SSM, WebTop, ISM) or comparable tool
  HP OpenView Network Node Manager / IBM NetView or comparable tool
  Remedy ARS (Problem, Change, Asset Management, Approval Server)
  Concord eHealth (Network, System, SysEDGE Agents)
  Mercury SiteScope / Topaz
Experience with the following tools is a plus:
  Bugzilla
  CiscoWorks 2000
  Foundry IronView
  Crystal Reports
  TelAlert
Must have excellent written, verbal, and presentation skills.
Excellent analytical and troubleshooting skills, flexibility, responsiveness, creativity, initiative, and the ability to plan and organize.
Ability to produce accurate, intelligible technical and procedural documentation.
Strong desire to learn and work with multiple applications, tools, and technologies.
Willingness and desire to learn new systems and applications quickly.

The following experience is a plus:
Experience with systems, network, firewall, and security monitoring and metrics collection.
Experience with Oracle database operations/administration.
Experience in Java and JSP software development.
Experience installing, configuring, and operating Apache web servers, as well as developing and deploying applications under Weblogic, JBOSS, and/or Jakarta-Tomcat web/application servers.