Overview:
We are seeking an experienced IT Operations and Migration Specialist to provide expertise and guidance on various aspects of infrastructure migration and ongoing operations support. This role is critical for ensuring smooth data handling, effective monitoring, managing incidents during critical phases, and coordinating cut-over activities.
Responsibilities:
- Provide expertise and guidance on importing data into the Configuration Management Database (CMDB).
- Lead the management of incidents during the hypercare period, ensuring timely resolution and minimal disruption.
- Ensure servers eligible for patching are properly onboarded and maintained.
- Continuously review the migration process, prioritize changes, and implement improvements.
- Manage the Requirements Traceability Matrix (RTM) for migration, ensuring all prerequisites and server build checks are completed.
- Monitor migration status and provide operational support as needed.
- Ensure the move event runs according to schedule during the cut-over weekend.
- Oversee the onboarding of infrastructure monitoring to New Relic.
- Ensure application monitoring is onboarded to New Relic.
- Validate agents to be used during migration events and create call bridges, as necessary.
- Validate IT operation teams with schedules and pass sheets created.
- Transition infrastructure support to IT operations.
- Plan user access for the future state of the infrastructure.
- Transition non-standard infrastructure to standard support functions.
Requirements:
- Bachelor's degree in Information Technology, Computer Science, or related field.
- Proven experience in IT operations and migration projects.
- Strong knowledge of infrastructure and application monitoring tools (e.g., New Relic).
- Excellent problem-solving and incident management skills.
- Ability to work collaboratively with cross-functional teams.
- Strong communication and organizational skills.
Preferred Skills:
- Experience with CMDB data import and server patching processes.
- Familiarity with hypercare incident management and major incident resolution.
- Knowledge of Requirements Traceability Matrix (RTM) management.
- Experience in validating infrastructure agents and creating call bridges.