
Senior Platform Operation Manager - VP
- Glasgow
- Permanent
- Full-time
- The Snowflake/ Postgres Customer Engagement Team (CET) is part of the Enterprise Computing Data Services Organization in Morgan Stanley. It is part of the Data & Analytics Technology (DAT) fleet, responsible for managing mission critical distributed database platforms like Snowflake, Postgres and Greenplum on public-cloud and on-prem
- The successful candidate will be also be designated incident and escalation manager for the global production Data and Analytics infrastructure during EMEA time zone.
- The person will also lead run-the-bank type of projects such as data center migration , plantwide version upgrade , release management , plant automation, database design and architecture, performance monitoring and optimization.
- In addition, the person would also participate at least one squad as SRE, following Agile practice and contributing to the infra modernization and automation.
- 10+ years of overall enterprise level IT experience.
- Strong domain expertise related to distributed database platforms both on-prem/cloud like Snowflake /Postgres or Greenplum.
- Strong shell scripting and python programming skills for SRE related work.
- Advanced Linux / Unix skills
- Experience on using Splunk OR Grafana/Prometheus/Loki stack
- General understanding of Project Management , Database design and architecture , Data Integrity and security , Disaster recovery and backup.
- Knowledge on Agile methodologies
- Effective oral and written communication skills, and interpersonal skills to work well in a team environment required.
- Strong organizational and coordination skills with the ability to manage multiple tasks and high-pressure situations for outage handling, management, or resolution.
- Strong Incident Management Skills with proper understanding of ITIL procedures.
- Be available for weekend work.
- Deploy Optimize and manage enterprise scale distributed database platforms like Greenplum , Snowflake/Gen AI and Postgres.
- Respond to incidents, troubleshoot issues, and conduct root cause analysis.
- Design, implement, and maintain disaster recovery and high-availability solutions.
- Automate plant wide operational tasks related to provisioning, monitoring, backups, scaling, and recovery.
- Monitor system health, identify performance bottlenecks, and implement optimizations.
- Collaborate with development teams to support schema design, query optimization, and database best practices.
- Ensure data security, compliance, and access controls are enforced.
- Participate in on-call rotations and incident response.
- Experience with database deployment, upgrades, backup/restore, and schema management in production environments
- Proficiency in database monitoring, performance tuning, and troubleshooting
- Familiarity with distributed/OLTP/OLAP database environments deployed on-prem/cloud like Greenplum / Postgres and Snowflake.
- Familiarity with cloud platforms (AWS, Azure) and cloud-native databases
- Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible, CloudFormation)
- Automation and configuration management using scripting languages (Python, Bash, etc.)
- Setting up and using monitoring, logging, and alerting tools (Prometheus, Grafana, ELK/EFK, Datadog, etc.)
- Understanding of Service-Level Indicators/Objectives/Agreements (SLI/SLO/SLA)
- Designing and implementing HA/DR solutions (failover, automated recovery, geo-replication)
- Running and reviewing disaster recovery drills
- Incident response and on-call support for database outages or performance issues
- Root cause analysis and post-mortem writing
- Capacity planning and scaling distributed systems
- Change management and production rollout best practices
- Experience with container orchestration (Kubernetes, Docker) for database workloads
- Familiarity with CI/CD pipelines and database migration automation
- Knowledge of regulatory compliance (GDPR, HIPAA) as it pertains to data storage and handling
- Strategic thinking and problem-solving.
- Familiarity with modern data architectures and cloud services.
- Strong organizational and documentation skills
If this role is deemed a Certified role and may require the role holder to hold mandatory regulatory qualifications or the minimum qualifications to meet internal company benchmarks.Flexible work statement
Interested in flexible working opportunities? Morgan Stanley empowers employees to have greater freedom of choice through flexible working arrangements. Speak to our recruitment team to find out more.Morgan Stanley is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.