Cloud Operations Specialist

HTC Global Services


Date: 4 hours ago
City: Dearborn, MI
Contract type: Full time
Job Description:

Make a difference

HTC Global Services wants you. Come build new things with us and advance your career. At HTC Global you'll collaborate with experts. You'll join successful teams contributing to our clients' success. You'll work side by side with our clients and have long-term opportunities to advance your career with the latest emerging technologies.

At HTC Global Services, our consultants have access to a comprehensive benefits package. Benefits can include paid time off, paid holidays, 401(k) matching, life and accidental death insurance, short- and long-term disability insurance, and a variety of other perks.

Our team focuses on Cloud Data Messaging Services, which provide platform services to modern, scalable, decoupled, and resilient cloud-native applications. Currently, Cloud Data Messaging Services leverage technologies like GCP Pub/Sub and Confluent Kafka Cloud. This position, Cloud Data Messaging Services Specialist for GCP Pub/Sub and Confluent Kafka Cloud, requires staying abreast of the continual evolution of cloud data messaging technologies and understanding how GCP messaging services like Pub/Sub, alongside Kafka, integrate with other native services like Cloud Run, Dataflow, etc., within the new Standard app hosting environment to meet customer needs.

  • Develop, improve, and support Infrastructure as Code (IaC) practices.
  • Provide highly scalable and available infrastructure.
  • Implement and enhance SRE practices.
  • Develop, integrate, and enhance automations for the services and their management.
  • Enable real-time data processing and event-driven architectures.
  • Collaborate with many application DevOps teams, as well as product vendors.
  • Develop automated processes to simplify the adoption and improve experiences for application development teams.
  • Identify opportunities for adopting new data streaming technologies and patterns to solve existing needs and anticipate future challenges.
  • Create and maintain Terraform modules and documentation for provisioning and managing Pub/Sub topics/subscriptions, Kafka clusters, and related networking configurations, often with a paired partner.
  • Improve continuous integration tooling by automating manual processes within the delivery pipeline for messaging applications and enhancing quality gates based on past learnings.
  • Monitor application logs, metrics, and alerts to proactively identify and resolve issues in cloud resources and infrastructure, ensuring high uptime and optimal performance.
  • Ensure cloud systems are configured correctly, run efficiently and remain secure against potential threats. Ensure the availability and reliability of cloud services on public and private cloud platforms through proactive monitoring and incident response.
  • Own and drive end-to-end technical resolution of critical incidents, which may require involvement from multiple parties, and ensure the right collaboration and communication.
  • Implement disaster recovery and backup strategies to protect critical data and configurations and perform periodic full-scale tests to verify plans and make improvements.
  • Ensure compliance with industry regulations and maintain clear and up-to-date documentation for cloud infrastructure and procedures.
  • Recommend solutions to improve availability, performance, incident resolution, observability and supportability.

Skills Required:

  • GCP Cloud Run, Automation, Authentication, GCP, Dynatrace, Tekton, Java, Active Directory, Python, Apache Kafka

Experience Required:

Specialist

  • 5 years of experience in IT operations and infrastructure automation / DevOps.
  • 2 years of experience using cloud data messaging services (GCP Pub/Sub, Confluent Kafka).
  • 2 years of experience automating and launching infrastructure or application services on any public cloud or on-prem cloud platform.
  • 2 years of experience in Terraform IaC.
  • 2 years of experience in Python, PowerShell, or Bash shell scripting.
  • Experience in CI/CD pipelines using Jenkins, Cloud Build, or Red Hat Tekton.
  • Strong understanding of Identity and Access Management
  • Experience in authentication and authorization services like OAuth2, AD, LDAP, ADFS, SSL.
  • Familiarity with networking (NAT, firewalls, basic routing, load balancing, etc.)
  • Knowledge of SRE principles and of monitoring tools and their integration (Grafana, Dynatrace)
  • Open to being on call for weekends and after hours
  • Must be highly organized and detail-oriented
  • Successful at working in a team culture
  • Excellent verbal and written communication skills
  • Strong problem-solving and analytical/reasoning skills

Experience Preferred:

Additional Skills Preferred:

  • Experience with code-level troubleshooting in Java, Spring, Node.js, or a similar language is a plus
  • Experience with Chatbot development
  • Experience with Agentic AI, ADK, MCP

Education Required:

  • Bachelor's Degree

Our success as a company is built on practicing inclusion and embracing diversity. HTC Global Services is committed to providing a work environment free from discrimination and harassment, where all employees are treated with respect and dignity. Together we work to create and maintain an environment where everyone feels valued, included, and respected. At HTC Global Services, our differences are embraced and celebrated. HTC is an Equal Opportunity Employer. We respect and seek to empower each individual and support the diverse cultures, perspectives, skills, and experiences within our workforce. HTC is proud to be recognized as a National Minority Supplier.
