DevOps/SRE Engineer at Playsdev
November 2022 - Present
An outsourcing company specializing in providing DevOps services.
Non-project Activities:
Conducted screening interviews for internship positions, conducted internal interviews, candidate selection, mentoring and assisting in training interns.
Project 0. SRE Engineer, Insurance Company.
Performed the role of CI/CD engineer, responsible for application deployment, CI/CD pipeline maintenance, monitoring and logging infrastructure. Identified system bottlenecks and performance issues, optimized Java applications (JVM tuning, memory optimization), and provided technical support to development teams. Handled incident tickets and conducted root cause analysis for issues from both system and business perspectives. Maintained high availability and reliability of production systems.
Project 1. DevOps Engineer, Mobile Travel Application.
Migrated test infrastructure from AWS to on-premises VMs, set up dev, test, and production environments with zero downtime.
Deployed and managed Kubernetes cluster on VMs, configured access via an external load balancer with high availability.
Optimized Dockerfiles for Python microservices (reduced image size by ~30%), migrated from Docker Compose to Kubernetes using Helm charts.
Set up comprehensive monitoring stack (Prometheus + VictoriaMetrics + Grafana), logging (Loki + Promtail + ELK), and alerting via Telegram.
Deployed self-hosted GitLab in Kubernetes and automated CI/CD pipelines in GitLab, reducing deployment time significantly through library and dependency caching.
Deployed and administered MinIO, Harbor, PostgreSQL, Redis, Kafka, Cassandra, Debezium CDC, Jaeger, Sentry, and EFK stack.
Wrote comprehensive infrastructure documentation and runbooks for team knowledge sharing.
Project 2. DevOps Engineer, Fintech.
Developed and optimized Dockerfiles for Java and Node.js applications, implementing multi-stage builds and best practices.
Created and maintained CI/CD pipelines in Jenkins for building, testing, and deploying applications to Nexus and OpenShift.
Managed repositories in Bitbucket, Nexus, Confluence, SonarQube, and HashiCorp Vault for secrets management.
Configured, deployed, and debugged microservices in OpenShift, ensuring scalability and reliability.
Set up Istio service mesh for internal traffic routing, migrated to newer service mesh versions with zero downtime.
Configured sidecar containers for monitoring (Prometheus exporters) and logging (Fluent Bit).
Conducted load testing with Apache JMeter, visualized results in Grafana, and provided performance recommendations.
Wrote internal documentation and conducted team training sessions on DevOps practices and tools.
Project 3. DevOps Engineer, Infrastructure Migration to AWS.
Wrote Ansible playbooks for deploying and configuring servers and services.
Migrated infrastructure from the customer's facilities to AWS, wrote Terraform code to set up environments and test benches.
Set up and managed a managed Kubernetes cluster (EKS).
Created and configured Helm charts for deploying applications to Kubernetes.
Set up Prometheus and Grafana to monitor nodes and microservice loads and alerts.
Deployed and configured EFK stack to collect and store logs.
Set up and configured self-hosted GitLab.
Migrated code from GitHub to GitLab.
Migrated and refined CI/CD pipelines from the customer's Jenkins to self-hosted GitLab.
Conducted load testing and optimized the performance of applications running in a Kubernetes cluster.
Initiated the transition to GitOps using ArgoCD.
Configured WireGuard VPN for developer access to test environments.
Provided ongoing support to the development team.
Project 4. DevOps Engineer, Group of Commercial Websites.
Migrated AWS-based websites to Azure, ensuring minimal downtime during migration.
Developed CI/CD pipelines in Azure DevOps for building and deploying WordPress containers with automated testing.
Used Terraform to provision and manage cloud infrastructure (CDN, storage, databases, container registry) as code.
Configured Azure Monitor for performance tracking and alerting, reducing incident response time.
Managed DNS, Cloudflare security settings (WAF rules, DDoS protection), and optimized PHP servers for performance and DDoS protection.
Conducted performance testing and optimized cloud costs, achieving ~30% cost reduction.
Provided ongoing developer support and troubleshooting, maintaining 99.9% uptime SLA.