Vladyslav Vinnyk

Senior Data Platform Engineer | Software Development Lead
Ukraine

About

Senior Data Platform Engineer with over 10 years of experience designing, developing, and maintaining large-scale distributed data platforms and web applications. Expert in Java and Scala, with a record of delivering infrastructure cost reductions exceeding $1.5M and 20x-100x improvements in data processing throughput and migration performance. Experienced in leading global data lifecycle strategy, optimizing performance, and building self-service solutions across multi-cloud environments.

Work

Lyft

Software Engineer - Data Platform

Remote, Global

Summary

Single-handedly engineered and managed Lyft's global Data Lifecycle Strategy, optimizing data retention, maintenance, and compaction across 300K tables and 900M partitions.

Highlights

Designed and implemented the Lakehousekeeper Platform for Databricks, managing TTL Retention and External Delta Tables Maintenance for all Lyft Offline Tables, processing 300K tables and 900M partitions, and removing 10PB of data.

Identified and implemented an optimization to the Iceberg Maintenance DAG, reducing annual infrastructure costs by $350,000.

Engineered a custom cold data detection engine, delivering an immediate one-time storage saving of $500,000 and recurring annual savings of at least $500,000.

Resolved a critical scalability bottleneck in the Legacy Offline Table Retention framework, increasing throughput by 20x–30x, meeting a strict 10-hour SLA, and generating $415,000 in annual storage savings.

Led the HMS-to-Glue data catalog migration, implementing a user action ingestion pipeline that made table user detection 5x faster and resolving metadata bottlenecks with 86x write and 100x read performance improvements.

Developed an end-to-end testing framework for table retention using Docker, reducing the feature-to-production feedback loop by 90% (from 15 days to under 2 days) and de-risking legacy system improvements.

Designed and deployed table-level cost attribution based on heuristic scanning of S3 objects, providing precise compute and storage cost visibility across the entire Data Platform and helping users define their costs.

Playtika

Software Engineer - Data Platform

Remote, Global

Summary

Contributed to the development of a self-service Data Platform, streamlining complex pipeline creation and management for numerous tenants and studios.

Highlights

Conducted in-depth research on cutting-edge technologies, performing proof-of-concepts, documenting findings, and delivering presentations to inform strategic platform enhancements.

Designed and owned critical system components, including orchestration, monitoring, table maintenance (compaction, snapshot cleaning), and data validations, and led design enhancement meetings.

Addressed intricate technical challenges such as memory leaks and performance optimizations, ensuring the stability and efficiency of the Data Platform.

Collaborated cross-functionally with various departments to gather requirements, collect feedback, and deliver tailored solutions and system optimizations.

Provided production on-call support one week per month, ensuring high availability and rapid resolution of critical data platform issues.

EPAM at Sephora

Big Data Engineer (Scala)

Remote, Global

Summary

Led the design and implementation of Sephora's marketing data segment, facilitating seamless data consumption and efficient landing to GCP via Azure.

Highlights

Developed robust marketing data pipelines for major platforms including Google, Facebook, and Rakuten, ensuring reliable data flow.

Created core, reusable components in Scala that were widely adopted by other teams, significantly enhancing development efficiency across the platform.

Conducted performance tuning for Spark jobs, optimizing data processing efficiency and reducing execution times.

Accelerated pipeline processing by more than 10x by introducing asynchronous calls within data pipelines.

Successfully executed and deployed various data jobs, enhancing overall platform functionality and data availability.

Grid Dynamics at Kohl's

Big Data Engineer (Scala, Python)

Remote, Global

Summary

Delivered unified Data Platform solutions for Kohl's, encompassing utility integration, job monitoring, cloud monitoring, and pipeline management.

Highlights

Developed real-time and near-real-time pipelines using Flink, Spark, and AWS, ensuring high-speed data ingestion and processing.

Designed and maintained batch ETL pipelines and data lakes on GCP using Airflow, Scala, and Python, ensuring robust data infrastructure.

Conducted performance tuning for Flink and Spark, optimizing data processing efficiency and resource utilization.

Created core, reusable components in Scala that were seamlessly adopted by other teams, fostering consistency and accelerating development.

Implemented cluster and job monitoring dashboards, significantly reducing dependency on Stackdriver and cutting cloud usage costs.

Grid Dynamics at Apple

Big Data Engineer (Scala)

Remote, Global

Summary

Contributed to Apple Media Products' Core Data Engineering Team, handling global data ingestion, GDPR compliance, and framework development for advanced analytics.

Highlights

Developed intricate ETL pipelines and data lakes using Scala, Spark, Kafka, Hadoop, and Amazon S3 for global data ingestion from various devices.

Created reusable tools using Java, Scala, and Bash scripts, significantly enhancing team efficiency and productivity.

Developed and implemented GDPR-related jobs, ensuring full data compliance with stringent regulatory requirements for Apple Media Products data.

Successfully designed and delivered common data frameworks to downstream teams, actively guiding their effective utilization for advanced analytics.

EPAM Systems at Lufthansa

Java Software Engineer

Remote, Global

Summary

Spearheaded the development of a microservice at Lufthansa that manages airline user data and gives other services smooth access to it.

Highlights

Led the full-stack development of a critical microservice, improving user data management and access for various airline services.

Utilized Java 8, Spring Core, Spring AOP, and Spring Security to build robust and secure backend functionalities.

Implemented caching mechanisms with Memcached to optimize data retrieval and enhance system performance.

Collaborated on GitLab/Bitbucket for version control and Jira for project management, ensuring efficient development workflows.

EPAM Systems at OSM

Junior Software Engineer

Remote, Global

Summary

Developed a workplace distribution and employee allocation application from the ground up for EPAM, now widely adopted in production.

Highlights

Led the end-to-end implementation of an application for tracking workplace distribution, managing employee allocations, and handling location management.

Developed the application using Java 8, Spring Core, Spring MVC, Spring JDBC, Spring Data, and Spring Security, ensuring a robust and secure architecture.

Implemented CI/CD pipelines and utilized Docker for streamlined deployment and improved development efficiency.

Successfully launched the application into production, achieving widespread adoption across all EPAM offices.

Education

Kharkiv National University of Radioelectronics
Kharkiv, Kharkiv Oblast, Ukraine

Bachelor

Computer Science

Kharkiv Medical University
Kharkiv, Kharkiv Oblast, Ukraine

Master

Medicine

Skills

Programming Languages

Java 8, Scala, Python, Bash, Groovy.

Big Data Technologies

Spark, Flink, Hadoop, Azkaban, Airflow, Databricks, Iceberg, Delta Tables, Alluxio, Spark SQL, Spark Structured Streaming, Spark Batch.

Cloud Platforms

GCP, Azure, AWS, Apple-Private Cloud, On-Prem k8s, BigQuery, Dataproc, Google Storage, Amazon S3, EMR, EC2, MSK.

Databases & Data Warehousing

MySQL, PostgreSQL, BigQuery, Delta Tables, Iceberg, Kafka, Vertica, HMS, Glue Catalog, Unity Catalog, HDFS.

Web Frameworks

Spring Boot, Spring MVC, Spring Data/JDBC, Spring Security, Spring Core, Spring AOP.

DevOps & Tools

Kubernetes, CI/CD, Docker, Maven, Gradle, GitLab, Bitbucket, Mercurial, Jira, Swagger, LocalStack, Feign client, Lombok.

Monitoring & Logging

Graphite, Grafana, Log4j.

Caching

Memcached, Hazelcast.

Testing

JUnit, Mockito, TestNG, Hamcrest.