PyData Yerevan 2022

Building Data Pipelines on AWS
08-12, 12:00–12:40 (Asia/Yerevan), 114W PAB

Building Data Pipelines on AWS and the hidden costs that can destroy a budget


Data is the new fuel. Most modern IT companies make their decisions based on collected data. Data engineering plays an important role in this process, because it is responsible for delivering data in the required format. In this talk I will cover ways of building data pipelines on Amazon Web Services. Besides data engineering, I want to draw attention to the hidden costs that can easily destroy a project's budget.


Prior Knowledge Expected

No previous knowledge expected

Rudolf Eremyan is a data scientist with six years of experience. In 2016 he joined the Pulsar.ai startup as an NLP/ML engineer and developed the first chatbot framework for the Georgian language. Since 2018 he has worked in the Toptal freelance network as a data engineer, where he has created Amazon Web Services-based big data processing and visualization tools for companies in eCommerce, sports, and pharma. An active community member, he has been an invited speaker and judge at international conferences and hackathons such as NASA's International Space Apps Challenge, Google DevFest, DataFest Georgia, and Pecha Kucha.