Dmitry P.

Data Engineer | Software Engineer

Remote work. Part-time. Full-time.

Resume last updated: 20.07.2022
Address: Wrocław, Poland
Email: Hidden
Phone: Hidden
Social network or website: Hidden

Experience

Startup “RDStrim”, Enterprise Development Foundation “TWÓJ STARTUP”
Data Engineer | Software Engineer
May 2018 - Present
• Data Analysis / Engineering;
• Building out an infrastructure for working with large amounts of data;
• Designing, building and maintaining real-time/batch data pipelines and ETLs;
• Distributed Data Processing;
• Visualizing data using analytical platforms;
• Support in automation of reporting environment;
• Developing Data Science solutions;
• Development of tools for design and modeling, expert systems development;
• Performing light statistical analysis and machine learning;

Participation in projects:
• ML Security Net, Wrocław, Poland. Applying machine learning to improve cyber security;
• BioNoSQL, Wrocław, Poland. Storage and processing of DNA and protein sequences in a distributed file system. Design and development of data pipelines and computation pipelines;
Startup “RDStrim”, Enterprise Development Foundation “TWÓJ STARTUP”
Software Engineer
Feb 2016 - May 2018
• Web services development, processing of big data streams in real time;
• Building a robust infrastructure for dealing with large amounts of data;
• Developing scalable streaming data flows;
• Analysis and improvement of the efficiency, scalability and stability of data collection, storage and retrieval processes;
• Creating reports for internal teams and the business;

Participation in projects:
• Hotspot Shield, Redwood City, California, US (www.hotspotshield.com). Participated in the development of a web service for controlling access to the VPN server network. Analysis, design and development of an operational statistics service, including statistical modeling. Analysis, design and development of a billing subsystem;
Tomsk Polytechnic University
Software Engineer
Feb 2003 - Feb 2016
• Developing client/server applications, customer-facing applications, internal tools, Oracle Application Express (Oracle Apex) reports and applications;
• Participating in research projects in the field of distributed data processing: performing scientific research and statistical analyses, applying machine learning methods, and developing advanced tools and models to discover insights;
• Maintaining and evolving analytics data architecture;
• Creating and maintaining optimal data pipeline architecture;
• Establishing and maintaining the set of business reporting requirements;

Participation in projects:
• NEDRA, Scientific Analytical Center, Tomsk, Russia
Project of marine seismic surveys.
- Software development for seismic data acquisition, tools for designing and modeling seismic surveys.
- Software development for distributed big data processing.

• Tomsk Research and Design Institute of Oil and Gas, Tomsk, Russia
- Development of software for financial accounting and reporting.
- Developing and implementing effective reporting using various BI and assessment tools to minimize risk and proactively identify problem areas.

• Cardiology Research Institute, Tomsk, Russia
- Analysis of clinical trial datasets, machine learning, statistical modeling for high volume and complex datasets.
- Software development for distributed processing of statistical data and medical research data, expert systems development.
LLC Yurginsky machine engineering plant
Software Engineer
Jan 1997 - Jan 2003
• Software development for financial accounting and management;
• Analyzing business needs, building data models and BI applications to satisfy business demands;
• Designing and developing interactive dashboards and reports;

Education

Tomsk State University of Control Systems and Radioelectronics
Bachelor's degree, Information Technology
Sep 2000 - Jun 2005

What are your strengths?

25+ years of experience in:

Data Analytics & Engineering Skills:
• Data Analysis and Engineering;
• Relational Database Design, Development, Data modeling (OLTP, ROLAP);
• Data warehouse design and management (Data Lake, DWH, Data Mart);
• Designing, building and maintaining real-time/batch data pipelines and ETLs;
• Hands-on experience with streaming data;
• Identifying, analysing and interpreting trends and patterns in large and complex data sets;
• Visualizing data using analytical platforms;
• Support in automation of reporting environment;
• Understanding of machine learning techniques and their application to business decisions;
• Performing light statistical analysis and machine learning;

Development Skills:
• Client-Server Architecture Application Development;
• Designing, developing and maintaining complex software programs;
• Software development for distributed computing;
• Experience with highly reliable, high-volume transaction processing production systems;
• Process modelling (UML, BPMN) & process documentation;

Technical Proficiencies:
• Programming Languages: C/C++, Java, Scala, Python
• Relational Databases: Oracle, MySQL, PostgreSQL
• NoSQL Databases: ClickHouse, Redis, InfluxDB, Elasticsearch
• Big Data Ecosystem: Hadoop (HDFS, MapReduce), Hive, Pig, Spark
• Workflow Automation Tools: Apache Airflow, Apache NiFi
• Machine Learning Tools: Scikit-Learn, Apache Spark ML, TensorFlow, PyTorch
• Data Science Tools: Pandas, NumPy, SciPy, Matplotlib
• Message Brokers: Apache Kafka, RabbitMQ
• BI Systems: Oracle BI Suite, Tableau, Power BI, Google Data Studio
• Reporting Tools: JasperReports, Oracle Reports, Oracle Application Express (APEX)
• Java-based Ecosystem: Spring Framework, Hibernate, JDBC, JPA
• Development Tools: Git, SVN, Gradle, Maven, Ant