As a passionate Junior Data Engineer with a background in Data Science, I specialize in leveraging data to drive insightful decisions and solutions. With hands-on experience in Data Engineering, including data transformation, pipeline development, and data mapping, I am constantly refining my skills in Python, SQL, and AWS. In addition to my data-focused expertise, I enjoy working on Data Science and Web Development projects in my free time, further expanding my technical skill set. I am eager to apply my technical knowledge to real-world challenges and continue advancing within the data field.
I develop and maintain a custom Python-based data mapping engine that performs data transformations based on configuration and mapping tables. The tables are provided in CSV format for ease of use by business analysts, and the engine generates log data to verify mapping quality. I create and optimize data pipelines, ensuring their efficiency and data quality, and develop data mappings that convert clients’ raw data into standardized reporting formats for ETL processes. My work includes data pre-processing, preparing inputs for transformation into standardized formats, performing the transformation, and identifying and correcting data errors in compliance with statistical agency guidelines. I collaborate closely with the development team via JIRA to request the implementation of data transformation logic and test the results, and I contribute to the development of a data platform hosted on Amazon Web Services (AWS). I respond to ad-hoc requests from team members, sharing data processing ideas, analysis results, knowledge, and tutorials. I also work closely with business analysts in the United States and Poland to ensure that the mapping configurations they create meet the technical standards for successful execution. Additionally, I support the DMS Operations team in both technical and business aspects, including setting up Python environments, implementing complex business logic in SQL and Python, and sharing my expertise in insurance data analytics, and I create and maintain comprehensive documentation for the developed solutions.
Working in a Scrum team, I contributed to the development of the NoAPI project, which was in beta testing, by adding new features and addressing reported issues. NoAPI serves as a cloud datastore and hosting environment that integrates with various programming languages. It features user-friendly data models, live data updates by default, and declarative function computations. I also gained hands-on experience in full-stack development by contributing to a hackathon platform built with Python and the Django framework on the backend and HTML, CSS, and JavaScript on the frontend.
I was responsible for implementing the frontend of two web applications, the First Day Application and the Library. I also integrated the backend with the frontend, created JWT tokens, and fixed errors on the backend side. I worked in a Scrum team.
Master’s Thesis:
“The application of convolutional neural networks for real-time recognition of handwritten Chinese characters”
Master’s Thesis:
“One Belt One Road – China’s vision of international trade logistics”
Bachelor’s Thesis:
“Traffic violations analysis in spatial approach”