Paweł Mitruś, Developer in Warsaw, Poland
Paweł is available for hire
Hire Paweł

Paweł Mitruś

Verified Expert  in Engineering

Data Architect and Developer

Location
Warsaw, Poland
Toptal Member Since
September 10, 2021

paweowis是一名数据工程师和架构师,拥有多年使用各种技术构建数据平台的经验, including Azure and Microsoft. Apart from traditional ETLs, data lakes, and data warehouses, 他还精通各种商业智能工具和服务. For the past few years, Paweł's focused on cloud projects, sourcing from both on-premise and cloud locations. 最近,paweows一直在担任一个主要数据网格实现的首席架构师.

Portfolio

Lingaro
Azure, ETL,数据湖,数据库,Azure数据工厂,Azure分析服务...
Azum
Python, Django,领域驱动设计(DDD), SQL, Microsoft Power BI, Scrum...
ITMAGINATION
Azure,云基础设施,Azure数据工厂,Azure分析服务...

Experience

Availability

Part-time

Preferred Environment

Azure, Databricks, SQL, PySpark, Azure Data Factory, Microsoft Power BI, Azure SQL, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), SQL Server BI, Azure Analysis Services

The most amazing...

...Role是一个数据网格项目的首席架构师,该项目涉及40多个开发人员和20个不同的领域团队,将其集成到平台中.

Work Experience

Solution Architect

2019 - PRESENT
Lingaro
  • Led a team of 6-8 tech leads to design and develop a data mesh platform that consisted of several microservices; also helped to plan automation in context of CI/CD.
  • 提供了大约20个关于Databricks平台的最佳实践和反模式的不同培训课程(内部和外部会议),旨在提高参与者的技能.
  • 使用所见即所得编辑器设计并开发了一个自定义ETL框架, 非开发人员可以使用它来以自助方式装载他们自己的ETL管道. 该框架类似于同样在Databricks上执行的ADF数据流.
  • 通过应用最佳实践和减少未来的问题,帮助优化Spark应用程序的性能.
  • 执行了多个Azure Monitor分析,旨在发现被滥用的服务,例如.g., in big data batch processing, 知道几个标记的比例应该是什么样的,并进行分析,结果是200美元,000 in savings.
  • Consulted in multiple "traditional" data lake, data warehouse (DWH), 和在线分析处理(OLAP)项目,并帮助规划特定需求集的架构,建立和配置环境(Azure)。.
Technologies: Azure, ETL,数据湖,数据库,Azure数据工厂,Azure分析服务, Azure SQL, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Microsoft Power BI, Azure DevOps, Azure App Service, Azure Logic Apps, Architecture, Cloud, SQL, PySpark, Python, Distributed Systems, SQL Server DBA, Azure Data Lake, Azure Event Hubs, Visual Studio Code (VS Code), Cloud Infrastructure, Azure Resource Manager (ARM), Azure Virtual Machines, Scrum, Agile, Data Engineering, Data Modeling, Data Pipelines, JSON, REST APIs, T-SQL (Transact-SQL), Apache Spark, Big Data, Data Analytics, Data Architecture, Kimball Methodology

Freelance Lead Analytics Developer and Product Designer

2019 - 2021
Azum
  • 为从用户设备上传到Azum平台的体育活动设计监控和分析功能.
  • Described and helped to understand developers how FIT, TCX, 以及包含活动细节的GPX文件应该如何处理以及如何解释.
  • Helped to organize the process of gathering requirements, specifying them, and handing them over to the development team in a Scrum manner.
Technologies: Python, Django,领域驱动设计(DDD), SQL, Microsoft Power BI, Scrum, Agile, Data Engineering, Data Modeling, JSON, Data Architecture

Solution Architect

2017 - 2019
ITMAGINATION
  • Led several teams, as a solution architect, 与11-15名开发人员在不同的项目中成功交付了超过10个数据分析平台,最终用户总数超过500人.
  • 计划并执行从SQL Server 2008R2到2016年BI平台的主要迁移,该平台包括15个不同的区域.
  • Optimized a data warehouse refresh from 12 to four hours, 主要是通过应用适当的数据结构和索引,还有分区表.
  • 在现有的SSIS框架中实现了一个数据质量面板,该面板收集关于读取/插入行的信息,以便通过不同的数据层(分段)跟踪行数, data warehouse, and semantic).
Technologies: Azure,云基础设施,Azure数据工厂,Azure分析服务, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), SQL Server BI, SQL Server DBA, SQL Server Integration Services (SSIS), SQL Server Analysis Services (SSAS), SQL Server Reporting Services (SSRS), Microsoft Power BI, Azure Resource Manager (ARM), Azure Virtual Machines, Databricks, Architecture, Azure SQL, Cloud, SQL, Azure Data Lake, Visual Studio, ETL, Data Lakes, Azure DevOps, Scrum, Agile, Data Engineering, Data Modeling, Data Pipelines, T-SQL (Transact-SQL), Data Analytics, Data Architecture, Kimball Methodology

Data Developer

2014 - 2017
ITMAGINATION
  • 通过与团队和个人一起分析客户的需求,帮助设计数据仓库星型模式以及事实和维度表(Ralph Kimball).
  • 构建并发布数据仓库(DWH)和商业智能(BI)项目,其中包括与SSIS的集成, a data warehouse hosted on SQL Server 2012-2016, an OLAP database as SSAS (multidimensional and tabular), and reports in SSRS.
  • 开发一个基于SQL Server 2012 MDS的MDM系统,包括培训数据管理员(客户端)如何使用app和Excel表单.
  • Delivered a couple of training sessions regarding PowerQuery, PowerPivot, PowerReport, 熟练使用Microsoft Excel数据透视表(自助式BI).
Technologies: SQL Server BI, SQL Server DBA, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), SQL Server Analysis Services (SSAS), SQL, Visual Studio, ETL, Scrum, Agile, Data Engineering, Data Modeling, Data Pipelines, T-SQL (Transact-SQL), Data Analytics, Data Architecture, Kimball Methodology

Data Mesh

我曾担任数据网格实现的首席架构师(技术部分可以在Martinfowler找到).com/articles/data-mesh-principles.html) in the FMCG field. 我与较小的开发团队的技术主管一起工作,讨论并同意低级架构. I also shared my expertise in Databricks utilization, as it stands for the processing engine of the platform.

技术栈:Azure, Databricks (Python), Airflow, Azure SQL, Azure Data Lake Gen2, App Services

Azure Data Analytics Platform

一个托管在Azure上的数据分析平台,在访问数据集和构建报告方面主要是为自助服务而构建的. 在交付实现之前,大多数高级用户都在使用Databricks制作原型. 该解决方案包括批处理和近实时处理.

我的角色主要涉及架构咨询和帮助计划实现. 我还帮助解决性能问题并调整云利用率以降低总体成本.

Technology Stack: Azure, Data Factory, Databricks, Azure SQL, Azure SQL Data Warehouse (Synapse), Databricks, Azure Data Lake Gen2, Event Hub, Azure Analysis Services, Power BI

Global Business Intelligence

I worked as an architect for a business intelligence platform, 采购主要是MS Dynamics AX,部署在世界各地的几个地区.

开发工作持续了两年多,涉及5-7名开发人员. 我们每天以批处理模式实现一次ETL,以便用户可以访问数据仓库(DWH)。, OLAP database, or predefined reports. Due to the immaturity of the Azure PaaS services, we decided to host the solution mostly on VMs (IaaS).

技术堆栈:Azure, MS SQL Server 2016 (SSIS, SSRS), Azure分析服务,PowerBI
2011 - 2015

Engineer's Degree in Computer Science

Warsaw University of Technology - Poland, Warsaw

DECEMBER 2019 - PRESENT

Azure Solutions Architect

Microsoft

DECEMBER 2018 - PRESENT

Agile PM

APMG International

DECEMBER 2018 - PRESENT

Professional Scrum Master 1 (PSM1)

Scrum.org

FEBRUARY 2017 - PRESENT

Microsoft Certified Professional

Microsoft

Libraries/APIs

PySpark, REST APIs

Tools

SQL Server BI, Microsoft Power BI, Visual Studio, Azure应用服务,Azure逻辑应用

Languages

SQL, T-SQL (Transact-SQL), Python

Platforms

Azure, Databricks, Azure SQL Data Warehouse, Visual Studio Code (VS Code), Dedicated SQL Pool (formerly SQL DW), Azure Event Hubs

Paradigms

ETL, Scrum, Agile, Kimball Methodology, Azure DevOps, DSDM

Storage

Azure SQL, Data Lakes, Data Pipelines, SQL Server Analysis Services (SSAS), SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), SQL Server DBA, JSON

Frameworks

Apache Spark, Django

Other

Azure Data Factory, Architecture, Cloud, Data Engineering, Data Modeling, Data Architecture, Azure Analysis Services, Azure Data Lake, Domain-driven Design (DDD), Cloud Infrastructure, Azure Resource Manager (ARM), Big Data, Data Analytics, Distributed Systems, Azure Virtual Machines, Data Mesh

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring