Hi, my name is

Jakub Durczok.

Lead Software Engineer — Data & AI

Lead Software Engineer building and leading teams that deliver scalable data, ML, and AI products used across 30+ global markets by a Fortune 50 FMCG leader.


About Me

I build and lead teams that deliver scalable data, ML, and AI products used across 30+ global markets by a Fortune 50 FMCG leader. My core backend expertise is in Python, Spark, SQL, and Azure, with deep knowledge of Django, PostgreSQL, REST APIs, and distributed systems.

I hold a Master’s degree (summa cum laude, ranked first in class) in Advanced Analytics — Big Data from SGH Warsaw School of Economics.

Technologies I work with:

Python, PySpark / Databricks, Django, PostgreSQL, Azure, Docker, LangGraph / RAG, Kedro, GitHub Actions, Terraform, Redis, Kafka, Kubernetes, Airflow, Streamlit

Experience

Lead Software Engineer — Data & AI
October 2025 — Present
  • Built an engineering platform covering 130 customers and terabytes of data by designing a SOLID-based connector architecture on Databricks and Kedro with strategy pattern for SAP, Oracle, and PostgreSQL.
  • Unified schema management across 5 backends (Unity Catalog / Local / Memory / Oracle / PostgreSQL) by developing a database schema abstraction layer in Python with time-travel queries and SCD 2 logic.
  • Engineered a “code chain” library with guaranteed O(n) time complexity by building a REST API server and NetworkX DAG engine to transform and process product code changes for time series ML in Python.
  • Cut weekly effort by 12 hours across 7 projects by building an agentic AI app on Django and LangGraph; benchmarked 4 LLMs across 3 retrieval datasets and 3 vector stores (Chroma, PostgreSQL, FAISS) to optimize a RAG pipeline integrated with Jira and ServiceNow APIs for ticket analysis and knowledge base management.
  • Achieved >$200,000/year in savings by building an insourced engineering team and deploying an autonomous L1 support agent on LangGraph that monitors Databricks jobs, diagnoses failures, and escalates via Teams.
  • Led the company-wide migration of Azure email auth from SMTP to OAuth 2.0 for a shared ML/DE framework used by hundreds of engineers across hundreds of projects by refactoring the core notification module and coordinating rollout.
Software Engineer — Data
October 2022 — September 2025
  • Scaled a demand-forecasting ML product from 30 to 110 customers while improving reliability to 100% by optimizing Spark pipelines in Python/PySpark and implementing MLOps and CI/CD gates.
  • Saved 6 FTEs of operations effort by developing a distributed Django web app in Python with custom concurrency controls and Redis caching to prevent race conditions.
  • Cut I/O-bound SAP API latency by 90% by building a Python SDK for concurrent data retrieval and async processing.
Engineer — Salesforce
July 2022 — September 2022
  • Delivered a CRM system MVP in 3 months on Salesforce for a car dealership by configuring end-to-end vehicle, client, and repair tracking workflows.
Wiktor Kos Consulting
Engineer — SAP HCM
June 2021 — June 2022
  • Delivered zero-defect payroll for 10,000+ employees across Germany, Austria, and Switzerland and de-risked a multi-million-euro SAP S/4HANA migration by building custom ABAP master data validation tailored to the client’s needs.

Education

Master of Arts, Advanced Analytics — Big Data
October 2023 — July 2025
GPA: 4.96 / 5.0

Summa cum laude

Coursework: Python, R, C++, Real-Time Analytics (PySpark, Spark Structured Streaming, Docker, Flask), Database Systems (SQL, PL/SQL), Data Visualisation (R, Shiny, Power BI), Credit Scoring (SAS).

Bachelor of Arts, Quantitative Methods in Economics and IT
October 2020 — July 2023
GPA: 4.95 / 5.0

Summa cum laude | Ranked First in Class

Coursework: Statistical Methods, Database Systems (SQL Developer), Econometrics, Operational Research.

Projects

Demand Forecasting — Product Code Chains
Python, Django, REST API, Docker
Engineered a REST API and DAG-based library that links product code changes into families, preserving demand history for time series ML. Published as an open-source case study.
Warehouse Management System
Python, Django, Hackathon Winner
Optimized donation logistics for a childhood cancer charity by building a full-stack system in Python (Django). Hackathon Winner (2025).
Vendor Manager — Thesis Project
Python, Django, PostgreSQL, Docker
Reduced manual resource administration by developing a web app in Python (Django) for automated contract and cost tracking. Thesis Project (2023).
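
As an illustration of the idea behind the Demand Forecasting project above (linking product code changes into families so that demand history survives a code change), here is a minimal sketch. It is not the published library: the function names, the `(old_code, new_code)` input format, and the sample data are all hypothetical, and it uses only the Python standard library rather than the project's actual stack.

```python
from collections import defaultdict


def build_code_families(changes):
    """Map every product code to a family representative.

    `changes` is an iterable of (old_code, new_code) pairs (hypothetical format).
    """
    # Undirected adjacency: a change puts both codes in the same family.
    neighbours = defaultdict(set)
    for old_code, new_code in changes:
        neighbours[old_code].add(new_code)
        neighbours[new_code].add(old_code)

    family_of = {}
    for start in neighbours:
        if start in family_of:
            continue
        # Iterative traversal: each code and each change is visited once,
        # so total work is linear in the number of codes plus changes.
        family_of[start] = start
        stack = [start]
        while stack:
            code = stack.pop()
            for other in neighbours[code]:
                if other not in family_of:
                    family_of[other] = start
                    stack.append(other)
    return family_of


def merge_demand_history(demand, family_of):
    """Sum demand per (family, period); codes without changes keep their own key."""
    merged = defaultdict(float)
    for (code, period), quantity in demand.items():
        merged[(family_of.get(code, code), period)] += quantity
    return dict(merged)


# Illustrative data only.
changes = [("A1", "A2"), ("A2", "A3"), ("B1", "B2")]
demand = {
    ("A1", "2024-01"): 10.0,
    ("A2", "2024-02"): 12.0,
    ("A3", "2024-03"): 9.0,
    ("C1", "2024-01"): 4.0,
}
family_of = build_code_families(changes)
history = merge_demand_history(demand, family_of)
```

In this sketch the three `A` codes collapse into one continuous series keyed by the first code seen, which is the property a time series model needs when a product is renumbered.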

Certifications

Databricks Data Engineering Professional
Earned 2026
Databricks Data Engineering Associate
Earned 2024
Microsoft Azure Fundamentals (AZ-900)
Earned 2023

Languages

English
C2 — Proficient
German
C2 — Proficient
Polish
Native