Lead Data Engineer (Python, Pandas)
- Back-End
- Preferred from Brazil, Bulgaria, Colombia, Georgia, Hungary, Lithuania, Poland, Romania, Uzbekistan
- 4000 - 6000 USD
- Lead
- Full-Time
- Remote
XY DIGITAL is seeking a Lead Data Engineer (Python, Pandas) to join a large-scale transformation project with one of the most respected global consulting firms. You will architect, lead, and scale complex data solutions across cloud-native platforms, mentor a skilled team, and ensure excellence in data handling and backend performance. The project uses a modern stack with a focus on Python, FastAPI, and Azure Cloud. If you’re a senior engineer with leadership experience and a hands-on mindset, we want to hear from you.
Step 1: HR Screening
Step 2: Technical Deep Dive (code task or live walkthrough)
Step 3: Leadership Interview with Client and Engineering Lead
Start Date: Immediate
Contract: Long-term / Open-ended
Lead architecture and development of data-intensive Python applications
Define coding standards, enforce best practices, and lead code reviews
Guide a team of developers and mentor junior engineers
Optimize backend logic and data processing for performance and scalability
Build maintainable code with full test coverage and robust documentation
Collaborate daily with cross-functional teams and the client
Resolve complex data pipeline and transformation issues
Ensure delivery aligns with project goals, timelines, and client expectations
7+ years of experience in backend or data engineering roles
Strong expertise in Python and FastAPI
Extensive experience with Pandas, Polars, or an equivalent library for advanced data handling
Experience with Pydantic and clean schema design
Proficiency in asyncio and asynchronous programming techniques
Experience working with MongoDB, Parquet, and Delta Tables
Deep understanding of transformation workflows and data architecture
Ability to manage tasks and lead technical delivery independently
Attention to detail, with strong documentation and organizational skills
Comfortable working in a high-performance Agile environment
Experience with Azure (Data Lake, Blob Storage, Redis, Service Bus)
Familiarity with .NET, Databricks, Spark, or PySpark
Hands-on experience with Docker and Kubernetes
Background in designing or working within microservice architectures
Data Engineering
Python
Pandas
FastAPI
Pydantic
asyncio
MongoDB
Parquet
Delta Tables
Azure Cloud