💾 Data Engineer focused on building reliable data platforms, turning raw data into meaningful insights, and designing scalable solutions in the cloud.
I enjoy solving complex problems, learning continuously, and transforming ideas into data-driven products 🚀
- 🎓 Degree in Systems Analysis and Development
- 🧰 Experience designing and implementing data pipelines, ETL/ELT workflows, and analytics architectures
- ☁️ Hands-on with Azure, Databricks, and Apache Spark in production environments
- 🐍 Passionate about Python, SQL, and end-to-end automation
- 🔐 Interested in data governance, quality, and best practices for scalable data platforms
- 🎯 I believe data should not only describe the past, but also support decisions and drive innovation
| Project | Description | Stack |
|---|---|---|
| ETL Automation Pipeline | End-to-end automated ETL/ELT workflow on Databricks and Airflow, orchestrating large-scale batch jobs with monitoring and logging for reliability. | Python, Airflow, Databricks, Spark |
| Sales Analytics Dashboard | Interactive analytics solution providing real-time sales KPIs, trends, and drill-downs for business stakeholders. | SQL, Power BI, Python, PySpark |
| Azure Data Lake Project | Scalable data lake architecture for analytics and reporting, with structured zones and standardized ingestion patterns. | Azure, Spark, Python |
I share content about Data Engineering, PySpark, and Cloud Technologies on Medium:
📘 Recent topics include:
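- Desvendando a Camada “Worked”: O Elo Perdido na Arquitetura de Dados Medallion. (Unveiling the “Worked” Layer: The Missing Link in the Medallion Data Architecture)
- O Fim da Era do Data Lake? Por que o Data Mesh é o Futuro da Engenharia de Dados. (The End of the Data Lake Era? Why Data Mesh Is the Future of Data Engineering)
- Engenharia de Dados com Mentalidade FinOps: Menos Custo, Mais Inteligência. (Data Engineering with a FinOps Mindset: Lower Cost, More Intelligence)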
👉 Read my articles here: medium.com/@luciana.sampaio84
📫 I’m always happy to exchange ideas about data, analytics, and technology:
✨ “Good data tells a story — great data drives change.”


