igorconsulting/README.md

Hi folks 👋, I'm Igor Caetano

A passionate Data Scientist from Brazil.

📍 Montes Claros, Minas Gerais - Brazil
📧 icaetanodiniz@gmail.com
💼 LinkedIn
📱 +55 38 988636216


🚀 About Me

Data Scientist and Machine Learning Engineer with over 4 years of experience developing scalable, data-driven solutions. Currently pursuing a Ph.D. in Data Science at PUC-Rio, focusing on scalable generative AI architectures and anomaly detection.

🎓 Academic Background:

  • Passed both the IME and ITA entrance exams
  • Gold medalist at the Desafio PUC-Rio Olympiad (full scholarship recipient)
  • Master's in Applied Mathematics from PUC-Rio
  • Bachelor's in Mathematics from PUC-Rio

🏢 Industry Experience:

  • Worked on high-impact AI initiatives with Petrobras, Intel, Embraer, and Eletrobras
  • Delivered solutions in: Generative AI, RAG, Computer Vision, LLMs, NLP, and Anomaly Detection
  • Full ML lifecycle expertise: modeling, deployment, CI/CD pipelines, MLOps, cloud-native infrastructure

💼 Current Role

Machine Learning Staff Researcher and Engineer @ HVAR (Sep 2025 - Present)

  • Research and development on LLMs, RAG, and GraphRAG solutions
  • Working with Llama and Gemini for text-to-SQL applications
  • Designing advanced ML solutions within Databricks

🛠️ Technical Skills

Languages & Frameworks

Python SQL PySpark

Machine Learning & AI

  • Deep Learning: Neural Networks, CNNs, RNNs, Transformers
  • Classical ML: Random Forest, XGBoost, SVM, Decision Trees, KMeans
  • NLP & LLMs: Llama, Gemini, GPT, RAG, Text-to-SQL
  • Computer Vision: Object Detection, Image Classification
  • Anomaly Detection: Isolation Forest, SOS, LOF

MLOps & Infrastructure

Docker Kubernetes Airflow Terraform MLflow

Cloud Platforms

GCP AWS Azure Databricks

Tools & Technologies

  • Data Processing: Pandas, NumPy, Elasticsearch
  • Visualization: Matplotlib, Streamlit
  • Version Control: Git, GitHub Actions
  • Deployment: BentoML, Flask
  • Other: Unity Catalog, Unix Systems

πŸ† Key Projects & Achievements

🔹 Anomaly Detection in Oil Wells (Petrobras)

  • Developed anomaly detection system for petroleum well time series
  • Achieved 80%+ detection rate using SOS, Isolation Forest, and LOF
  • Enabled proactive maintenance and optimized resource allocation

🔹 Text-to-SQL with LLMs (Intel)

  • Built experimental RAG framework for SQL query generation
  • Leveraged Llama, Gemini, and GPT architectures
  • Automated natural language to SQL conversion

🔹 Insurance Fraud Detection (Vert)

  • Implemented ML-based fraud detection system
  • Achieved 4x higher accuracy compared to manual methods
  • Utilized clustering, regression, and decision trees

🔹 Credit Risk Prediction (MJV)

  • Built end-to-end ML pipeline for default risk prediction
  • Reduced debt by 50%+ and recovered $1M in losses
  • Automated data pipelines with AWS and PySpark

📚 Education

Ph.D. in Data Science (2023 - 2026)
Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio)

M.Sc. in Applied Mathematics (2021 - 2022)
PUC-Rio | Thesis: Random Forest for Reservoir Simulation

B.Sc. in Mathematics (2018 - 2020)
PUC-Rio

Telecommunications Engineering (2015 - 2017)
Instituto Militar de Engenharia (IME)


🌐 Languages

  • Portuguese: Native
  • English: Advanced

📊 GitHub Stats

Igor's GitHub stats

Top Languages


🤝 Let's Connect!

I'm always interested in collaborating on innovative ML and AI projects. Feel free to reach out!

LinkedIn Email GitHub


⭐️ From igorconsulting

Connect with me:

igor caetano diniz

Languages and Tools:

docker gcp git pandas postgresql python scikit_learn seaborn sqlite tensorflow
