Skip to content
View RafaelGallo's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@dsamentoria @WorkML @ComunidadeEstatistica

Block or report RafaelGallo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
RafaelGallo/README.md

👋 Hi, I’m Rafael Gallo

Python R SQL Pandas NumPy scikit--learn TensorFlow PyTorch NLP LLMs Spark PySpark Hadoop Databricks BigQuery Google Cloud Vertex AI Azure Azure ML AWS SQL Server Power BI Qlik Sense Looker Studio Excel Git Docker Scrum Kanban OKR

About

Data Scientist with 5+ years delivering end-to-end solutions for retail and adjacent industries. I cover the full stack—from data engineering/ETL to modeling (ML/DL/NLP), validation, and production/MLOps—primarily in Python and SQL. Strong track record building scalable pipelines (BigQuery, Databricks, Spark) and running cloud-native deployments (Google Cloud, Azure, AWS), tying technical metrics to business outcomes.

🔎 NLP at scale: customer review mining, sentiment/topic classification, and LLM workflows (prompting/embeddings) for actionable insights.

☁️ Cloud & Production: data orchestration, experiment tracking, and model deployment on Vertex AI and Azure ML.

📈 Impact-driven analytics: predictive modeling (churn, recommendation, time series) and executive dashboards (Power BI, Qlik Sense, Looker Studio) for decision-making.

One-liner (optional): Data Scientist (5+ yrs) — NLP, forecasting & recommenders in production; Python/SQL, BigQuery/Databricks/Spark, Vertex AI/Azure ML; business-impact focus.

🛠️ Tech Stack

Programming Languages

  • Python, R, SQL

Data & ETL

  • Pandas, NumPy, PySpark, Spark, Hadoop
  • SQL Server, BigQuery, Databricks

Statistics & Mathematics

  • Descriptive & inferential stats, probability, regression, hypothesis testing

Machine Learning

  • Regression, Classification, Clustering (K-Means, DBSCAN, Hierarchical)
  • Ensemble methods: Random Forest, XGBoost, LightGBM, CatBoost
  • Feature engineering & selection, dimensionality reduction (PCA, t_SNE)
  • Recommenders, anomaly detection, hyperparameter optimization (Optuna, Bayesian)

Time Series

  • ARIMA/SARIMA, Prophet
  • Deep Learning: LSTM, GRU

Deep Learning

  • MLPs, CNNs, RNNs
  • GANs (DCGAN, Pix2Pix, CycleGAN, DeOldify)
  • Transfer Learning (VGG, ResNet, MobileNet, EfficientNet)

NLP & Transformers

  • Preprocessing (tokenization, lemmatization, n-grams, embeddings)
  • BERT (sequence classification), BART (summarization/generation)
  • ChemBERTa (biomedical/healthcare text)
  • Sentiment analysis, classification, summarization, NER

LLMs (Large Language Models)

  • Fine-tuning, Prompt Engineering, RAG
  • Embedding-based classifiers, LangChain, LlamaIndex, Gemini, LLaMA

Cloud & MLOps

  • Google Cloud (BigQuery, Vertex AI), Azure ML, AWS SageMaker
  • Model deployment, CI/CD, monitoring, pipelines (Airflow, Prefect, Kubeflow)

BI & Visualization

  • Power BI, Qlik Sense, Looker Studio, Excel
  • Matplotlib, Seaborn, Plotly

🚀 Highlight Projects

📚 MBA Projects (FIAP)

🎓 Extra Courses

🏫 Data Science Academy

  • Practical Projects – Data Science Academy
    A collection of mini-projects from the Data Science Academy training: big data with R and Azure ML, Python/Spark, Machine Learning, Business Analytics, visualization, and data engineering (Hadoop/Spark).
    Tasks include churn analysis, recommender systems, fraud detection, sentiment analysis, time series forecasting, and dashboards.
    Repo: https://github.com/RafaelGallo/Projetos_dsa

Coursera

📫 Reach Me

Popular repositories Loading

  1. NLP---Sentiment-Analysis-VADER-LeIA NLP---Sentiment-Analysis-VADER-LeIA Public

    Análise de sentimento em frases com VADER

    Jupyter Notebook 5

  2. MLfow---Model MLfow---Model Public

    Modelo de machine learning - MLfow

    Jupyter Notebook 4

  3. AutoML---Machine-learning AutoML---Machine-learning Public

    Projetos realizado com AutoML modelo machine learning

    Jupyter Notebook 4

  4. Project-machine-learning---Climate Project-machine-learning---Climate Public

    Projetos de machine learning aplicado temperatura clima. Projeto de modelos machine learning, series temporais.

    Jupyter Notebook 4

  5. Project-machine-learning---PLN Project-machine-learning---PLN Public

    Projeto de machine learning voltado área PLN

    Jupyter Notebook 4

  6. Machine-learning---PLN-Fake-News Machine-learning---PLN-Fake-News Public

    Projeto machine learning NLP - Combate a noticias falsas

    Jupyter Notebook 4 1