Skip to content
View smeetnalawde's full-sized avatar

Block or report smeetnalawde

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
smeetnalawde/README.md

Hi, I'm Smeet Nalawade 👋

Data Science @ Stevens Institute of Technology


🧠 About Me

I’m a Data Science graduate student at Stevens passionate about building end-to-end ML systems, data-driven products, and scalable analytics pipelines that solve real-world problems.


🔧 What I Work With

  • 🤖 Machine Learning & Applied AI
  • 🏗️ Data Engineering & Big Data (Spark, Hadoop)
  • ☁️ Cloud & Databases (AWS, MongoDB, SQL)

🚀 Projects I’m Proud Of

Project Description Tech Stack Outcome
ETF Recommender System Built a recommender engine to identify structurally similar ETFs based on holdings overlap and weight distribution. Stored complete ETF master + holdings data in MongoDB for scalable querying. Computed similarity using Jaccard & Cosine metrics. Python, Pandas, MongoDB, PyMongo, NumPy, SciPy Successfully identified high-similarity ETF alternatives (e.g., IYY → ITOT/IWB/ILCB), enabling portfolio diversification and risk-aligned fund substitution.
Automated Fruit Grading System Designed a computer vision model to classify fruits by size, color, ripeness, and quality. Developed real-time inference pipeline and designed a UI interface for usage in market settings. Python, OpenCV, TensorFlow/Keras, CNNs, Gradio Improved grading accuracy to ~95%, helping automate manual labor and support fair pricing for farmers. Inspired by real agricultural challenges.
Weather Forecasting & Landslide Prediction Built predictive models for rainfall trends and integrated environmental indicators to forecast landslide risk. Conducted feature engineering and multi-model comparison. Python, Pandas, Scikit-Learn, Random Forest, XGBoost, GIS Datasets Achieved high predictive accuracy and demonstrated how data-driven alerts can support disaster prevention planning.
Quantitative Stock Forecasting System Developed an algorithmic forecasting system for stock price movements using time-series modeling + statistical indicators. Evaluated trend signals for trading decisions. Python, NumPy, Pandas, yFinance API, Statsmodels, Scikit-Learn Generated directional accuracy signals useful for backtesting trading strategies & understanding market volatility.
Crime Resource Allocation & Predictive Analytics (911 NYPD Dataset) Built a predictive pipeline to forecast emergency call hotspots and optimized patrol allocation for response efficiency. Conducted spatiotemporal analysis & visualization. Python, R, ggplot2, dplyr, Pandas, Jupyter Highlighted crime density patterns and demonstrated how data-driven deployment improves urban safety resource management.
Financial Stress & Default Risk Predictor (Satellite + ESG Data) Integrated satellite imagery + ESG features + financial indicators to estimate corporate credit deterioration risk. Applied feature encoding, dimensionality reduction, and gradient boosting models. Python, XGBoost, Pandas, Scikit-Learn, Geospatial Imagery, ESG Data APIs Built a multimodal model capable of improving early-warning risk detection for investment due diligence.

🎯 What Drives Me

Using data & intelligence to create real impact, not just models.


🌐 Connect with Me

Pinned Loading

  1. Quantitative-Stock-Forecasting-System Quantitative-Stock-Forecasting-System Public

    A quantitative trading system that leverages statistical models, machine learning, and market data to generate stock price forecasts and trading signals. This system processes historical price data…

    Jupyter Notebook

  2. Face-Recognition-Based-Attendance-System Face-Recognition-Based-Attendance-System Public

    This project implements a Face Recognition-Based Attendance System using machine learning and computer vision techniques. It detects and recognizes faces to automate attendance tracking, ensuring a…

  3. Game-Data-Analysis-using-R-Studio Game-Data-Analysis-using-R-Studio Public

    This project focuses on **Game Data Analysis** using **R Studio** to derive actionable insights from gaming datasets. It involves data preprocessing, exploratory analysis, and visualization to unco…

    R

  4. Multi-Modal-Brain-Tumor-Segmentation-and-Classification-using-BraTS2020 Multi-Modal-Brain-Tumor-Segmentation-and-Classification-using-BraTS2020 Public

  5. Resource-Allocation-Optimization-and-Predictive-Analytics-for-Crime-and-Accidents-based-on-911-NYPD- Resource-Allocation-Optimization-and-Predictive-Analytics-for-Crime-and-Accidents-based-on-911-NYPD- Public

    This project utilizes the 911 NYPD dataset to build a framework for resource allocation optimization and predictive analytics. By analyzing emergency call trends, it supports law enforcement and re…

    Jupyter Notebook

  6. Financial-Stress-Default-Risk-Predictor-Using-Satellite-and-ESG-Images Financial-Stress-Default-Risk-Predictor-Using-Satellite-and-ESG-Images Public

    A machine learning model that predicts corporate financial distress and default risk by analyzing satellite imagery and ESG (Environmental, Social, and Governance) data. This innovative approach co…

    Jupyter Notebook