Skip to content
View TendaiShoko's full-sized avatar
:atom:
Building cool apps
:atom:
Building cool apps
  • South Africa
  • 05:02 (UTC -12:00)

Highlights

  • Pro

Block or report TendaiShoko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
TendaiShoko/README.md

Hey, I'm Tendai πŸ‘‹

Senior Data Scientist β€’ AI/ML Engineer β€’ MLSecOps

Building intelligent systems that solve real-world problems.


πŸ€– AI & ML Expertise

Core Competencies

  • Deep Learning β€” CNNs, Transformers, Vision Transformers (ViT), attention mechanisms
  • Computer Vision β€” Object detection, target re-identification, image segmentation, feature extraction
  • Natural Language Processing β€” Text classification, sentiment analysis, named entity recognition, embeddings
  • Generative AI β€” LLM fine-tuning, RAG architectures, prompt engineering, multi-modal models
  • Foundation Models β€” CLIP, ALIGN, GPT, BERT, experience with model fusion and adaptation
  • Knowledge Distillation β€” Model compression, teacher-student architectures, edge deployment
  • MLSecOps β€” Secure ML pipelines, model monitoring, drift detection, responsible AI

Techniques & Methods

  • Mixture-of-Experts (MoE) architectures
  • Contrastive learning & triplet loss
  • Transfer learning & domain adaptation
  • Hyperparameter optimization (Optuna, Ray Tune)
  • Model interpretability (SHAP, LIME, Grad-CAM)
  • A/B testing for ML models

πŸ”§ Tech Stack

ML & Deep Learning
Python TensorFlow PyTorch Scikit-Learn Keras

Generative AI & LLMs
OpenAI Hugging Face LangChain

Cloud & MLOps
AWS Azure GCP SageMaker Docker Kubernetes

Data & Databases
Pandas NumPy PostgreSQL MongoDB

CI/CD & Tools
GitHub Actions Jenkins MLflow


πŸš€ Key Projects

MoE-KD: Foundation Model Fusion for Real-Time Re-Identification

Developed a novel Mixture-of-Experts framework that dynamically fuses CLIP and ALIGN foundation models, then distills knowledge into a compact student network for edge deployment. Achieved 50% reduction in inference time while maintaining competitive accuracy on VeRi-776 (63.5% mAP) and Market-1501 (76.1% mAP) benchmarks.

PyTorch CLIP ALIGN Knowledge Distillation Computer Vision


Multi-Camera Target Re-Identification System

Built an end-to-end re-ID pipeline for matching targets across non-overlapping camera networks. Implemented triplet loss with hard negative mining, cross-camera domain adaptation, and real-time inference optimization for surveillance applications processing 6,000+ frames/second.

PyTorch OpenCV CUDA TensorRT Docker


Large-Scale Social Sentiment Analysis Pipeline

Engineered a distributed NLP pipeline processing millions of tweets for immigration sentiment analysis in South Africa. Implemented custom BERT fine-tuning, multi-label classification, and temporal trend analysis. Published in IEEE ICTAS 2024.

Transformers BERT NLP Spark AWS


Generative AI Document Intelligence System

Designed a RAG-based system for enterprise document Q&A with multi-format ingestion (PDF, DOCX, images), hybrid search (dense + sparse retrieval), and hallucination mitigation. Deployed on Azure with autoscaling to handle 10K+ daily queries.

LangChain Azure OpenAI Pinecone FastAPI Kubernetes


Real-Time ML Fraud Detection Platform

Architected a streaming ML pipeline for transaction fraud detection processing 50K+ events/second. Implemented online learning with concept drift detection, feature stores, and model versioning with automated retraining triggers.

Kafka Flink SageMaker Feature Store MLflow


Automated MLOps Pipeline with Security Controls

Built a complete MLSecOps framework including automated model training, vulnerability scanning, bias detection, model signing, and secure deployment. Integrated with CI/CD for continuous model delivery with governance controls.

GitHub Actions Docker Kubernetes SageMaker Trivy


πŸ“š Publications

Enhancing Target Re-Identification via Model Fusion and Knowledge Distillation of Pre-trained Foundation Models
SACAIR 2025 β€” Novel MoE-KD framework for efficient real-time re-identification using foundation models.

Analyzing the Perception of Immigrants in South Africa: A Machine Learning Approach to Aggregate Twitter Sentiment Data
IEEE ICTAS 2024 β€” Read Paper


πŸŽ“ Education & Certifications

MSc Artificial Intelligence β€” University of Johannesburg

Azure Data Scientist Azure AI Engineer


πŸ“« Connect

Email GitHub


Always learning. Always building.

Pinned Loading

  1. -trkr-app -trkr-app Public

    JavaScript

  2. 100-Days-Of-ML-Code 100-Days-Of-ML-Code Public

    Forked from Avik-Jain/100-Days-Of-ML-Code

    100 Days of ML Coding

  3. awesome-mcp-servers awesome-mcp-servers Public

    Forked from punkpeye/awesome-mcp-servers

    A collection of MCP servers.

  4. credit-deployment-aws credit-deployment-aws Public

    Jupyter Notebook

  5. Health_Risk_Assesment Health_Risk_Assesment Public

    Python

  6. Heart-Disease-Prediction Heart-Disease-Prediction Public

    Forked from Ravjot03/Heart-Disease-Prediction

    Jupyter Notebook