Senior Data Scientist β’ AI/ML Engineer β’ MLSecOps
Building intelligent systems that solve real-world problems.
Core Competencies
- Deep Learning β CNNs, Transformers, Vision Transformers (ViT), attention mechanisms
- Computer Vision β Object detection, target re-identification, image segmentation, feature extraction
- Natural Language Processing β Text classification, sentiment analysis, named entity recognition, embeddings
- Generative AI β LLM fine-tuning, RAG architectures, prompt engineering, multi-modal models
- Foundation Models β CLIP, ALIGN, GPT, BERT, experience with model fusion and adaptation
- Knowledge Distillation β Model compression, teacher-student architectures, edge deployment
- MLSecOps β Secure ML pipelines, model monitoring, drift detection, responsible AI
Techniques & Methods
- Mixture-of-Experts (MoE) architectures
- Contrastive learning & triplet loss
- Transfer learning & domain adaptation
- Hyperparameter optimization (Optuna, Ray Tune)
- Model interpretability (SHAP, LIME, Grad-CAM)
- A/B testing for ML models
Developed a novel Mixture-of-Experts framework that dynamically fuses CLIP and ALIGN foundation models, then distills knowledge into a compact student network for edge deployment. Achieved 50% reduction in inference time while maintaining competitive accuracy on VeRi-776 (63.5% mAP) and Market-1501 (76.1% mAP) benchmarks.
PyTorch CLIP ALIGN Knowledge Distillation Computer Vision
Built an end-to-end re-ID pipeline for matching targets across non-overlapping camera networks. Implemented triplet loss with hard negative mining, cross-camera domain adaptation, and real-time inference optimization for surveillance applications processing 6,000+ frames/second.
PyTorch OpenCV CUDA TensorRT Docker
Engineered a distributed NLP pipeline processing millions of tweets for immigration sentiment analysis in South Africa. Implemented custom BERT fine-tuning, multi-label classification, and temporal trend analysis. Published in IEEE ICTAS 2024.
Transformers BERT NLP Spark AWS
Designed a RAG-based system for enterprise document Q&A with multi-format ingestion (PDF, DOCX, images), hybrid search (dense + sparse retrieval), and hallucination mitigation. Deployed on Azure with autoscaling to handle 10K+ daily queries.
LangChain Azure OpenAI Pinecone FastAPI Kubernetes
Architected a streaming ML pipeline for transaction fraud detection processing 50K+ events/second. Implemented online learning with concept drift detection, feature stores, and model versioning with automated retraining triggers.
Kafka Flink SageMaker Feature Store MLflow
Built a complete MLSecOps framework including automated model training, vulnerability scanning, bias detection, model signing, and secure deployment. Integrated with CI/CD for continuous model delivery with governance controls.
GitHub Actions Docker Kubernetes SageMaker Trivy
Enhancing Target Re-Identification via Model Fusion and Knowledge Distillation of Pre-trained Foundation Models
SACAIR 2025 β Novel MoE-KD framework for efficient real-time re-identification using foundation models.
Analyzing the Perception of Immigrants in South Africa: A Machine Learning Approach to Aggregate Twitter Sentiment Data
IEEE ICTAS 2024 β Read Paper
MSc Artificial Intelligence β University of Johannesburg
Always learning. Always building.