AK.
Open to Opportunities

Building AISystems ThatScale.

AI EngineeringMLOpsComputer VisionLLM SystemsBackend EngineeringProduct Development
ayo2kadri@gmail.comLondon, UK
Production · Healthcare AI
Clinical Imaging
CVAT Auto Annotation
Human Validation
Training Pipeline
Azure ML
Regression Testing
Production Deployment
YOLOTensorFlowCVATAzure MLDockerOpenCV
Production SystemArchitecture
6+Years Eng.
3AI Platforms
MScAI · Wolverhampton
Scroll
Production Impact
6+Years EngineeringAcross AI, backend & cloud
10K+Healthcare ImagesProcessed in production
70%Annotation EffortReduced via automation
60%Faster DeploymentsCI/CD pipeline optimisation
22%Student RetentionUplift via ML prediction
3Production AI PlatformsShipped to production
Ayo Kadri — AI Systems Engineer

Ayo Kadri

AI Systems Engineer

Computer VisionLLM / RAGMLOpsHealthcare AIFintech AIProduct Engineering
01
About Me

I build AI systems end to end.

I'm an AI Systems Engineer — not just an ML practitioner. There's a difference: I don't hand off a model and call it done. I own the full stack: data pipelines, model training, serving infrastructure, monitoring, and the product layer that makes it useful to real people. MSc in Artificial Intelligence (University of Wolverhampton), and 6+ years building software and AI systems across healthcare, fintech, education, and AI products.

At Eyoto Group, I built the EyeVision AI Platform from the ground up — a production healthcare computer vision system that processes clinical ophthalmic imagery, automates annotation workflows, and deploys via Azure ML with full regression testing. At Your Study Path, I shipped LLM pipelines that moved the retention needle by 22% and cut deployment time by 60%. I'm now building CogFlow — an AI learning OS for professionals, powered by LangGraph multi-agent pipelines and pgvector.

My approach: architecture decisions matter more than model accuracy. A 94% AUC-ROC score in a system that breaks under load, can't be monitored, or can't be explained to stakeholders is not a success. I build AI systems that are scalable, reproducible, observable, and — most importantly — actually useful.

6+Years Engineering
3AI Platforms Shipped
4+Industry Domains
MScArtificial Intelligence

Google Cloud Certifications

Professional ML EngineerData AnalyticsGenerative AIResponsible AI
02
Career

Where I've Built Things

Production systems deployed. Real metrics. Real domains.

AIML Engineer

Eyoto Group Ltd

Sep 2025 – PresentHealthcare AI
  • Architected and deployed the EyeVision AI Platform — a production healthcare computer vision system spanning automated annotation pipelines, YOLOv8-based iris/pupil detection, and full Azure ML deployment infrastructure
  • Designed scalable training pipelines processing 10,000+ annotated medical images; reduced annotation effort by 70% through automated CVAT workflows and quality verification tooling
  • Owned end-to-end MLOps: Docker-based model packaging, Azure ML experiment tracking, regression testing, and production release workflows for a safety-critical healthcare application
  • Fine-tuned object detection models on ophthalmic datasets; drove measurable improvements in detection accuracy and inference latency critical for real-time clinical use

Machine Learning Engineer

Your Study Path Limited

Jun 2024 – Jul 2025EdTech / LLM
  • Designed and shipped production LLM pipelines using LangChain and Hugging Face — improving AI chatbot performance by 40% and directly increasing student engagement across the platform
  • Built and fine-tuned transformer models for student retention prediction; the system drove a 22% uplift in platform retention, tied directly to business revenue
  • Owned the full MLOps lifecycle on Google Vertex AI: containerised training jobs, GitHub Actions CI/CD, automated redeployment — cutting deployment cycle time by 60%
  • Delivered RESTful APIs integrating ML models into real-time recommendation and decision systems, enabling product features used daily by thousands of students

Software Developer

Ascalon Integrated

May 2021 – Feb 2024Mobility / Data
  • Built a high-performance car-sharing platform managing 500+ vehicles, real-time booking state, and concurrent user sessions — owning backend architecture decisions for scalability
  • Engineered a real-time analytics engine processing 10M+ daily data points using Apache Spark; designed the pipeline architecture and DevOps infrastructure for reliability and observability

Software Engineer

StreetRates Limited

Jan 2020 – Feb 2021FinTech
  • Architected and delivered RESTful APIs integrating Google Maps for real-time, country-based FX rate visualisation — the core data feature of the product
  • Implemented CI/CD pipelines with Docker, achieving 99.9% platform uptime and enabling zero-downtime deployments in a fintech production environment

Education

MSc in Artificial Intelligence

University of Wolverhampton

Aug 2024
Flagship Product
In Development
Building in public

CogFlow

AI Learning Operating System for Professionals

Problem

Professionals lack AI-native, adaptive learning tools. Most study tools are passive, generic, and disconnected from individual knowledge gaps. Learning should be as intelligent as the problems professionals are trying to solve.

Solution

A production-grade AI learning OS — not a chatbot wrapper, but a full system with multi-agent tutoring, RAG-powered curriculum retrieval, adaptive learning paths, memory persistence, and mastery analytics.

Retrieval-augmented generation
Adaptive learning paths
Personalized study plans
Learning analytics dashboard
Multi-agent tutoring system
Memory & context persistence
Mastery tracking
Progress visualisation
FastAPILangGraphPostgreSQLpgvectorNext.jsDockerOpenAI
Follow on GitHubProgress updates

System Architecture

Multi-Agent Tutoring

LangGraph orchestrated agents

RAG Retrieval

pgvector + PostgreSQL

Adaptive Learning

Personalised study plans

Analytics Engine

Mastery & progress tracking

# LangGraph agent routing
User Query
→ Supervisor Agent
→ RAG Agent (pgvector)
→ Tutor Agent (OpenAI)
→ Assessment Agent
→ Adaptive Response
→ Mastery DB Update

LangGraph

Multi-agent

pgvector

Vector store

Adaptive

Learning paths

Persistent

Memory system

03
Work

AI Platforms Built

Production AI platforms and shipped ML applications — each solving a real domain problem.

View all on GitHub →
Flagship Project
LLM / RAG / Product

CogFlow

AI Learning Operating System for Professionals

In Progress
Flagship AI Product
In ProgressFlagship AI Product

Problem

Professionals lack AI-native, adaptive learning tools. Most study tools are passive, generic, and disconnected from individual knowledge gaps.

arch:Next.js → FastAPI → LangGraph agents → pgvector (PostgreSQL) → OpenAI
FastAPILangGraphPostgreSQLpgvectorNext.jsDockerOpenAI
View Repository →Public Flagship Repository
Computer Vision / Healthcare

EyeVision AI Platform

Healthcare Computer Vision Platform for Automated Annotation & Iris Detection

Completed
Production Healthcare AI System
10K+Healthcare Images Processed
70%Reduction in Annotation Effort

Problem

Ophthalmic software teams needed accurate, real-time iris/pupil detection and a scalable annotation infrastructure — not just a point model, but a complete AI system.

arch:CVAT Annotation → YOLOv8 Fine-tuning → OpenCV Pipeline → Azure ML → REST API → Regression Testing
PythonYOLOv8TensorFlowOpenCVAzure MLCVATDocker
View Case Study →Proprietary Healthcare AI System
Also Built
Finance ML

Credit Risk Assessment Model

Live
0.94AUC-ROC Score

Credit underwriting teams needed interpretable, auditable risk scores — not black-box ML outputs.

PythonLightGBMSHAPGradio
ML / Product

Customer Churn Prediction App

Live
89%Prediction Accuracy

SaaS businesses lacked actionable, real-time churn risk scores for customer success teams.

PythonScikit-learnGradioHugging Face
04
Deep Dives

Case Studies

Full architecture breakdowns — problem, solution, challenges, and what I'd do differently.

01

EyeVision AI Platform

Healthcare Computer Vision — Full Platform

Computer Vision / Healthcare

10K+

Images Processed

70%

Annotation Effort Saved

Azure ML

Deployment Platform

Problem

Ophthalmic medical software needed real-time iris and pupil detection from clinical camera feeds — but the real challenge was scale: manual annotation of thousands of images was a bottleneck, and deployment to a safety-critical production environment required reproducibility and regression testing that didn't exist.

Solution

Built the EyeVision AI Platform end-to-end: automated CVAT annotation workflows with quality verification, YOLOv8 fine-tuning on 10K+ ophthalmic images, a streaming OpenCV inference pipeline, Azure ML for experiment tracking and deployment, and a regression testing suite for production releases.

Architecture

CVAT Annotation Workflow
Quality Verification
YOLOv8 Fine-tuning
OpenCV Inference Pipeline
Azure ML Experiment Tracking
REST API
Regression Test Suite

Challenges

Lighting variation across clinical cameras caused significant performance degradation on edge cases. Solved with targeted augmentation (albumentations) and domain-specific preprocessing. The annotation quality problem was harder than the model problem — inconsistent labelling was the real accuracy bottleneck.

Lessons Learned

Data quality infrastructure (annotation workflows, quality gates) should be the first investment in any computer vision project — not the model. I'd build the automated annotation pipeline before touching model architecture next time.

PythonYOLOv8TensorFlowOpenCVCVATAlbumentationsAzure MLDocker
02

Fraud Intelligence Platform

Real-Time Financial Risk Detection

MLOps / Fintech

92%

Accuracy

25%

Fewer False Positives

<5ms

Inference Latency

Problem

A financial platform needed sub-10ms fraud classification at scale — before payment authorisation completed. Batch approaches were too slow, rule-based systems had unacceptable false positive rates triggering customer friction, and black-box model decisions couldn't be explained to compliance teams.

Solution

Built a fraud detection platform — not just a model. An ensemble classifier (XGBoost + shallow NN) backed by a Redis feature store for sub-millisecond feature retrieval. SHAP-based explainability for compliance. FastAPI microservice consuming Kafka events. MLflow for production monitoring and drift detection.

Architecture

Kafka Stream
Redis Feature Store
Ensemble (XGBoost + NN)
FastAPI Endpoint
SHAP Explainer
MLflow Monitoring
Alert System

Challenges

Feature freshness was the hardest problem. The model hit 92% in staging with batch features but degraded under real-time conditions where features were stale. Rebuilding around a Redis feature store with TTL management — before touching inference — was the critical architectural fix.

Lessons Learned

Build the feature store first. Every subsequent decision — model architecture, serving latency, monitoring — becomes simpler once feature consistency is solved. I'd also instrument for data drift from day one: by the time you notice it in production, the damage is already done.

PythonXGBoostTensorFlowFastAPIRedisKafkaSHAPMLflowDocker
05
Stack

Tools & Technologies

The full stack — from raw data to deployed model to monitored API.

ML & Deep Learning

Python
TensorFlow
PyTorch
Scikit-learn
XGBoost
LightGBM
Keras

Computer Vision

OpenCV
YOLO
MediaPipe
PIL / Pillow
Albumentations
CVAT

LLM / RAG Systems

LangChain
Hugging Face
OpenAI API
Pinecone
Chroma
LlamaIndex

MLOps & DevOps

MLflow
Docker
GitHub Actions
DVC
Kubernetes
CI/CD

Cloud Infrastructure

Azure ML
AWS SageMaker
Google Vertex AI
EC2 / S3
GCP
HF Spaces

Data Engineering

Apache Spark
Databricks
Airflow
Pandas
NumPy
SQL
PostgreSQL

Backend & APIs

FastAPI
Django
Node.js
REST APIs
Redis
gRPC
GraphQL

Google Cloud Certifications

Google Cloud Professional ML Engineer
Google Cloud Data Analytics
Google Cloud Generative AI
Google Cloud Responsible AI
06
Build In Public

Engineering Decisions

Architecture choices, technical deep dives, and what I wish I'd known when building production AI systems.

LLM ArchitectureMay 2026

Why I switched from LangChain + Pinecone to LangGraph + pgvector for CogFlow

LangChain got us to prototype fast. LangGraph is how we're shipping to production. Here's the architectural decision: what statefulness in multi-agent systems actually costs, why managed vector DBs add latency you don't need, and how pgvector inside PostgreSQL solved three problems at once.

11 min readRead
Computer VisionMar 2026

Building the EyeVision annotation pipeline: from 70% manual effort to fully automated

The bottleneck wasn't the model — it was getting data into the model. This is the full story of building an automated CVAT annotation workflow for 10,000+ ophthalmic images: the edge cases that broke our quality checks, the Azure ML orchestration decisions, and what we'd do differently now.

14 min readRead
MLOpsNov 2025

Shipping fraud detection to production: the feature store decision that almost broke us

Our ensemble model hit 92% accuracy in staging and nearly failed in production — not because the model was wrong, but because feature freshness was inconsistent. This is the architecture decision I'd make first next time: why we built a Redis feature store before touching inference, and how it changed everything.

10 min readRead

Follow the build progress

I share architecture decisions, lessons learned, and behind-the-scenes engineering on LinkedIn.

Follow on LinkedIn
Opportunities

Open to Opportunities

Currently available for the following roles

AI Engineer
Machine Learning Engineer
MLOps Engineer
Computer Vision Engineer
Technical AI Consultant
LocationUnited Kingdom
PreferenceRemote · Hybrid · On-Site
VisaOpen to UK sponsorship
Get in Touch
07
Get in Touch

Let's build intelligent systems that solve real problems.

Open to senior AI engineering roles, technical contracts, consulting, and collaboration on production AI systems.

London, UK — Available remotely or hybrid
Download CV

Send a Message

I read every message. Usually reply within 24 hours.