Open to Opportunities

Building AISystems ThatScale.

AI EngineeringMLOpsComputer VisionLLM SystemsBackend EngineeringProduct Development

View Projects Download CV LinkedIn

ayo2kadri@gmail.comLondon, UK

Production · Healthcare AI

Clinical Imaging

↓

CVAT Auto Annotation

↓

Human Validation

↓

Training Pipeline

↓

Azure ML

↓

Regression Testing

↓

Production Deployment

YOLOTensorFlowCVATAzure MLDockerOpenCV

● Production SystemArchitecture

6+Years Eng.

3AI Platforms

MScAI · Wolverhampton

Scroll

Production Impact

6+Years EngineeringAcross AI, backend & cloud

10K+Healthcare ImagesProcessed in production

70%Annotation EffortReduced via automation

60%Faster DeploymentsCI/CD pipeline optimisation

22%Student RetentionUplift via ML prediction

3Production AI PlatformsShipped to production

Ayo Kadri

AI Systems Engineer

Computer VisionLLM / RAGMLOpsHealthcare AIFintech AIProduct Engineering

About Me

I build AI systems end to end.

I'm an AI Systems Engineer — not just an ML practitioner. There's a difference: I don't hand off a model and call it done. I own the full stack: data pipelines, model training, serving infrastructure, monitoring, and the product layer that makes it useful to real people. MSc in Artificial Intelligence (University of Wolverhampton), and 6+ years building software and AI systems across healthcare, fintech, education, and AI products.

At Eyoto Group, I built the EyeVision AI Platform from the ground up — a production healthcare computer vision system that processes clinical ophthalmic imagery, automates annotation workflows, and deploys via Azure ML with full regression testing. At Your Study Path, I shipped LLM pipelines that moved the retention needle by 22% and cut deployment time by 60%. I'm now building CogFlow — an AI learning OS for professionals, powered by LangGraph multi-agent pipelines and pgvector.

My approach: architecture decisions matter more than model accuracy. A 94% AUC-ROC score in a system that breaks under load, can't be monitored, or can't be explained to stakeholders is not a success. I build AI systems that are scalable, reproducible, observable, and — most importantly — actually useful.

6+Years Engineering

3AI Platforms Shipped

4+Industry Domains

MScArtificial Intelligence

Google Cloud Certifications

Professional ML EngineerData AnalyticsGenerative AIResponsible AI

Career

Where I've Built Things

Production systems deployed. Real metrics. Real domains.

AIML Engineer

Eyoto Group Ltd

Sep 2025 – PresentHealthcare AI

Architected and deployed the EyeVision AI Platform — a production healthcare computer vision system spanning automated annotation pipelines, YOLOv8-based iris/pupil detection, and full Azure ML deployment infrastructure
Designed scalable training pipelines processing 10,000+ annotated medical images; reduced annotation effort by 70% through automated CVAT workflows and quality verification tooling
Owned end-to-end MLOps: Docker-based model packaging, Azure ML experiment tracking, regression testing, and production release workflows for a safety-critical healthcare application
Fine-tuned object detection models on ophthalmic datasets; drove measurable improvements in detection accuracy and inference latency critical for real-time clinical use

Machine Learning Engineer

Your Study Path Limited

Jun 2024 – Jul 2025EdTech / LLM

Designed and shipped production LLM pipelines using LangChain and Hugging Face — improving AI chatbot performance by 40% and directly increasing student engagement across the platform
Built and fine-tuned transformer models for student retention prediction; the system drove a 22% uplift in platform retention, tied directly to business revenue
Owned the full MLOps lifecycle on Google Vertex AI: containerised training jobs, GitHub Actions CI/CD, automated redeployment — cutting deployment cycle time by 60%
Delivered RESTful APIs integrating ML models into real-time recommendation and decision systems, enabling product features used daily by thousands of students

Software Developer

Ascalon Integrated

May 2021 – Feb 2024Mobility / Data

Built a high-performance car-sharing platform managing 500+ vehicles, real-time booking state, and concurrent user sessions — owning backend architecture decisions for scalability
Engineered a real-time analytics engine processing 10M+ daily data points using Apache Spark; designed the pipeline architecture and DevOps infrastructure for reliability and observability

Software Engineer

StreetRates Limited

Jan 2020 – Feb 2021FinTech

Architected and delivered RESTful APIs integrating Google Maps for real-time, country-based FX rate visualisation — the core data feature of the product
Implemented CI/CD pipelines with Docker, achieving 99.9% platform uptime and enabling zero-downtime deployments in a fintech production environment

Education

MSc in Artificial Intelligence

University of Wolverhampton

Aug 2024

Flagship Product

In Development

Building in public

CogFlow

AI Learning Operating System for Professionals

Problem

Professionals lack AI-native, adaptive learning tools. Most study tools are passive, generic, and disconnected from individual knowledge gaps. Learning should be as intelligent as the problems professionals are trying to solve.

Solution

A production-grade AI learning OS — not a chatbot wrapper, but a full system with multi-agent tutoring, RAG-powered curriculum retrieval, adaptive learning paths, memory persistence, and mastery analytics.

Retrieval-augmented generation

Adaptive learning paths

Personalized study plans

Learning analytics dashboard

Multi-agent tutoring system

Memory & context persistence

Mastery tracking

Progress visualisation

FastAPILangGraphPostgreSQLpgvectorNext.jsDockerOpenAI

Follow on GitHubProgress updates

System Architecture

Multi-Agent Tutoring

LangGraph orchestrated agents

RAG Retrieval

pgvector + PostgreSQL

Adaptive Learning

Personalised study plans

Analytics Engine

Mastery & progress tracking

# LangGraph agent routing

User Query

→ Supervisor Agent

→ RAG Agent (pgvector)

→ Tutor Agent (OpenAI)

→ Assessment Agent

→ Adaptive Response

→ Mastery DB Update

LangGraph

Multi-agent

pgvector

Vector store

Adaptive

Learning paths

Persistent

Memory system

Work

AI Platforms Built

Production AI platforms and shipped ML applications — each solving a real domain problem.

View all on GitHub →

Flagship Project

LLM / RAG / Product

CogFlow

AI Learning Operating System for Professionals

In Progress

Flagship AI Product

In ProgressFlagship AI Product

Problem

Professionals lack AI-native, adaptive learning tools. Most study tools are passive, generic, and disconnected from individual knowledge gaps.

arch:Next.js → FastAPI → LangGraph agents → pgvector (PostgreSQL) → OpenAI

FastAPILangGraphPostgreSQLpgvectorNext.jsDockerOpenAI

View Repository →Public Flagship Repository

Computer Vision / Healthcare

EyeVision AI Platform

Healthcare Computer Vision Platform for Automated Annotation & Iris Detection

Completed

Production Healthcare AI System

10K+Healthcare Images Processed

70%Reduction in Annotation Effort

Problem

Ophthalmic software teams needed accurate, real-time iris/pupil detection and a scalable annotation infrastructure — not just a point model, but a complete AI system.

arch:CVAT Annotation → YOLOv8 Fine-tuning → OpenCV Pipeline → Azure ML → REST API → Regression Testing

PythonYOLOv8TensorFlowOpenCVAzure MLCVATDocker

View Case Study →Proprietary Healthcare AI System

Also Built

Finance ML

Credit Risk Assessment Model

Live

0.94AUC-ROC Score

Credit underwriting teams needed interpretable, auditable risk scores — not black-box ML outputs.

PythonLightGBMSHAPGradio

Live Demo →View Code

ML / Product

Customer Churn Prediction App

Live

89%Prediction Accuracy

SaaS businesses lacked actionable, real-time churn risk scores for customer success teams.

PythonScikit-learnGradioHugging Face

Live Demo →View Code

Deep Dives

Case Studies

Full architecture breakdowns — problem, solution, challenges, and what I'd do differently.

EyeVision AI Platform

Healthcare Computer Vision — Full Platform

Computer Vision / Healthcare

10K+

Images Processed

70%

Annotation Effort Saved

Azure ML

Deployment Platform

Problem

Ophthalmic medical software needed real-time iris and pupil detection from clinical camera feeds — but the real challenge was scale: manual annotation of thousands of images was a bottleneck, and deployment to a safety-critical production environment required reproducibility and regression testing that didn't exist.

Solution

Built the EyeVision AI Platform end-to-end: automated CVAT annotation workflows with quality verification, YOLOv8 fine-tuning on 10K+ ophthalmic images, a streaming OpenCV inference pipeline, Azure ML for experiment tracking and deployment, and a regression testing suite for production releases.

Architecture

CVAT Annotation Workflow →

Quality Verification →

YOLOv8 Fine-tuning →

OpenCV Inference Pipeline →

Azure ML Experiment Tracking →

REST API →

Regression Test Suite

Challenges

Lighting variation across clinical cameras caused significant performance degradation on edge cases. Solved with targeted augmentation (albumentations) and domain-specific preprocessing. The annotation quality problem was harder than the model problem — inconsistent labelling was the real accuracy bottleneck.

Lessons Learned

Data quality infrastructure (annotation workflows, quality gates) should be the first investment in any computer vision project — not the model. I'd build the automated annotation pipeline before touching model architecture next time.

PythonYOLOv8TensorFlowOpenCVCVATAlbumentationsAzure MLDocker

Fraud Intelligence Platform

Real-Time Financial Risk Detection

MLOps / Fintech

92%

Accuracy

25%

Fewer False Positives

<5ms

Inference Latency

Problem

A financial platform needed sub-10ms fraud classification at scale — before payment authorisation completed. Batch approaches were too slow, rule-based systems had unacceptable false positive rates triggering customer friction, and black-box model decisions couldn't be explained to compliance teams.

Solution

Built a fraud detection platform — not just a model. An ensemble classifier (XGBoost + shallow NN) backed by a Redis feature store for sub-millisecond feature retrieval. SHAP-based explainability for compliance. FastAPI microservice consuming Kafka events. MLflow for production monitoring and drift detection.

Architecture

Kafka Stream →

Redis Feature Store →

Ensemble (XGBoost + NN) →

FastAPI Endpoint →

SHAP Explainer →

MLflow Monitoring →

Alert System

Challenges

Feature freshness was the hardest problem. The model hit 92% in staging with batch features but degraded under real-time conditions where features were stale. Rebuilding around a Redis feature store with TTL management — before touching inference — was the critical architectural fix.

Lessons Learned

Build the feature store first. Every subsequent decision — model architecture, serving latency, monitoring — becomes simpler once feature consistency is solved. I'd also instrument for data drift from day one: by the time you notice it in production, the damage is already done.

PythonXGBoostTensorFlowFastAPIRedisKafkaSHAPMLflowDocker

Stack

Tools & Technologies

The full stack — from raw data to deployed model to monitored API.

⬡

ML & Deep Learning

Python

TensorFlow

PyTorch

Scikit-learn

XGBoost

LightGBM

Keras

◎

Computer Vision

OpenCV

YOLO

MediaPipe

PIL / Pillow

Albumentations

CVAT

◈

LLM / RAG Systems

LangChain

Hugging Face

OpenAI API

Pinecone

Chroma

LlamaIndex

⬙

MLOps & DevOps

MLflow

Docker

GitHub Actions

DVC

Kubernetes

CI/CD

△

Cloud Infrastructure

Azure ML

AWS SageMaker

Google Vertex AI

EC2 / S3

GCP

HF Spaces

⬟

Data Engineering

Apache Spark

Databricks

Airflow

Pandas

NumPy

SQL

PostgreSQL

⬡

Backend & APIs

FastAPI

Django

Node.js

REST APIs

Redis

gRPC

GraphQL

Google Cloud Certifications

Google Cloud Professional ML Engineer

Google Cloud Data Analytics

Google Cloud Generative AI

Google Cloud Responsible AI

Build In Public

Engineering Decisions

Architecture choices, technical deep dives, and what I wish I'd known when building production AI systems.

LLM ArchitectureMay 2026

Why I switched from LangChain + Pinecone to LangGraph + pgvector for CogFlow

LangChain got us to prototype fast. LangGraph is how we're shipping to production. Here's the architectural decision: what statefulness in multi-agent systems actually costs, why managed vector DBs add latency you don't need, and how pgvector inside PostgreSQL solved three problems at once.

11 min readRead

Computer VisionMar 2026

Building the EyeVision annotation pipeline: from 70% manual effort to fully automated

The bottleneck wasn't the model — it was getting data into the model. This is the full story of building an automated CVAT annotation workflow for 10,000+ ophthalmic images: the edge cases that broke our quality checks, the Azure ML orchestration decisions, and what we'd do differently now.

14 min readRead

MLOpsNov 2025

Shipping fraud detection to production: the feature store decision that almost broke us

Our ensemble model hit 92% accuracy in staging and nearly failed in production — not because the model was wrong, but because feature freshness was inconsistent. This is the architecture decision I'd make first next time: why we built a Redis feature store before touching inference, and how it changed everything.

10 min readRead

Follow the build progress

I share architecture decisions, lessons learned, and behind-the-scenes engineering on LinkedIn.

Follow on LinkedIn

Opportunities

Open to Opportunities

Currently available for the following roles

AI Engineer

Machine Learning Engineer

MLOps Engineer

Computer Vision Engineer

Technical AI Consultant

LocationUnited Kingdom

PreferenceRemote · Hybrid · On-Site

VisaOpen to UK sponsorship

Get in Touch

Let's build intelligent systems that solve real problems.

Open to senior AI engineering roles, technical contracts, consulting, and collaboration on production AI systems.

London, UK — Available remotely or hybrid

GitHub

github.com/Ayodimeji1

linkedin.com/in/ayodimejikadri

ayo2kadri@gmail.com

Download CV

Send a Message

I read every message. Usually reply within 24 hours.