Portrait of Matome Mbowene

Matome Mbowene

Software & AI Engineer • OCR/CV • RAG • Backend

Open to Software • AI/ML • Backend/Cloud Location Cape Town Work mode Hybrid / Remote

I build production AI systems—document automation, retrieval, and backend services—designed for reliability.

Outcomes

Based in Cape Town • Open to hybrid/remote roles

References

References available

References available on request.

Public proof is available via case studies, repos, and certifications.

Programs & platforms

Credentialed programs and core platforms used in delivery.

  • Google Cloud skill badge(s) (Credly)
  • Dell Young Leaders program
  • GitHub, Docker, and OpenAI used across projects

Recent Experience

Computer vision & document automation

Recent
  • Delivered OCR + LLM pipeline with high field-mapping accuracy on a defined set.
  • Built validation layers, confidence scoring, and audit-grade logging.

Retrieval systems (RAG)

Recent
  • Built a RAG assistant with FAISS + sentence-transformers and LLM APIs.
  • Shipped the end-to-end pipeline and tightened production safeguards.

Embedded / edge fundamentals

Recent
  • Developing sensor fusion and edge-AI for real-time hazard detection.
  • Contributing to firmware and data architecture for LiDAR/IMU/camera integration.

Skills & experience

Production AI, backend, reliability, and embedded systems.

AI / ML systems

Document automation & OCR
RAG systems
Computer Vision

Backend & data

API design & integration
Data modeling
Observability

Systems & shipping

CI/CD & release hygiene
Containerization
Performance optimization

Embedded & edge

Sensor integration
Real-time pipelines
Edge AI fundamentals

Featured Projects

Featured picks first; filter for detail.

NDA note (what I can share publicly)

Built to protect clients and teams
I can discuss
  • Problem framing, constraints, and tradeoffs
  • Architecture patterns and reliability practices
  • Validation, observability, and delivery workflow
I won’t publish
  • Client names, internal documents, or private metrics
  • Confidential datasets, prompts, or implementation details
  • Anything that violates NDA or privacy
Featured

OCR Document Automation

Production OCR + LLM pipeline for document-to-structured-data automation.

High accuracy
Python OpenCV OCR PDF Forms LLM APIs
Case study details
What I optimized for: correctness and traceability first. The pipeline uses layered validation (format, geometry, constraints, cross-field rules) so extraction failures are caught early and are explainable.
What I’d improve next: expand evaluation sets, add drift checks on input quality, and tighten “confidence-to-review” thresholds to reduce manual review time.
Featured

RAG AI Assistant

RAG assistant for portfolio and candidate Q&A with grounded retrieval and conservative response behavior.

Reliable retrieval
Python FAISS sentence-transformers Streamlit LLM APIs

Embedded Navigation

Sensor fusion + edge-AI foundations for real-time hazard detection.

C/C++ STM32 LiDAR IMU Camera
Case study details
Emphasis on signal integrity and latency. Contributed to firmware/data architecture decisions to ensure consistent sensor timing and robust downstream consumption.
NDA

Confidential AI Product Build (NDA)

Ongoing work on an AI-driven product with details limited by NDA.

LLM APIs Backend CI/CD

Note: Details limited by NDA; happy to discuss at a high level.

FashionMNIST Classifier

Neural network image classifier with an end-to-end training and evaluation pipeline.

  • Outcome: 89.33% test accuracy on FashionMNIST.
  • Approach: training pipeline with preprocessing and reproducible runs.
  • Reliability: metrics, validation, and clear experiment tracking.
  • Stack: Python, PyTorch.
Case study details
What this demonstrates: the full ML loop (data → training → evaluation) with repeatability.
What I’d improve next: add calibration, stronger baselines, and automated experiment tracking to make comparisons faster and more robust.

MyAdvisor (Full‑Stack Web App)

Full-stack web app emphasizing usability, data integrity, and maintainable APIs.

Java Spring Boot MySQL

Scheduling & Networking Systems

Hands-on systems work: scheduling simulations, protocol design, and integrity checks.

35% efficiency gain
Java Python Sockets Hashing

Project timeline (high level)

A quick view of how my work evolved across domains. Details are intentionally high-level where needed.

Full‑stack foundations Web • APIs • Databases
Shipped web app features with API design, data modeling, and UX iteration.
Systems + performance Scheduling • Networking
Benchmarked algorithms and built integrity-first networking components.
Modeling + evaluation PyTorch • CV
Developed training pipelines with reproducible evaluation and clear metrics.
Production AI systems OCR • RAG • Reliability
Built document automation and retrieval pipelines with guardrails and operational readiness.

Certifications

Google Cloud Skill Badges

Verified cloud learning and hands-on labs focused on practical, production-relevant skills.

  • Multimodal RAG (Gemini) (view on Credly)

Programs

Selected programs that strengthened leadership, communication, and technical execution.

Writing

Short, public-safe notes on how I build and ship reliable systems.

Validation-first OCR: why “accuracy” isn’t enough

In production OCR, errors often look “plausible” and silently poison downstream systems. My default is validation-first extraction: explicit constraints, cross-field rules, confidence thresholds, and human-review hooks. The goal is not just extracting text, but producing outputs you can trust and audit.

RAG guardrails: being useful without hallucinating

A good RAG system is as much about “when to refuse” as it is about retrieval. I focus on deterministic indexing, conservative thresholds when evidence is weak, and clear provenance (links back to sources). This keeps the assistant helpful while staying honest and grounded.

Shipping discipline: small checks prevent big failures

What helps me move fast safely: reproducible environments, dependency hygiene, automated checks, and clear “definition of done”. Even lightweight CI, good error handling, and predictable releases reduce firefighting and increase delivery cadence.

About

How I work

I build end-to-end AI features that hold up in production: ingestion, validation, retrieval, model integration, observability, and delivery. My focus is correctness and reliability: clear interfaces, defensive checks, and measurable outcomes.


What I’m looking for

Software engineering, AI/ML engineering, and backend/cloud roles. Open to hybrid or remote opportunities.

Education

University of Cape Town
Computer Science & Computer Engineering


Skills

Production OCR Computer Vision RAG Backend services CI/CD Reliability

Get In Touch

Email or LinkedIn preferred. Resume and GitHub below.

Response 24–48h Availability Full-time / Contract Work mode Hybrid / Remote
Email
Email
matomepontso@gmail.com
LinkedIn
LinkedIn
linkedin.com/in/matomembowene
Resume PDF
Resume PDF
resume.pdf
GitHub
GitHub
github.com/MatomeMb

Note: this form opens your email client with a pre-filled message (no server, no data storage).

Open to opportunities