Aung Khant Phyo

Software Engineer & Gen AI Developer

About Me

Get to know me better

I'm Aung Khant Phyo, a Computing graduate from the University of Greenwich. I started in mechatronics but shifted to computer science after discovering the power of IT during the COVID-19 lockdown.

Since then, I've developed skills in data analysis using Python and PowerBI, and have experience in machine learning with frameworks like TensorFlow and PyTorch. I've worked as a data analyst for a social listening company and as a backend engineer for a Gen AI application development company. I'm especially enthusiastic about solving global challenges with information technology.

Education

My academic background

University of Greenwich

London, United Kingdom

Bachelor of Science (Honours) in Computing

June 2024 – April 2025

🎖️ First Class Honours

Certificate

University of the People

Pasadena, California

Associate of Science in Computer Science

Dec 2022 – April 2024

🎖️ Cumulative GPA: 3.42

University of Technology (Yatanarpon Cyber City)

Mandalay, Myanmar

Bachelor of Engineering in Mechanical Precision and Automation

Dec 2015 – March 2020

Completed four years of a six-year program (suspended due to covid-19)

Skills

Technologies I work with

Programming Languages

Python

TypeScript

Frameworks & Cloud

FastAPI

LangGraph

OpenAI

React

Google Cloud Platform

Docker

PostgreSQL

ChromaDB

Machine Learning & Deep Learning

Gradient Boosting Models

Decision Trees

Sequential Models

LSTM

RNN

GRU

Projects

What I've built

Text Data Collection Platform (for "guided-back" translation)

As part of my final year project, I developed a comprehensive translation platform with sophisticated role-based access control, featuring three distinct user types (Freelancer, QA Member, Admin) each with dedicated dashboards and permissions. Implemented a complete workflow system where Admins create translation jobs, Freelancers complete tasks, and QA Members perform quality assessment with acceptance metrics tracking. Built robust authentication with JWT, email verification, and role-specific password management flows. The platform includes an advanced appeal system allowing freelancers to challenge rejected translations with documentation upload capability. Database schema leverages SQLAlchemy ORM with efficient relationship mapping between users, tasks, and languages. Features multi-language system localization (English, Burmese, and regional dialects) to support linguistic diversity in Myanmar.

Python

TypeScript

FastAPI

PostgreSQL

React

AI-Powered Telegram Bot with RAG Pipeline

Developed an intelligent Telegram assistant leveraging Google Cloud Vertex AI with Retrieval-Augmented Generation (RAG) for contextual responses. Built on FastAPI with clean architecture, the bot automatically syncs knowledge from Google Drive documents, processes queries through Gemini 1.5 Flash model, and delivers personalized answers. Features production-ready webhook integration, automated document corpus management, containerized Cloud Run deployment, and comprehensive monitoring with structured logging and health endpoints.

Python

FastAPI

Vertex AI

RAG

Gemini 1.5

Docker

Cloud Run

Google Drive API

Let's Share – Full-Stack Clean Architecture Demo

Demonstrated professional-grade full-stack development with a clean, enterprise-style architecture emphasizing scalability, maintainability, and testability. The project implements a layered backend using FastAPI, SQLAlchemy, and PostgreSQL, following repository and service patterns with dependency injection, async data handling, and JWT-based authentication. The frontend, built with React and TypeScript, adopts a modular architecture with Zustand state management, custom hooks, and a dedicated API service layer for clean separation of concerns.

Python

FastAPI

SQLAlchemy (Async)

PostgreSQL

React 19

TypeScript

Zustand

Axios

Fact Checker — Chrome Extension with AI-Powered Backend

Built a browser extension with a FastAPI backend that allows users to fact-check selected text or images directly from any webpage. The backend leverages Google Gemini, a large language model (LLM), for real-time claim verification, web search, and OCR-based image analysis. Includes multilingual support via Google Cloud Translate and is containerized with Docker for easy deployment.

JavaScript

Python

FastAPI

Docker

Google Gemini

Google Cloud Translate

Google Cloud Platform

Chatbot that answer with your own Doc

Built a chatbot that helps users find information from their documents using text-to-vector conversion and Retrieval-Augmented Generation (RAG). Users can upload PDF or TXT files, which are processed and stored in a vector database, enabling fast and accurate responses to their queries.

Python

LangChain

ChromaDB

OpenAI

Streamlit UI

fMRI Brain Activity Classification for Music Genre Prediction

Developed a comprehensive machine learning pipeline to classify fMRI brain activity data and predict music genres based on neural responses in the auditory cortex. Implemented end-to-end workflow through three structured Jupyter notebooks covering exploratory data analysis, systematic model comparison, and final model training. Processed 22,036 brain activity features from blood-oxygenation level measurements across 160 training samples spanning five music genres (Ambient, Country, Heavy Metal, Rock, Classical). Applied advanced feature engineering with SelectKBest feature selection, PCA dimensionality reduction, and statistical significance testing. Conducted comprehensive model evaluation comparing 9+ ML algorithms including Random Forest, XGBoost, LightGBM, and SVM with cross-validation and hyperparameter optimization. Built robust data preprocessing pipeline handling feature scaling, problematic feature identification, and proper CSV data loading without headers. Achieved systematic model performance tracking with metadata persistence and confidence scoring for final predictions on 40 test samples.

Python

Scikit-learn

Pandas

NumPy

Jupyter

XGBoost

LightGBM

Football Total Goals Predictor with CatBoost ML

Developed an advanced machine learning system to predict total goals in football matches using CatBoost regression and comprehensive betting odds analysis. Built end-to-end prediction pipeline processing historical match data from 9+ European leagues including Premier League, Bundesliga, La Liga, Serie A, and Championship divisions. Implemented sophisticated feature engineering creating 16 derived features from betting odds (PSH, PSD, PSA) through polynomial transformations, geometric combinations, and complex mathematical interactions. Designed robust data preprocessing workflow handling multi-league CSV imports with encoding detection, target variable creation (FTHG + FTAG), and outlier filtering for matches with ≤8 total goals. Constructed modular Python architecture with separate modules for preprocessing, feature engineering, model training, and interactive prediction capabilities. Achieved systematic model evaluation using Mean Absolute Error (MAE) with train/validation/test splits and implemented real-time single-match prediction interface with probability distribution visualization using matplotlib and scipy. Deployed configurable YAML-based hyperparameter management and integrated Poetry dependency management for reproducible development environment.

Python

CatBoost

Scikit-learn

Pandas

NumPy

Matplotlib

Scipy

Poetry

YAML

Certifications & Courses

My continuous learning journey

AI Agentic Design Patterns with AutoGen

DeepLearning.AI

September 2024

AI Agents in LangGraph

DeepLearning.AI

July 2024

Deep Learning Specialization by Andrew Ng

Coursera

April 2023

Machine Learning with Python

freeCodeCamp.org

February 2023

CS50's Introduction to Computer Science

Harvard University

January 2023

Contact Me

Let's get in touch

Feel free to reach out if you're looking for a developer, have a question, or just want to connect.

itsakphyo@gmail.com