Aung Khant Phyo

Aung Khant Phyo

Software Engineer & AI Developer

About Me

Get to know me better

I'm Aung Khant Phyo, a Computing graduate from the University of Greenwich. I started in mechatronics but shifted to computer science after discovering the power of IT during the COVID-19 lockdown.

Since then, I've developed skills in data analysis using Python and PowerBI, and have experience in machine learning with frameworks like TensorFlow and PyTorch. I've worked as a data analyst for a social listening company and as a backend engineer for a Gen AI application development company. I'm especially enthusiastic about solving global challenges with information technology.

Education

My academic background

University of Greenwich

London, United Kingdom

Bachelor of Science (Honours) in Computing

June 2024 – April 2025

First Class Honours

Certificate

University of Greenwich Certificate
University of the People

Pasadena, California

Associate of Science in Computer Science

Dec 2022 – April 2024

Cumulative GPA: 3.42

University of Technology (Yatanarpon Cyber City)

Mandalay, Myanmar

Bachelor of Engineering in Mechanical Precision and Automation

Dec 2015 – March 2020

Completed four years of a six-year program (suspended due to covid-19)

Skills

Technologies I work with

Programming Languages

Python
TypeScript

Frameworks & Cloud

FastAPI
LangGraph
OpenAI
React
Google Cloud Platform
Docker
PostgreSQL
ChromaDB

Machine Learning & Deep Learning

Gradient Boosting Models
Decision Trees
Sequential Models
LSTM
RNN
GRU

Projects

What I've built

Text Data Collection Platform (for "guided-back" translation)

As part of my final year project, I developed a comprehensive translation platform with sophisticated role-based access control, featuring three distinct user types (Freelancer, QA Member, Admin) each with dedicated dashboards and permissions. Implemented a complete workflow system where Admins create translation jobs, Freelancers complete tasks, and QA Members perform quality assessment with acceptance metrics tracking. Built robust authentication with JWT, email verification, and role-specific password management flows. The platform includes an advanced appeal system allowing freelancers to challenge rejected translations with documentation upload capability. Database schema leverages SQLAlchemy ORM with efficient relationship mapping between users, tasks, and languages. Features multi-language system localization (English, Burmese, and regional dialects) to support linguistic diversity in Myanmar.

Python
TypeScript
FastAPI
PostgreSQL
React
AI-Powered Telegram Bot with RAG Pipeline

Developed an intelligent Telegram assistant leveraging Google Cloud Vertex AI with Retrieval-Augmented Generation (RAG) for contextual responses. Built on FastAPI with clean architecture, the bot automatically syncs knowledge from Google Drive documents, processes queries through Gemini 1.5 Flash model, and delivers personalized answers. Features production-ready webhook integration, automated document corpus management, containerized Cloud Run deployment, and comprehensive monitoring with structured logging and health endpoints.

Python
FastAPI
Vertex AI
RAG
Gemini 1.5
Docker
Cloud Run
Google Drive API
Fact Checker — Chrome Extension with AI-Powered Backend

Built a browser extension with a FastAPI backend that allows users to fact-check selected text or images directly from any webpage. The backend leverages Google Gemini, a large language model (LLM), for real-time claim verification, web search, and OCR-based image analysis. Includes multilingual support via Google Cloud Translate and is containerized with Docker for easy deployment.

JavaScript
Python
FastAPI
Docker
Google Gemini
Google Cloud Translate
Google Cloud Platform
Chatbot that answer with your own Doc

Built a chatbot that helps users find information from their documents using text-to-vector conversion and Retrieval-Augmented Generation (RAG). Users can upload PDF or TXT files, which are processed and stored in a vector database, enabling fast and accurate responses to their queries.

Python
LangChain
ChromaDB
OpenAI
Streamlit UI
fMRI Brain Activity Classification for Music Genre Prediction

Developed a comprehensive machine learning pipeline to classify fMRI brain activity data and predict music genres based on neural responses in the auditory cortex. Implemented end-to-end workflow through three structured Jupyter notebooks covering exploratory data analysis, systematic model comparison, and final model training. Processed 22,036 brain activity features from blood-oxygenation level measurements across 160 training samples spanning five music genres (Ambient, Country, Heavy Metal, Rock, Classical). Applied advanced feature engineering with SelectKBest feature selection, PCA dimensionality reduction, and statistical significance testing. Conducted comprehensive model evaluation comparing 9+ ML algorithms including Random Forest, XGBoost, LightGBM, and SVM with cross-validation and hyperparameter optimization. Built robust data preprocessing pipeline handling feature scaling, problematic feature identification, and proper CSV data loading without headers. Achieved systematic model performance tracking with metadata persistence and confidence scoring for final predictions on 40 test samples.

Python
Scikit-learn
Pandas
NumPy
Jupyter
XGBoost
LightGBM
Football Total Goals Predictor with CatBoost ML

Developed an advanced machine learning system to predict total goals in football matches using CatBoost regression and comprehensive betting odds analysis. Built end-to-end prediction pipeline processing historical match data from 9+ European leagues including Premier League, Bundesliga, La Liga, Serie A, and Championship divisions. Implemented sophisticated feature engineering creating 16 derived features from betting odds (PSH, PSD, PSA) through polynomial transformations, geometric combinations, and complex mathematical interactions. Designed robust data preprocessing workflow handling multi-league CSV imports with encoding detection, target variable creation (FTHG + FTAG), and outlier filtering for matches with ≤8 total goals. Constructed modular Python architecture with separate modules for preprocessing, feature engineering, model training, and interactive prediction capabilities. Achieved systematic model evaluation using Mean Absolute Error (MAE) with train/validation/test splits and implemented real-time single-match prediction interface with probability distribution visualization using matplotlib and scipy. Deployed configurable YAML-based hyperparameter management and integrated Poetry dependency management for reproducible development environment.

Python
CatBoost
Scikit-learn
Pandas
NumPy
Matplotlib
Scipy
Poetry
YAML

Certifications & Courses

My continuous learning journey

AI Agentic Design Patterns with AutoGen

DeepLearning.AI

September 2024

View Certificate
AI Agents in LangGraph

DeepLearning.AI

July 2024

View Certificate
AI Deep Learning Specialization by Andrew Ng

Coursera

April 2023

View Certificate
Machine Learning with Python

freeCodeCamp.org

February 2023

View Certificate
CS50's Introduction to Computer Science

Harvard University

January 2023

View Certificate

Contact Me

Let's get in touch

Feel free to reach out if you're looking for a developer, have a question, or just want to connect.

Bangkok, Thailand
(+66) 62 965 2927
itsakphyo@gmail.com