About Me
Get to know me better
I'm Aung Khant Phyo, a Computing graduate from the University of Greenwich. I started in mechatronics but shifted to computer science after discovering the power of IT during the COVID-19 lockdown.
Since then, I've developed skills in data analysis using Python and PowerBI, and have experience in machine learning with frameworks like TensorFlow and PyTorch. I've worked as a data analyst for a social listening company and as a backend engineer for a Gen AI application development company. I'm especially enthusiastic about solving global challenges with information technology.
Education
My academic background
London, United Kingdom
Bachelor of Science (Honours) in Computing
June 2024 – April 2025
First Class Honours
Certificate

Pasadena, California
Associate of Science in Computer Science
Dec 2022 – April 2024
Cumulative GPA: 3.42
Mandalay, Myanmar
Bachelor of Engineering in Mechanical Precision and Automation
Dec 2015 – March 2020
Completed four years of a six-year program (suspended due to covid-19)
Skills
Technologies I work with
Programming Languages
Frameworks & Cloud
Machine Learning & Deep Learning
Projects
What I've built
As part of my final year project, I developed a comprehensive translation platform with sophisticated role-based access control, featuring three distinct user types (Freelancer, QA Member, Admin) each with dedicated dashboards and permissions. Implemented a complete workflow system where Admins create translation jobs, Freelancers complete tasks, and QA Members perform quality assessment with acceptance metrics tracking. Built robust authentication with JWT, email verification, and role-specific password management flows. The platform includes an advanced appeal system allowing freelancers to challenge rejected translations with documentation upload capability. Database schema leverages SQLAlchemy ORM with efficient relationship mapping between users, tasks, and languages. Features multi-language system localization (English, Burmese, and regional dialects) to support linguistic diversity in Myanmar.
Developed an intelligent Telegram assistant leveraging Google Cloud Vertex AI with Retrieval-Augmented Generation (RAG) for contextual responses. Built on FastAPI with clean architecture, the bot automatically syncs knowledge from Google Drive documents, processes queries through Gemini 1.5 Flash model, and delivers personalized answers. Features production-ready webhook integration, automated document corpus management, containerized Cloud Run deployment, and comprehensive monitoring with structured logging and health endpoints.
Built a browser extension with a FastAPI backend that allows users to fact-check selected text or images directly from any webpage. The backend leverages Google Gemini, a large language model (LLM), for real-time claim verification, web search, and OCR-based image analysis. Includes multilingual support via Google Cloud Translate and is containerized with Docker for easy deployment.
Built a chatbot that helps users find information from their documents using text-to-vector conversion and Retrieval-Augmented Generation (RAG). Users can upload PDF or TXT files, which are processed and stored in a vector database, enabling fast and accurate responses to their queries.
Developed a comprehensive machine learning pipeline to classify fMRI brain activity data and predict music genres based on neural responses in the auditory cortex. Implemented end-to-end workflow through three structured Jupyter notebooks covering exploratory data analysis, systematic model comparison, and final model training. Processed 22,036 brain activity features from blood-oxygenation level measurements across 160 training samples spanning five music genres (Ambient, Country, Heavy Metal, Rock, Classical). Applied advanced feature engineering with SelectKBest feature selection, PCA dimensionality reduction, and statistical significance testing. Conducted comprehensive model evaluation comparing 9+ ML algorithms including Random Forest, XGBoost, LightGBM, and SVM with cross-validation and hyperparameter optimization. Built robust data preprocessing pipeline handling feature scaling, problematic feature identification, and proper CSV data loading without headers. Achieved systematic model performance tracking with metadata persistence and confidence scoring for final predictions on 40 test samples.
Developed an advanced machine learning system to predict total goals in football matches using CatBoost regression and comprehensive betting odds analysis. Built end-to-end prediction pipeline processing historical match data from 9+ European leagues including Premier League, Bundesliga, La Liga, Serie A, and Championship divisions. Implemented sophisticated feature engineering creating 16 derived features from betting odds (PSH, PSD, PSA) through polynomial transformations, geometric combinations, and complex mathematical interactions. Designed robust data preprocessing workflow handling multi-league CSV imports with encoding detection, target variable creation (FTHG + FTAG), and outlier filtering for matches with ≤8 total goals. Constructed modular Python architecture with separate modules for preprocessing, feature engineering, model training, and interactive prediction capabilities. Achieved systematic model evaluation using Mean Absolute Error (MAE) with train/validation/test splits and implemented real-time single-match prediction interface with probability distribution visualization using matplotlib and scipy. Deployed configurable YAML-based hyperparameter management and integrated Poetry dependency management for reproducible development environment.
Certifications & Courses
My continuous learning journey
DeepLearning.AI
September 2024
DeepLearning.AI
July 2024
Coursera
April 2023
freeCodeCamp.org
February 2023
Harvard University
January 2023
Contact Me
Let's get in touch