Hello, I'm
Data Engineer with 3+ years of experience specialising in ETL pipelines, distributed data processing, and cloud data infrastructure. Currently pursuing an MSc in Data & Computer Science at Heidelberg University under a DAAD scholarship.
Big Data platform for trend analysis from large online news datasets using Scrapy, Spark, and HDFS. Built Elasticsearch data warehouse with Flask + ReactJS interfaces.
Integrated platform for managing and processing gamer behaviour data using Python, Redis, MongoDB, BigQuery, and Looker for advanced visualisation.
Lambda architecture system correlating Binance crypto transactions with Twitter sentiment in real-time using Spark Streaming, HDFS, and Cassandra.
NLP-based sentiment analysis of Spotify app reviews integrating RNN, LSTM, GRU, Bi-LSTM, and Transformer models built with Python and TensorFlow.
Streamlined financial data collection, storage, and analysis pipeline using Python, Airflow for orchestration, and AWS Cloud services for scalable infrastructure.
Personalised movie recommendation engine using Regression, KNN, and SGD models in Python, improving search results and recommendation accuracy.
Heidelberg University, Germany
49th Global · THE World University Rankings 2026 · DAAD Scholarship
Hanoi University of Science and Technology
GPA 3.80 / 4.0 · 2nd in CS Vietnam (QS Ranking)
Bigdata Storage & Processing · Deep Learning · Software Design & Construction
Le Hong Phong High School for the Gifted
GPA 8.7 / 10
Highly competitive DAAD scholarship for STEM disciplines supporting Master studies in Germany.
National university competition · Algebra · Ranked 6th out of 161 contestants.
National high school competition specialising in mathematics.
Northern high school mathematics competition.
Associated with Hitachi Digital Services.
Associated with Hitachi Digital Services.
Associated with Hitachi Digital Services.
Online certification.
Online certification.
Excellent classification · Conduct score above 90/100.
I'm open to new opportunities, collaborations, or just a chat about data engineering.