Zhiwen Chen

MSc Health Data Science @ UCL · BSc Computer Science @ Birmingham (2:1) · Targeting UK early-career roles in AI, ML, NLP and Health Data.

London, United Kingdom

About

I am completing an MSc in Health Data Science at UCL, following a BSc in Computer Science at Birmingham (2:1, Jun 2025). My work sits at the intersection of machine learning, natural language processing, and healthcare data, ranging from ontology-aware clinical entity linking with SNOMED CT to a novel three-phase evolutionary algorithm for multi-objective Neural Architecture Search.

I am looking for an internship, graduate role, or junior position in the UK where I can apply ML and NLP to real problems, particularly in healthcare, digital health, and applied AI.

Selected Work

Ontology-aware Clinical Entity Linking with SNOMED CT

UCL MSc research · Ongoing

  • Two-stage entity linking framework for clinical text: candidate generation followed by hierarchy-informed reranking.
  • Reranker combines semantic similarity with SNOMED CT structural signals — hierarchy distance, shared ancestors, structural consistency.
  • Evaluating disambiguation accuracy against a non-hierarchical baseline.

Heart Failure Prediction from Clinical Features

UCL MSc coursework · 2026

  • End-to-end ML workflow on structured clinical variables, with EDA, missingness review, and comparative feature visualisation.
  • Compared classifiers using ROC-AUC, precision, recall, and F1-score, prioritising clinically interpretable feature importance.

NLP Algorithm Intern — Shanghai Zhiyu Information Technology

Beijing, Jun 2025 – Sep 2025

  • Built an OCR + NLP pipeline for automated identification of lending-related clauses in scanned corporate charter documents.
  • Designed page-level pre-processing (text extraction, region localisation, document cleaning) and iterated on both model and data refinement.

Skills

Languages

Python · R · SQL · Java · JavaScript · C · Haskell

ML / NLP

PyTorch · Hugging Face Transformers · PaddleNLP · Information Extraction · Multi-objective Optimisation · Neural Architecture Search

Data & Genomics

EDA · Statistical evaluation · Benchmarking · PLINK · GWAS · Variant QC

Tools

Git / GitHub · Linux / Unix · OCR tooling · LaTeX

Education

University College London — MSc Health Data Science · Sep 2025 – Sep 2026 (expected)

AI in Healthcare · Advanced Methods in Data Science and Statistics · Principles of Health Data Science · Applied Computational Genomics · Epidemiology for Data Science · Programming with Python for Health Research.

University of Birmingham — BSc Computer Science · 2:1 · Sep 2021 – Jun 2025

Artificial Intelligence · Natural Language Processing · Evolutionary Computation · Data Structures and Algorithms · Operating Systems · Software Engineering · Mathematics and Logic Foundations.