ciro.dev
GPU
0%
MEM
0%
TEMP
0°C
   ██████╗██╗██████╗  ██████╗
  ██╔════╝██║██╔══██╗██╔═══██╗
  ██║     ██║██████╔╝██║   ██║
  ██║     ██║██╔══██╗██║   ██║
  ╚██████╗██║██║  ██║╚██████╔╝
   ╚═════╝╚═╝╚═╝  ╚═╝ ╚═════╝
    

Ciro Zhang

SunbathingFish

Hi! I’m Ciro Zhang, a recent UCSD grad (B.S. Data Science & Computer Engineering) heading to Harvard for my S.M. in Data Science this fall. I build machine learning systems that tackle real-world problems — from generative AI for gene function prediction to bioluminescence forecasting on the California coast. I also have a published paper in computational pathology and have TAed 600+ students across ML and data science courses.

ciro@dev:~$
Welcome to Ciro's Portfolio Terminal v1.0
Type 'help' for available commands
guest@ciro.dev:~$
ACCURACY 0.0000
LOSS 0.0000
EPOCH 0/100

Education 0x2000

Harvard

Harvard University

S.M. Data Science

Incoming Fall 2026

UCSD

UC San Diego

B.S. Data Science & Computer Engineering

Sept 2022 – June 2026

Senior Capstone Communication Lower Bounds for Multi-Party Graph Traversal (Advisor: Dr. Shachar Lovett)  Site →
Honor Society IEEE-HKN Eta Kappa Nu at UCSD (Kappa Psi Chapter)
DSC 140AProbabilistic Modeling and Machine Learning
DSC 140BRepresentation Learning
CSE 151ALearning Algorithms
CSE 151BDeep Learning
CSE 152AComputer Vision
CSE 156Natural Language Processing
CSE 158Recommender Systems and Web Mining
CSE 190Machine Learning for Music and Audio
DSC 40ABTheoretical Foundations of Data Science
DSC 80Practice and Application of Data Science
DSC 100Data Management
DSC 102Systems for Scalable Analytics
DSC 106Data Visualization
DSC 120Signal Processing for Data Analysis
MATH 183Statistical Methods
MATH 189Exploratory Data Analysis and Inference
CSE 20Discrete Mathematics
CSE 100Advanced Data Structures
CSE 101Design and Analysis of Algorithms
CSE 105Theory of Computation
CSE 30Computer Organization and Systems Programming
CSE 120Principles of Computer Operating Systems
CSE 140LDigital Systems
CSE 141LComputer Architecture
ECE 111Advanced Digital Design Project
ECE 35Introduction to Analog Design
ECE 45Circuits and Systems
ECE 65Components and Circuits Laboratory
ECE 101Linear Systems Fundamentals
ECE 109Engineering Probability and Statistics
CSE 15LSoftware Tools and Techniques Laboratory
CSE 109Programming Contests
CSE 110Software Engineering

Experiences 0x3000

Active Dec 2025 – Present

GenAI Researcher

Electrical & Computer Engineering · UC San Diego (Advisor: Dr. Pengtao Xie)

Developing GeneChat, a multimodal LLM that generates interpretable gene explanations from DNA sequences. The model uses a hybrid DNABERT + Vicuna-13B architecture trained on 100k+ gene sequences on A100 GPUs, and is benchmarked against GPT-4o, LLaMA 3, and Gemini on gene function prediction tasks.

Jan 2025 – Jan 2026

ML Researcher

Scripps Institution of Oceanography · UC San Diego (Advisor: Dr. George Sugihara)

Developed EDM-LSTM, a hybrid model combining empirical dynamic modeling with LSTM for ocean bioluminescence forecasting. Trained on 1,000+ weeks of San Diego ocean data and achieved a 23% AUC improvement over standalone EDM and LSTM baselines.

Jun 2025 – Sept 2025

Data Engineering Intern

BC Cancer Agency

Worked on bioinformatics data pipelines, writing Groovy automation scripts inside Nextflow to preprocess cellular and gene expression datasets. Built Python tooling to detect and fix formatting issues across large experimental files, and used QuPath pipelines to extract quantitative features from cell imaging data.

Jun 2024 – Jun 2025

CV Researcher

Tea Labs · University of British Columbia (Advisor: Dr. Li Xiaoxiao)

Built CV pipelines for whole-slide imaging (WSI) pathology analysis using YOLO. Designed a multi-stage slide processing pipeline covering blur/edge filtering, patch extraction, and parallel GPU inference. Also developed semi-supervised dataset pipelines to scale model training despite limited expert annotations.

Jul 2024 – Sept 2024

Research Intern

BiMBA · Peking University (Advisor: Dr. Ma Jingjing)

Built an automated pipeline to scrape and analyze trending keywords from Tencent platforms, studying factors that drive online charitable donations. Integrated LLM APIs to generate structured trend reports and automatically publish findings to Feishu Sheets.

Teaching 0x4000

DSC 80 Active
Practice and Application of Data Science
Instructional Assistant · Sept 2025 – Present · 1 term
Machine Learning: Learning Algorithms
Instructional Assistant · Mar 2025 – June 2025 · 1 term
Principles of Data Science
Instructional Assistant · Sept 2023 – Mar 2025 · 5 terms

Publications 0x4500

2022 12 citations

Histological subtype is associated with PD-L1 expression and CD8+ T-cell infiltrates in triple-negative breast carcinoma

Annals of Diagnostic Pathology · Salisbury T, Abozina A, Zhang C, Mao E, Banyi N, Leo J, Ionescu D, Zhou C, Wang G

Investigated the relationship between tumor histological subtypes and immune markers (PD-L1 expression, CD8+ T-cell infiltration) across 72 triple-negative breast carcinoma cases, identifying subtype-specific patterns with implications for immunotherapy response prediction.

Bigger Group Projects 0x5000

DSC10 Practice Platform

Python • Pandoc • BeautifulSoup • Educational Tool

Practice problem platform for UCSD's DSC 10 course hosting past exams and discussion materials. Built LaTeX-to-Markdown conversion tools and standardized problem formatting system.

HAB Forecasting: Harmful Algal Bloom Prediction (WIP)

Python • EDM–LSTM • Time-Series Forecasting • Oceanographic Data

Developed and deployed a web platform with the HKN project team for forecasting bioluminescent harmful algal blooms (HABs) at UCSD Scripps, integrating our EDM–LSTM model with real-time coastal monitoring data.

Personal Projects 0x6000