veedeeve/README.md

Vy Thao Dang | Houston, TX

Bioinformatician · Computational Biology Researcher · MSc Biomedical Informatics

Email LinkedIn GitHub


About Me

I am a computational biology researcher building cross-disciplinary expertise in transcriptomics, structural bioinformatics, and high-performance data pipelines. My work focuses on developing reproducible workflows across multiple domains of bioinformatics while strengthening both biological insight and computational rigor.

My interest in biotechnology started with a fascination for how rapidly the medical field was advancing and for the idea that science could fundamentally change how diseases are treated. I want to be part of work that improves people's lives by connecting computational tools with high-impact research.


Current Focus

  • Protein structure trajectory modeling using deep learning-generated ensembles
  • Comparative analysis of BioEmu structural ensembles vs. molecular dynamics simulations
  • Building reproducible pipelines on Linux-based HPC environments

Languages & Tools

Languages

Python R Bash SQL

Computing Environments

Linux HPC Git Jupyter Conda

NGS & Bioinformatic Tools

HISAT2 BWA-MEM SAMtools GATK4 FastQC MultiQC Trimmomatic featureCounts Funcotator SPAdes Prokka BLAST QUAST

Data Analysis & Visualization

ggplot2 matplotlib pandas numpy EnhancedVolcano Tableau

Statistical & Bioconductor

limma-voom clusterProfiler DESeq2


Featured Projects

🌸 RNA-Seq Differential Expression — NUDT21 Knockdown

End-to-end RNA-seq analysis pipeline for NUDT21 knockdown including gene set enrichment analysis.

Key results

  • 11,450 genes differentially expressed at FDR < 0.05
  • Ribosome biogenesis & RNA processing pathways strongly affected
  • Results suggest that NUDT21 regulates core proliferative and RNA-processing programs
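The FDR cutoff in the results above refers to p-values adjusted for multiple testing. As a rough illustration, here is a minimal NumPy sketch of the Benjamini-Hochberg adjustment, which is the default method in limma's `topTable`. Whether this project kept that default is an assumption, and the function name `benjamini_hochberg` is hypothetical; the repository's own analysis is done in R.

```python
import numpy as np


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values (illustrative sketch)."""
    p = np.asarray(pvalues, dtype=float)
    n = p.size
    # Sort p-values ascending and scale each by n / rank.
    order = np.argsort(p)
    scaled = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity: an adjusted value can never exceed the
    # adjusted value of any larger p-value.
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    # Put the adjusted values back in the original gene order.
    adjusted = np.empty(n)
    adjusted[order] = np.clip(scaled, 0.0, 1.0)
    return adjusted


def count_significant(pvalues, fdr=0.05):
    """Count tests whose adjusted p-value falls below the FDR cutoff."""
    return int(np.sum(benjamini_hochberg(pvalues) < fdr))
```

Calling `count_significant` on a vector of per-gene p-values gives the kind of count reported in the first bullet, under the assumptions stated above.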

HISAT2 SAMtools featureCounts limma-voom clusterProfiler ggplot2 EnhancedVolcano

View Repository


🌸 WGS Germline Human Variant Calling & Functional Annotation

Germline variant discovery workflow following GATK Best Practices.

Key results

  • Completed a full germline variant calling workflow with GATK4
  • Generated high-confidence SNP and INDEL callsets from aligned WGS data
  • Annotated variants using Funcotator to extract gene-level information

GATK4 BWA-MEM SAMtools FastQC Funcotator MultiQC

View Repository


🌸 Genome Assembly and Annotation — Clostridium thermocellum

De novo genome assembly using short-read sequencing data followed by structural and functional annotation.

Key results

  • A 3.45 Mb draft genome assembled across 100 contigs, consistent with expected genome size for C. thermocellum
  • Prokka identified 2,980 coding sequences (CDS), 52 tRNAs, 4 rRNAs, and 1 tmRNA
  • 738 of the 2,980 predicted proteins (24.77%) had high-confidence BLASTP matches in Swiss-Prot

SPAdes QUAST Prokka BLAST FastQC Trimmomatic

View Repository


Protein Structural Trajectory Modeling

Trajectory inference of protein isoforms using RMSD-based clustering and structural similarity analysis. Comparative analysis between deep learning-generated structural ensembles (BioEmu) and molecular dynamics simulations to evaluate model credibility and accuracy.

Python MDAnalysis PyTorch
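To make the idea of RMSD-based clustering concrete, here is a minimal NumPy-only sketch. It computes the RMSD between two conformations after optimal superposition (the Kabsch algorithm) and then groups frames with a simple greedy leader clustering. Both function names, the cutoff value, and the choice of leader clustering are assumptions for illustration; the project itself lists MDAnalysis and may use a different clustering method.

```python
import numpy as np


def kabsch_rmsd(a, b):
    """RMSD between two (N, 3) coordinate arrays after optimal superposition."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Remove translation by centering both structures on their centroids.
    a = a - a.mean(axis=0)
    b = b - b.mean(axis=0)
    # Singular values of the covariance matrix give the best achievable overlap.
    u, s, vt = np.linalg.svd(a.T @ b)
    # Flip the smallest singular value if the optimal transform would be a reflection.
    if np.linalg.det(u @ vt) < 0:
        s[-1] = -s[-1]
    msd = (np.sum(a * a) + np.sum(b * b) - 2.0 * s.sum()) / len(a)
    return float(np.sqrt(max(msd, 0.0)))


def leader_cluster(frames, cutoff):
    """Greedy clustering: each frame joins the first leader within `cutoff` RMSD."""
    leaders, labels = [], []
    for frame in frames:
        for idx, leader in enumerate(leaders):
            if kabsch_rmsd(frame, leader) <= cutoff:
                labels.append(idx)
                break
        else:
            # No existing leader is close enough, so this frame starts a new cluster.
            leaders.append(frame)
            labels.append(len(leaders) - 1)
    return labels
```

In this sketch each frame is an array of atom coordinates (for example C-alpha positions) with the same atom ordering, and the cutoff is in the same length units as the coordinates.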


Open to research roles, internships, and full-time positions in bioinformatics and computational biology.

Popular repositories

  1. vydang.github.io (Public)

    Vy Dang's Bioinformatic Portfolio

    Jupyter Notebook

  2. thermocellum-genome-assembly-and-annotation (Public)

    De-novo genome assembly & annotation of C. thermocellum.

    Shell

  3. rna-seq-nudt21-functional-analysis (Public)

    End-to-end RNA-seq analysis pipeline for NUDT21 knockdown including read QC, alignment (HISAT2), quantification (featureCounts), differential expression (limma-voom), and GO-based Gene Set Enrichme…

    R

  4. wgs-germline-variant-pipeline (Public)

    End-to-end germline variant discovery workflow following GATK Best Practices. Includes alignment (BWA-MEM), BQSR, SNP/INDEL calling and functional annotation (Funcotator)

    Shell

  5. veedeeve (Public)