This repository contains the analysis pipeline used in our study of early embryonic development in rainbow trout (Oncorhynchus mykiss) and zebrafish (Danio rerio). The primary focus is on TOMO-seq (tomographic RNA sequencing), applied to three experimental datasets spanning the freshly fertilized egg to the early blastula stage. The github layout and code workflow has been optimized with the help of ChatGPT to improve readability
Publication: "RNA redistribution driven by alterations in transcription during early embryogenesis of rainbow trout" (Developmental Biology, 2026).
The analysis is based on TOMO-seq across multiple experiments involving early embryonic stages:
- Trout TOMO-seq: O. mykiss – 0 hpf (freshly fertilized egg/zygote), 1 hpf (early cleavage stage), 1 dpf (early blastula stage, ~32-64 cells), and 3 dpf (mid-blastula, ~1000 cells)
- Trout-hybrid TOMO-seq: O. mykiss egg fertilized with Salvelinus fontinalis sperm – 0 hpf (freshly fertilized egg/zygote), 1 hpf (early cleavage stage), 1 dpf (early blastula stage, ~32-64 cells), and 3 dpf (mid-blastula, ~1000 cells)
- Zebrafish TOMO-seq: Danio rerio – 1 hpf (~4-cell), 2 hpf (~64-cell), and 3 hpf (~1000-cell) stages
Explore spatial and temporal gene expression dynamics using our interactive flexdashboard.
📋 Features:
- Visualize normalized gene expression across embryonic stages and spatial sections
- Search by gene symbols from O. mykiss, S. fontinalis, D. rerio
- Cross-species ortholog mapping of differentially localized transcripts (DLTs)
- Genome-specific profiles for hybrid trout (genome1 = O. mykiss, genome2 = S. fontinalis)
🚀 How to use:
- Download the
flexdashboard/folder - Open the
.Rmdfile in RStudio - Install required R packages
- Set the
folder_pathto qs2 file
🔍 Use gene_id to search for genes and visualize expression patterns.
Raw sequencing data for all experiments are publicly available at NCBI GEO: Super series: GSE304790
📂 Scripts available in codebase.
Most required package versions are tracked via 'renv.lock'. To recreate the environment copy the supplied 'renv.lock' to your working folder followed by: renv::restore()
01 FastQ Preprocessing
Objective: Preprocess FastQ reads by removing low-quality reads, mapping to the reference genome, and quantifying transcript counts.
- Script workflow
- Import FastQ
- Quality control (fastqc)
- Adaptor trimming (fastp)
- rRNA filter (RiboDetector)
- mtDNA filter (sortmerna)
- Alignment (STAR)
- Transcript counting (htseq-count)
02 Differential Analysis
Objective: Identify differentially localized transcripts (DLTs) across sections within and between stages.
- Script workflow
- Import count data
- Normalize counts using DESeq2
- Perform differential analysis (DESeq2)
- Filter significant transcripts (DLTs):
- padj < 0.01
03 TrendCatcher
Objective: Additional method to identify section-specific expression trends across development.
- Script workflow
- Import normalized DLT counts
- Identify trends using trendcatcher
- Filter significant transcripts (DLTs):
- padj < 0.01
- Minimum 100-count difference between conditions
04 Identify Unique Clusters
Objective: Identify spatially or temporally co-localized gene clusters.
- Script workflow
- Import normalized DLT counts
- Filter DLTs: normalized count > 20 in ≥1 stage
- Rule-based classification of profiles
- Import variance stabilization transformation (VST) of DLTs
- Cluster VST-transformed genes (DEGReport)
- Build summary DLT report table (summarized expression, profiles)
05 Orthology
Objective: Map DLTs to zebrafish or human gene symbols.
- Script workflow
- Import DLT report table
- Extract protein sequence of DLTs
- Protein similarity (via Proteinortho)
- Integrate ortholog information from NCBI, Ensembl BioMart, and orthogene
- Add orthologous reference gene symbols to DLT report table
06 Ligand–Receptor
Objective: Analyze spatial ligand–receptor interactions between extreme animal and vegetal poles.
- Identify DLTs and orthologs specific to animal/vegetal poles
Script workflow - Match ligands and receptors using DanioTalk
Script workflow - Visualize interactions using chord diagrams
Script workflow
07 Gene Ontology (GO) and Pathway enrichment
Objective: Identify enriched GO terms and pathway activities for gene clusters.
- Script workflow
- Import DLT report table
- Anchor genes to zebrafish orthologs
- Perform GO enrichment using gprofiler2 and goCompass
- Estimate pathway activity changes with decoupleR
08 Regulons
Objective: Detect transcription factors and their targets active across stages and sections.
- Script workflow
- Import DLT report table
- Anchor to zebrafish orthologs
- Compute gene regulatory networks using SCENIC
- Identify and score regulons (TF–target modules) using SCENIC
This section contains the R code used to generate the figures for our manuscript, "RNA redistribution driven by alterations in transcription during early embryogenesis of rainbow trout" (Developmental Biology, 2026).
📂 Scripts available in manuscript_figures.

