pman_diaphragm_rnaseq

This repository is for data processing and analysis related to Schweizer et al. (2023), "Gene regulatory changes underlie developmental plasticity in respiration and aerobic performance in highland deer mice" with Catherine M. Ivy, Chandrasekhar Natarajan, Graham R. Scott, Jay F. Storz, and Zachary A. Cheviron.

Below, users will find example commands for analyzing raw RNAseq fastq data to generate count data. In the .Rmarkdown file, users will find all the analyses to perform gene expression analyses in R, including differential gene expression analysis with EdgeR, identifying regulatory modules in WGCNA, testing for correlations of WGCNA modules with phenotypes, testing for effects of treatment on module expression, and identifying overap with candidate genes for high-altitude adaptation. There is also an R script for plotting output and results from REVIGO analysis.

Processing raw RNAseq fastq reads

Run BBduk to remove adapters from paired end RNAseq data.

ls *_R1_001.fastq.gz | while read file; do
        name=$(echo "${file}" | cut -d "/" -f 8 | cut -d "." -f 1 | cut -d "_" -f 1)
        echo "${name}"                                                                  
        if [ -f "fastq_trimmed/${name}.1.fq.gz" ]; then
                echo "exists; skipping..."                              #
        else
               bbmap/bbduk.sh -Xmx1g \
                in1="${name}_R1_001.fastq.gz" \
                in2="${name}_R2_001.fastq.gz" \
                out1="fastq_trimmed/${name}.1.bb.fq.gz" \
                out2="fastq_trimmed/${name}.2.bb.fq.gz" \
                ref=adapters.fa \
                ktrim=r k=23 mink=11 hdist=1 tpe tbo
        fi
done         
echo "Done!"

Run trimmomatic to trim junk off of reads.

ls *.1.bb.fq.gz | while read file; do 
        name=$(echo "${file}" | cut -d "." -f 1) 		
        echo "${name}" 
        if [ -f "${name}.fastp.trimmed_1P.fq.gz" ]; then 		
                echo "exists; skipping..."				
        else

                trimmomatic PE -threads 8 -phred33 -trimlog \
                	"${name}.trimmomatic.log" "${file}" "${name}.2.bb.fq.gz" \
                	-baseout "${name}.fastp.trimmed.fq.gz" \
                	ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:8:TRUE LEADING:3 TRAILING:3 SLIDINGWINDOW:5:20 MINLEN:36 
        fi
done
echo "Done!"

Align reads to deer mouse genome with HiSat2.


ls *.trimmed_1P.fq.gz | while read file; do 
        name=$(echo "${file}" | cut -d "." -f 1) 		
        echo "${name}" 									
	~/hisat2-2.2.1/hisat2 -p 48 --mp 2,0 -q -x ~/pman2_ref/GCA_003704035.1_HU_Pman_2.1_genomic_wMt.fa -1 "${name}.fastp.trimmed_1P.fq.gz" -2 "${name}.fastp.trimmed_2P.fq.gz" -U "${name}.fastp.trimmed_1U.fq.gz" --summary-file "${name}_align_mp20.summ.txt" --un "${name}.mp20.unmapped.fq" | samtools view -Sbo "../bam/${name}_Halign_wMt_mp20.bam" 
	samtools sort ../bam/${name}_Halign_wMt_mp20.bam -o ../bam/${name}_Halign_wMt_mp20.s.bam
done
echo "Done!"

Generate gene count data with featureCounts.

featureCounts -p -O -F GTF -a ~/pman2_ref/Peromyscus_maniculatus_bairdii.HU_Pman_2.1.104.gtf -o pman_diaphragm_wMt_mp20_readcounts.txt *_Halign_wMt_mp20.bam

Analyzing gene expression count data in R

See diaphragm_RNAseq_analysis.Rmd

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data_files		data_files
README.md		README.md
diaphragm_RNAseq_analysis.Rmd		diaphragm_RNAseq_analysis.Rmd
diaphragm_RNAseq_analysis.html		diaphragm_RNAseq_analysis.html
plot_revigo_output.R		plot_revigo_output.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pman_diaphragm_rnaseq

Processing raw RNAseq fastq reads

Analyzing gene expression count data in R

About

Releases

Packages

Languages

renaschweizer/pman_diaphragm_rnaseq

Folders and files

Latest commit

History

Repository files navigation

pman_diaphragm_rnaseq

Processing raw RNAseq fastq reads

Analyzing gene expression count data in R

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages