Tag Archives: gonad

DNA Isolation & Quantification – C. virginica Gonad gDNA

I isolated DNA from the Crassotrea virginica gonad samples sent by Katie Lotterhos using the E.Z.N.A. Mollusc Kit with the following modifications:

  • Samples were homogenized with plastic, disposable pestle in 350μL of ML1 Buffer
  • No optional steps were used
  • Eluted each in 100μL of Elution Buffer and pooled into a single sample

NOTE: Sample 034 did not process properly (no phase separation after 24:1 chlorform:IAA addition – along with suggested additions of ML1 Buffer) and was discarded.

Quantified the DNA using the Qubit dsDNA BR Kit (Invitrogen). Used 2μL of DNA sample.

Samples were stored in the same box the tissue was delivered in and stored in the same location in our -80C: rack 8, row 5, column 4.


Qubit (Google Sheet): 20171114_qubit_Cvirginica_gDNA

Ample DNA in all samples for MBDseq. (Refer to “Original Sample Conc.” column in spreadsheet.)

Will let Steven & Katie know.


RNA Isolation – Olympia oyster gonad tissue in paraffin histology blocks

UPDATE 20170712: The RNA I isolated below is from incorrect regions of tissue. I misunderstood exactly what this tissue was, and admittedly, jumped the gun. The tissue is actually collected from the visceral mass – which contains gonad (a small amount) and digestive gland (a large amount). The RNA isolated below will be stored in one of the Shellfish RNA boxes and I will isolate RNA from the correct regions indicated by Grace

Isolated RNA from Olympia oyster gonad previously preserved with the PAXgene Tissue Fixative and Stabilizer and then embedded in paraffin blocks. See Laura’s notebook for full details on samples and preservation.


RNA was isolated from the following samples using the PAXgene Tissue RNA Kit (Qiagen). Gouged samples from the blocks weighing ~10mg from each of the tissues and processed according the protocol for isolating RNA from blocks of paraffin-embedded tissues.

Tissue identification is available in this GitHub Issue



  • Prior to beginning, I prepared an aliquot of Buffer TR1 by adding 40μL of β-mercaptoethanol (β-ME) to 4000μL of Buffer TR1).
  • Reconstituted DNase I with 550μL of RNase-free H2O. Aliquoted in 100μL volumes and stored @ -20C in the “-20C Kit Components” box.

Isolated RNA according to the PAXgene Tissue RNA Kit protocol with the following alterations:

  • “Max speed” spins were performed at 20,000g.
  • Tissue disruption was performed by adding ~25-50 glass beads (425 – 600μm diameter) with the Disruptor Genie @ 45C for 15mins (in the Friedman Lab).
  • Shaking incubation step was performed with Disruptor Genie
  • Samples were eluted with 27μL of Buffer TR4 x 2, incubated @ 65C for 5mins, immediately placed on ice and quantified on the Roberts Lab Qubit 3.0 with the RNA High Sensitivity Assay (ThermoFisher Scientific) using 5μL of each sample.


Concentrations (Google Sheet): 20170710_RNA_qubit_oly_histo_blocks

Well, the good news is that there’s RNA from all the samples and it seems to be in relatively high concentrations!

The bad news is that the concentrations for 10 of the 12 samples were too high and outside the range of the Qubit RNA HS Assay! Since we don’t have the broad range RNA assay, I can’t properly quantify the remaining samples. However, these samples are being sent to Katherine Silliman at some point, so I’ll leave it up to her to quantify the samples. I’m also guessing that she’ll run them on a Bioanalyzer to assess their integrity prior to beginning library construction, so that will also yield concentrations for the samples.

Samples were stored at -80C temporarily.

Samples will be sent to Katherine Silliman for high-throughput library construction and sequencing once I hear back from her regarding her availability to receive the samples.


Sample Annotation – Olympia oyster histology blocks (from Laura Spencer)

I’ve been asked to isolate RNA from some paraffin-embedded Olympia oyster gonad tissue.

Despite some excellent documentation by Laura Spencer (images of tissue layouts in histology cassettes and a corresponding cassette mapping key file), the histology facility seems to have flipped some things around and/or repositioned/split the contents of each cassette. This makes ID-ing the proper tissues tedious and, at times, difficult.

The list of tissues that needs to be processed is listed in this GitHub Issue #648. I’ve also added the list below:

NF-10 22

Prior to beginning RNA isolations, I have annotated images of the histology blocks and will be waiting for Laura to confirm that my annotations are correct. I will be posting a link to this notebook entry in the GitHub issue listed above for her to view and wait for her confirmation.

UPDATE 201700707 – Laura has indicated that many of my annotations are incorrect. Katie has gone through and made proper identification: https://github.com/sr320/LabDocs/issues/648#issuecomment-313792588


Additionally, as indicated in the GitHub Issue above, histology block “Oly 14″ does not have a corresponding tissue cassette photo (containing sample NF-10 26). Without the original image, I don’t think I can make an accurate guess on how the tissues are oriented in the resulting two histo blocks (see below).













BLOCKS 14 (unable to annotate at time of posting)















Project Summary: Maturation processes in the marine mollusc, Panopea generosa

Work Accomplished

The overall goal of this project is to develop a fundamental understanding of processes controlling marine mollusc reproductive maturation. In order accomplish this goal the specific research objectives of this proposal were to 1) characterize tissue specific transcriptomic resources for the geoduck and 2) identify proteins that play a role in geoduck reproductive maturation.

The first step in this project was collecting clams at different reproductive stages as determined through histological analysis. Gonadal tissue from 70 geoducks was sampled in batches of about eight per week over the span of two months from November 2014 to early January 2015. Hundreds of images were analyzed and reproductive status was determined for each individual.

Based on histological determination of reproductive maturational stage, seven female and six male paraffin-embedded gonad samples were selected for construction of RNA-seq libraries. A total of 443,468,476 reads were obtained and the de novo assembly resulted in a total of 153,982 transcript contigs with a mean contig length of 660 bp and an N50 value of 1015 bp. In comparing our contigs with oyster sequences whose expression changed during gonad development in 161 matched including geoduck sequences corresponding to genes expressed in gonads in early gonad developmental stages (7), with increasing expression during spermatogenesis (44), with increasing expression during oogenesis (31) and genes with varying expression level during gonadogenesis in both sexes (79) .

Proteomic profiles were determined for the primary reproductive maturation stages in both male and female clams using data dependent acquisition (DDA) of gonad proteins. This approach yielded 3,627 detected proteins across both sexes and three maturation stages. This is a significant escalation in the understanding of proteomic responses in maturation stages of marine mollusks. Based on the DDA data, 27 proteins from early- and late-stage male and female clams were chosen for selected reaction monitoring (SRM). The SRM assay yielded a suite of indicator peptides that can be used as an efficient assay to non-lethally determine geoduck gonad maturation status.


Non-metric multidimensional scaling plot (NMDS) of geoduck gonad whole proteomic profiles generated by data dependent acquisition. Gonad proteomes differ among clams by both sex (male = orange, female = blue) and stage (early-stage = circles, mid-stage = squares, late-stage = triangles; p<0.05).

Impact of Award

Beyond contributing to the fundamental knowledge of marine mollusk reproduction, this award produced numerous publications and provided basis for further funding and proposal submissions. In addition the transcriptomic data was the basis for the course: Bioinformatics for Transcriptomic and Epigenomic Analyses – Centro de Investigación Científica y de Educación Superior de Ensenada, B.C. (CICESE) 19-24 October 2015

Further Funding

Currently two projects have been funded that were based on this project and others have been submitted. Funded projects include: Proteomic response of shellfish to environmental stress; Department of Natural Resources $107,805 and Elucidating the physiological and epigenetic response of tetraploid and triploid Pacific Oysters to environmental stressors; NOAA $178,898. Submitted proposals include one to NOAA on the development of new clam species for aquaculture.


Crandall, Grace; Roberts, Steven (2016): Reproductive Maturation in Geoduck clams (Panopea generosa). figshare.
Retrieved: 14 41, Dec 23, 2016 (GMT)
This fileset includes a research paper describing reproductive maturation in geoduck clams with 200 images of gonadal histological sections and associated datasheets. Downloads = 1761.

Emma B. Timmins-Schiffman, Grace A. Crandall, Brent Vadopalas, Michael E. Riffle, Brook L. Nunn, Steven B. Roberts (2016) Integrating proteomics and selected reaction monitoring to develop a non-invasive assay for geoduck reproductive maturation
bioRxiv 094615; doi: https://doi.org/10.1101/094615

[Data] Transcriptomic profiles of adult female & male gonads in Panopea generosa (Pacific geoduck).

Panopea gonad transcriptome
Open Science Framework Project

[Data] Geoduck (Panopea generosa) gonad DDA LC-MS/MS

[Code] Source Code for GO Analysis in Geoduck Gonad Background

[Data] Geoduck (Panopea generosa) gonad DIA LC-MS/MS

[Data] Selected reaction monitoring of geoduck gonad peptides to develop biomarkers of reproductive maturation status

[Data] Selected reaction monitoring of geoduck hemolymph peptides to develop biomarkers of reproductive maturation status


Getting back on tracks

Yesterday I uploaded v0.0.1 of the Geoduck genome to CoGe.

Now I want to start adding tracks. To do this I used CLC to create RNA-seq tracks from our male and female gonad transcriptome data.

As would be expected only a small amount of reads mapped. This is as we are limiting the genome to the 22 scaffolds with length > 100k.



One thing to point out (and will have to be followed up on) is that many more Female reads mapped back.

I took the Reads data and exported to BAM.

Then uploaded to CoGe.
I called this Version 1, and interestingly I got some cool options.. so I selected them.


This included saving as a Notebook.


This was Finished in less than 5 minutes!

The SNP view.

Voila – we have it in a Browser.

and you can zoom in

Here we have a Notebook view
It is now public, though not quite sure if there is a url.

Everything is public so please give it a look / twirl.


Second look at Geoduck transcriptome

Last week I popped out a quick assembly and annotation on our geoduck gonadal transcriptome. A second assembly was also done using Trinity.

August 3 – Confirmed // in file location had no impact on assembly.
July 14 – TransDecoder protein annotations
10:40am – added TransDecoder results
10:29am – added Stats via Trinity

--seqType fq 
-JM 24G 
--left /Volumes/web/cnidarian/Geo_Pool_F_GGCTAC_L006_R1_001_val_1.fq /Volumes/web/cnidarian/Geo_Pool_M_CTTGTA_L006_R1_001_val_1.fq 
--right /Volumes/web/cnidarian//Geo_Pool_F_GGCTAC_L006_R2_001_val_2.fq /Volumes/web/cnidarian//Geo_Pool_M_CTTGTA_L006_R2_001_val_2.fq 
--CPU 16 



0:999   127840
1000:1999   18164
2000:2999   5321
3000:3999   1817
4000:4999   762
5000:5999   291
6000:6999   135
7000:7999   73
8000:8999   22
9000:9999   29
10000:10999     4
11000:11999     5
12000:12999     3
13000:13999     4
14000:14999     4
15000:15999     3
16000:16999     0
17000:17999     2
18000:18999     1

Total length of sequence:   101862868 bp
Total number of sequences:  154480
N25 stats:          25% of total sequence length is contained in the 8095 sequences &gt;= 2045 bp
N50 stats:          50% of total sequence length is contained in the 26158 sequences &gt;= 1014 bp
N75 stats:          75% of total sequence length is contained in the 64574 sequences &gt;= 446 bp
Total GC count:         37657770 bp
GC %:               36.97 %
hummingbird:Geo-trinity steven$ /Users/gilesg/compile/trinityrnaseq_r20131110/util/TrinityStats.pl /Volumes/web/cnidarian/Geo-trinity/trinity_out_dir/Trinity.fasta 

## Counts of transcripts, etc.
Total trinity transcripts:  154480
Total trinity components:   100155
Percent GC: 36.97

Stats based on ALL transcript contigs:

    Contig N10: 3444
    Contig N20: 2385
    Contig N30: 1766
    Contig N40: 1343
    Contig N50: 1014

    Median contig length: 371
    Average contig: 659.39
    Total assembled bases: 101862868


    Contig N10: 2999
    Contig N20: 2026
    Contig N30: 1462
    Contig N40: 1067
    Contig N50: 768

    Median contig length: 321
    Average contig: 553.88
    Total assembled bases: 55473621

Rerunning to see if double slash was a problem- did not see anything in error. Also running TransDecoder

TransDecoder Results

Ran the following

/Users/gilesg/compile/trinityrnaseq_r20131110/trinity-plugins/TransDecoder_r20131110/TransDecoder -t  /Volumes/web/cnidarian/Geo-trinity/trinity_out_dir/Trinity.fasta

This provided a peptide file with 36003 sequences.

!head /Volumes/web-1/cnidarian/Geo-trinity/Trinity.fasta.transdecoder.pep

>cds.comp100047_c0_seq2|m.5982 comp100047_c0_seq2|g.5982 ORF comp100047_c0_seq2|g.5982 comp100047_c0_seq2|m.5982 type:internal len:142 (-) comp100047_c0_seq2:3-425(-)

Running blastp on Trinity.fasta.transdecoder.pep

-query /Volumes/web/cnidarian/Geo-trinity/Trinity.fasta.transdecoder.pep 
-db /usr/local/bioinformatics/dbs/uniprot_sprot.fasta 
-evalue 1e-5 
-max_target_seqs 1 
-max_hsps 1 
-outfmt 6 
-num_threads 4 
-out /Volumes/web/cnidarian/Geo-trinity/Trinity.fasta.transdecoder.pep-blastp-uniprot-2.out

results: http://eagle.fish.washington.edu/cnidarian/Geo-trinity/Trinity.fasta.transdecoder.pep-blastp-uniprot-2.out


RNAseq Data Receipt – Geoduck Gonad RNA 100bp PE Illumina

Received notification that the samples sent on 20150601 for RNAseq were completed.

Downloaded the following files from the GENEWIZ servers using FileZilla FTP and stored them on our server (owl/web/nightingales/P_generosa):


Generated md5 checksums for each file:

$for i in *; do md5 $i >> checksums.md5; done

Made a readme.md file for the directory.


Sample Submission – Geoduck Gonad for RNA-seq

Prepared two pools of geoduck RNA for RNA-seq (Illumina HiSeq2500, 100bp, PE) with GENEWIZ, Inc.

I pooled a set of female and a set of male RNAs that had been selected by Steven based on the Bioanalyzer results from Friday.

The female RNA pool used 210ng of each sample, with the exception being sample #08. This sample used 630ng. The reason for this was due to the fact that there weren’t any other female samples to use from this developmental time point. The two other developmental time points each had three samples contributing to the pool. So, three times the quantity of the other individual samples was used to help equalize the time point contribution to the pooled sample. Additionally, 630ng used the entirety of sample #08.

The male RNA pool used 315ng of each sample. This number differs from the 210ng used for the female RNAs so that the two pools would end up with the same total quantity of RNA. However, now that I’ve typed this, this doesn’t matter since the libraries will be equalized before being run on the Illumina HiSeq2500. Oh well. As long as each sample in each pool contributed to the total amount of RNA, then it’s all good.

The two pools were shipped O/N on dry ice.

  • Geo_pool_M
  • Geo_pool_F

Calculations (Google Sheet): 20150601_Geoduck_GENEWIZ_calcs


Bioanalyzer – Geoduck Gonad RNA Quality Assessment

Before proceeding with transcriptomics for this project, we need to assess the integrity of the RNA via Bioanalyzer.

RNA that was previously isolated on 20150508, 20150505, 20150427, and 20150424 (those notebook entries have been updated to report this consolidation and have a link to this notebook entry) were consolidated into single samples (if there had been multiple isolations of the same sample) and spec’d on the Roberts Lab NanoDrop1000:

Google Sheet: 20150528_geoduck_histo_RNA_ODs

NOTE: Screwed up consolidation of Geoduck Block 03 sample (added one of the 04 dupes to the tube, so discarded 03).

RNA was stored in Shellfish RNA Box #5.

RNA was submitted to to Jesse Tsai at University of Washington Department of Environmental and Occupational Health Science Functional Genomics Laboratory for running on the Agilent Bioanalyzer 2100, using either the RNA Pico or RNA Nano chips, depending on RNA concentration (Pico for lower concentrations and Nano for higher concentrations – left decision up to Jesse).



Bioanalzyer 2100 Pico Data File (XAD): SamWhite_Eukaryote Total RNA Pico_2015-05-28_12-50-00.xad
Bioanalzyer 2100 Nano Data File (XAD): SamWhite_Eukaryote Total RNA Nano_2015-05-28_13-22-53.xad


Pico Gel Representation


Pico Electropherogram


Nano Gel Representation


Nano Electropherogram


Jesse alerted me to the fact that they did not have any ladder to use on the Nano chip, as someone had used the remainder, but failed to order more. I OK’d him to go ahead with the Nano chip despite lacking ladder, as we primarily needed to assess RNA integrity.


Bad Samples:

  • Geo 04 – No RNA detected
  • Geo 65, 67, 68 – These three samples show complete degradation of the RNA (i.e. no ribosomal band present, significant smearing on the gel representation).

All other samples look solid. Will discuss with Steven and Brent on how they want to proceed.

Full list of samples for this project (including the Block 03 sample not included in this analysis; see above). Grace’s notebook will have details on what the numbering indicates (e.g. developmental stage).

  • block 02
  • block 03 (no RNA)
  • block 04 (no RNA)
  • block 07
  • block 08
  • block 09
  • block 34
  • block 35
  • block 38
  • block 41
  • block 42
  • block 46
  • block 51
  • block 65 (degraded RNA)
  • block 67 (degraded RNA)
  • block 68 (degraded RNA)
  • block 69
  • block 70

RNA Isolation – Geoduck Gonad in Paraffin Histology Blocks

UPDATE 20150528: The RNA isolated in this notebook entry may have been consolidated on 20150528.

The RNA isolation I performed earlier this week proved to be better for some of the samples (scraping tissue directly from the blocks), but still exhibited low yields from some samples. I will perform a final RNA isolation attempt (the kit only has six columns left) from the following samples:

  • 02
  • 03
  • 04
  • 07
  • 08
  • 09

Instead of full sections from each histology cassette, I gouged samples directly from the tissue in each of the blocks to maximize the amount of tissue input.


Samples were then processed with the PAXgene Tissue RNA Kit in a single group.

Isolated RNA according to the PAXgene Tissue RNA Kit protocol with the following alterations:

  • “Max speed” spins were performed at 19,000g.
  • Tissue disruption was performed with the Disruptor Genie @ 45C for 15mins.
  • Shaking incubation step was performed with Disruptor Genie
  • Samples were eluted with 40μL of Buffer TR4, incubated @ 65C for 5mins, immediately placed on ice and quantified on the Roberts Lab NanoDrop1000.


All samples were stored @ -80C in Shellfish RNA Box #5.



Two samples (02 and 07) produced great yields and perfect RNA (260/280 and 260/230 of ~2.0). The remainder of the samples showed little improvement compared to what I’ve been obtaining from the previous three attempts. Will discuss with Steven and Brent about how to proceed with this project.