Metatranscriptomics

Metatranscriptome refers to the complete set of RNA transcripts (mostly messenger RNA, or mRNA) that are expressed by the microbial community in a specific environment at a given time. It provides a snapshot of active gene expression in a microbiome, allowing researchers to understand what microbial genes are being used and how the community is responding to environmental conditions.
This approach is particularly useful for understanding how microbial populations respond to changing environmental conditions, interact with hosts, or contribute to ecosystem functions such as nutrient cycling. The process typically involves extracting total RNA from a sample, removing the abundant ribosomal RNA to enrich for messenger RNA, converting the mRNA into complementary DNA (cDNA), and then sequencing it using high-throughput platforms.
Bioinformatic tools are then used to analyze the data, identifying which genes are expressed and linking them to both their microbial origins and biological functions.

IMG
This metatranscriptomics course is designed to equip students with a comprehensive understanding of how to study gene expression in microbial communities directly from environmental samples. Students will also learn to interpret functional activity within microbial communities and link expression profiles to environmental or host-associated processes. Through case studies and practical assignments, students will develop the skills to critically evaluate metatranscriptomic studies, conduct statistical analyses like differential expression and different plots.

👉 Enroll now and take the first step toward a future in data-driven biology and beyond.

## 10% Discount if you register before 15th October, 2025. Hurry up!!


Course Information
Course Metatranscriptome(metagenomic mRNA differential expression)
Duration online 7 Days Training [ 2 Hours Daily [ Monday To Friday ] ]

Slots

Our working Time is 9:00 AM to 6:00 PM Indian Time Available slots - 9:00 AM to 11:00 AM / 11:00 AM to 1:00 PM / 2:00 PM to 4:00 PM / 4:00 PM to 6:00 PM
For training slots after 6 PM or before 9 am as well as weekends training kindly mention during registration accordingly it will be scheduled.

Mode

👉 For online training candidate have to install ZOOM (with remote control on candidate system which makes 100% interactive)
👉 Run time video recording candidate can make as well as pdf manual will be provided for future reference.
👉 All our training is 100% practical and 100% industrial and 100% interactive which provides same as offline learning.
👉 For doubt clear there will be extra support will be provided based on the requirement.
👉 Certificate will be provided

Sequencing Platform Illumina /Ion Torrent/PacBio/Nanopore
Raw data Candidate can include maximum 4 datasets of their own during training. Publication standards figures and tables will be generated.
Training Fees
Module-NGS Metatranscriptome Data Analysis
    📘 Introduction to NGS Metatranscriptome
    - Understanding NGS and Its Applications
    - Types of sequencing data generated
    - Understanding FASTQ files and sequencing quality scores

    📘 Linux Basics & Environment Setup
    - Linux Command Line Basics
    - Installing WGS Metagenome Tools
    - Using Conda and Shell Scripting

    📘 Data Retrieval & Formats
    - Fetching data from NCBI SRA using SRA Toolkit
    - Understanding different file formats

    📘 Introduction to R/Bioconductor
    - Installing packages with CRAN and Bioconductor
    - Data types and standardized data container
    - Data manipulation

    📘 Detailed Metatranscriptome Data Analysis Pipeline

    1. - Assembly:
      Assembling short reads into longer contigs to reconstruct genomic fragments
    2. - RNA Prediction and Classification:
      Detecting ribosomal RNA and other non-coding RNAs from assembled sequences and classifying them into known RNA families
    3. - ORF (CDS) Prediction:
      Predicting open reading frames (ORFs) or coding sequences (CDS) using tools like Prodigal/prokka to identify potential genes
    4. - Homology Searching Against Taxonomic and Functional Databases:
      Matching predicted genes against reference databases (e.g., NCBI NR, COG, KEGG) to infer function and taxonomy
    5. - HMMER Searching Against Pfam Database:
      Identifying protein domains using HMMER against the Pfam database to support functional annotation
    6. - Taxonomic Assignment of Genes:
      Assigning taxonomy to each gene based on homology search results
    7. - Functional Assignment of Genes:
      Linking predicted genes to functional categories like metabolic pathways or gene families using KEGG, COG etc.
    8. - Blastx on Parts of the Contigs with No Gene Prediction or Hits:
      Running Blastx on contig regions that lack gene predictions or functional hits to discover missed genes
    9. - Taxonomic Assignment of Contigs and Disparity Checks:
      Assigning taxonomy at the contig level and verifying consistency among genes within each contig to avoid chimeras
    10. - Coverage and Abundance Estimation for Genes and Contigs:
      Calculating how abundant each gene or contig is by mapping sequencing reads back to the assembly
    11. - Estimation of Taxa Abundances:
      Quantifying the abundance of microbial taxa across samples based on read mappings and gene assignments
    12. - Estimation of Function Abundances:
      Estimating the abundance of biological functions or pathways in the community using annotated gene data
    13. - Merging of Previous Results to Obtain the ORF Table:
      Integrating taxonomic, functional, and abundance data into a single table summarizing all ORFs
    14. - Merging of Previous Results to Obtain the Merged Data:
      Creating an abundance matrix summarizing all gene content and count of number of reads
    15. - Generation of Tables with Aggregated Taxonomic and Functional Profiles:
      Producing overview tables that summarize taxonomic composition and functional potential across samples
    16. - Visualization:
      Creating rich visual outputs including PCA plots, rarefaction curves, stacked bar charts, heatmaps, Krona charts, volcanoplot, GO and pathway enrichment, and network analysis to interpret the data effectively

Preparation

- Metatranscriptomics : https://en.wikipedia.org/wiki/Metatranscriptomics
- NCBI : https://pubmed.ncbi.nlm.nih.gov/39029911/

Instructor

Industry Experienced

Target Audiance

This course is designed for graduate students, postdoctoral researchers, and professionals working in the fields of conservation biology, evolutionary genomics, and population genetics or any life sciences who are interested in applying genomic tools to real-world conservation challenges.

Contact

Please write us at info@arraygen.com or call or whatsapp us on mobile +91-9673625446 if you need any clarification or for any custom training based on candidate reference paper or candidate own content/tools.

Course Information
Course Metatranscriptome Data Analysis
Duration 15 Days Training [ 2 Hours Daily [ Monday To Friday ] ]

Slots

Our working Time is 9:00 AM to 6:00 PM Indian Time Available slots - 9:00 AM to 11:00 AM / 11:00 AM to 1:00 PM / 2:00 PM to 4:00 PM / 4:00 PM to 6:00 PM
For training slots after 6 PM or before 9 am as well as weekends training kindly mention during registration accordingly it will be scheduled.

Mode

👉 For online training candidate have to install ZOOM (with remote control on candidate system which makes 100% interactive)
👉 For offline training candidate need to come to head office pune, India
👉 Run time video recording candidate can make as well as pdf manual will be provided for future reference.
👉 All our training is 100% practical and 100% industrial and 100% interactive which provides same as offline learning.
👉 For doubt clear there will be extra support will be provided based on the requirement.
👉 Certificate will be provided

Sequencing Platform Illumina /Ion Torrent/PacBio/Nanopore
Raw data Candidate can include maximum 4 datasets of their own during training. Publication standards figures and tables will be generated.
Training Fees
Module-I Advanced Bioinformatics & basic programming
Topics
    📘 Introduction to Bioinformatics
    - Overview of bioinformatics and its applications
    - Key concepts in computational biology
    - Role of bioinformatics in genomics, transcriptomics, and proteomics

    📘 Understanding NGS and Genomics Bioinformatics
    - Basics of Next-Generation Sequencing (NGS)
    - Types of NGS data (RNA-seq, WGS, WES)
    - Overview of NGS data formats: FASTQ, BAM, VCF
    - Introduction to pipelines and tools for NGS data analysis

    📘 Databases & Data Retrieval (NCBI and UCSC)
    - Learning how to retrieve biologically correct data
    - Performing complete batch retrieval (e.g., whole exome)
    - NCBI: understanding gene-level data retrieval
    - UCSC: handling large-scale data retrieval
    - UCSC Genome Browser and Table Browser usage
    - Batch Coordinate Retrieval and Genomic Data downloads
    - GFF/GTF gene annotation formats and how to retrieve them
    - Using BLAT for sequence-based search and alignment

    📘 Gene Prediction and Functional Annotation
    - Gene prediction approaches and tools
    - Functional annotation using Gene Ontology (GO)
    - Pathway analysis using KEGG, Reactome
    - Interpreting gene sets and biological relevance

    📘 Standalone/Offline BLAST for Large-Scale Genomic Data
    - Installing and setting up standalone BLAST
    - Running local BLAST for batch sequence alignment
    - Applications in genome-wide homology searches
    - Custom BLAST databases and performance optimization

    📘 PCR Primer Designing and Specificity Check
    - Designing accurate primers for PCR amplification
    - Tools: Primer3, NCBI Primer-BLAST
    - Checking primer specificity using genome-wide BLAST
    - Avoiding non-target amplification through design best practices

    📘 Understanding Python Programming
    - Introduction to Python for bioinformatics
    - Scripting basics: variables, loops, functions
    - Libraries like Biopython, pandas for biological data handling
    - Automating genomic workflows with Python scripts

AND
Module-II Next Generation Sequencing (NGS) - Metatranscriptome Data Analysis
Topics
    📘 Introduction to NGS Metatranscriptome
    - Understanding NGS and Its Applications
    - Types of sequencing data generated
    - Understanding FASTQ files and sequencing quality scores

    📘 Linux Basics & Environment Setup
    - Linux Command Line Basics
    - Installing WGS Metagenome Tools
    - Using Conda and Shell Scripting

    📘 Data Retrieval & Formats
    - Fetching data from NCBI SRA using SRA Toolkit
    - Understanding different file formats

    📘 Introduction to R/Bioconductor
    - Installing packages with CRAN and Bioconductor
    - Data types and standardized data container
    - Data manipulation

    📘 Detailed Metatranscriptome Data Analysis Pipeline

    1. - Assembly:
      Assembling short reads into longer contigs to reconstruct genomic fragments
    2. - RNA Prediction and Classification:
      Detecting ribosomal RNA and other non-coding RNAs from assembled sequences and classifying them into known RNA families
    3. - ORF (CDS) Prediction:
      Predicting open reading frames (ORFs) or coding sequences (CDS) using tools like Prodigal/prokka to identify potential genes
    4. - Homology Searching Against Taxonomic and Functional Databases:
      Matching predicted genes against reference databases (e.g., NCBI NR, COG, KEGG) to infer function and taxonomy
    5. - HMMER Searching Against Pfam Database:
      Identifying protein domains using HMMER against the Pfam database to support functional annotation
    6. - Taxonomic Assignment of Genes:
      Assigning taxonomy to each gene based on homology search results
    7. - Functional Assignment of Genes:
      Linking predicted genes to functional categories like metabolic pathways or gene families using KEGG, COG etc.
    8. - Blastx on Parts of the Contigs with No Gene Prediction or Hits:
      Running Blastx on contig regions that lack gene predictions or functional hits to discover missed genes
    9. - Taxonomic Assignment of Contigs and Disparity Checks:
      Assigning taxonomy at the contig level and verifying consistency among genes within each contig to avoid chimeras
    10. - Coverage and Abundance Estimation for Genes and Contigs:
      Calculating how abundant each gene or contig is by mapping sequencing reads back to the assembly
    11. - Estimation of Taxa Abundances:
      Quantifying the abundance of microbial taxa across samples based on read mappings and gene assignments
    12. - Estimation of Function Abundances:
      Estimating the abundance of biological functions or pathways in the community using annotated gene data
    13. - Merging of Previous Results to Obtain the ORF Table:
      Integrating taxonomic, functional, and abundance data into a single table summarizing all ORFs
    14. - Merging of Previous Results to Obtain the Merged Data:
      Creating an abundance matrix summarizing all gene content and count of number of reads
    15. - Generation of Tables with Aggregated Taxonomic and Functional Profiles:
      Producing overview tables that summarize taxonomic composition and functional potential across samples
    16. - Visualization:
      Creating rich visual outputs including PCA plots, rarefaction curves, stacked bar charts, heatmaps, Krona charts, volcanoplot, GO and pathway enrichment, and network analysis to interpret the data effectively

Preparation

- Metatranscriptomics : https://en.wikipedia.org/wiki/Metatranscriptomics
- NCBI : https://pubmed.ncbi.nlm.nih.gov/39029911/

Instructor

Industry Experienced

Target Audiance

This course is designed for graduate students, postdoctoral researchers, and professionals working in the fields of conservation biology, evolutionary genomics, and population genetics or any life sciences who are interested in applying genomic tools to real-world conservation challenges.

Contact

Please write us at info@arraygen.com or call or whatsapp us on mobile +91-9673625446 if you need any clarification or for any custom training based on candidate reference paper or candidate own content/tools.