1.1 Reference Genome
The reference genome file should be in fasta or multi fasta format. It allows any file extensions but file should have FASTA format. Both single-entry and multiple-entry Fasta files are supported. Multiple reference genome files can be uploaded but only one genome can be viewed at a time. Multiple selection of reference genome is not allowed.
For eg. >chr1 or >chr1 Homo sapiens
>chr1
AGCTTTTCATTCTGACTGCAACGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTCTC
TGACAGCAGCTTCTGAACTGGTTACCTGCCGTGAGTAAATTAAAATTTTATTGACTTAGG
After uploading of reference genome, user can use following options :
Forward & Reverse Strand:
Select option show forward /reverse strand to view forward/reverse strand. A user needs to select a particular option as per interest.
Change Base Color :
Users can change base color as per their choice. A Customized option can be used to select
colors for bases ATGC and default option will keep default colors.
The nucleotide's color by default option.
The nuclotides are shown below by using default color setting.
The nucleotide's color by Customized option. Users can change the base colors as per their choice.
The nuclotides are shown below by using customized color setting.
Retrieve Sequence :
A user can retrieve sequence of region of interest by specifying the position and chromosome number. The chromosome dropdown
box will give a choice for selection of chromosome numbers. The retrieved sequence will be in fasta format.
Save file in a particular folder.
1.2 Annotations
In annotation, there are three files format which can be uploaded here.
Gene transfer format (GTF) file should be tab separated, contains one line per feature. GTF borrows from GFF, but has additional structure. GFF has several versions, the most recent of which is GFF3.
Structure is as GFF so the fields are:
<seqname> <source> <feature> <start> <end> <score> <strand> <frame> [attributes] [comments]
ANG Genome Browser takes UCSC gtf format. These files contain information about features such as exon, CDS , start codon. It also has start & end, frame, strand, attribute.
Image of .gtf file format
GTF Track
After loading gtf file, if a user takes a cursor on any strand, it gives information about that strand. It shows ID, start & stop codon positions. The arrows on the strand tells you about the orientation of the strands. The arrow -> is the forward strand & arrow <- is the reverse strand. If a user clicks on any strand, it gives transcript details such as transcript ID, chromosome, strand, total number of exons & CDS. It also shows nucleotide sequence as well as its translated protein sequence. There is an option for selection of genetic code.
BED Track
The BED file has 7 fields, which give the information about chrom, ExonStart and ExonEnd or CDSStart and CDSEnd,type Exon or CDS, strand, id and description.
Example of .bed file format
After loading bed track, CDS are shown in golden color and exons present in CDS are shown in green color. The arrow indicates the type of strand i.e. forward or reverse. The arrow -> indicates forward strand and <- indicates reverse strand.
After loading gff file, if a user takes a cursor on any strand,
it gives information about that strand. It shows Chromosome ,
Gene or Exon or CDS start & Gene or Exon or CDS End positions.
The arrows on the strand tells you about the orientation of the strands.
The arrow -> is the forward strand & arrow <- is the reverse strand.