Practicals:
Answers
[Questions]
| 1 Retrieve Sequences and Files |
|
First run showdb to retrieve the list of available databases [output]. The SwissProt database can be accessed using the name "swissprot" . To retrieve
the sequence in fasta format of the Swiss-Prot entry P57727
use the the program
seqret: type swissprot:P57727 as input for the program [input]. The default output format is fasta. [output]
|
| 2 Find Protein domains/motifs |
| The patmatmotifs program reports the occourrence of [input][output]: 1) The catalytic activity active site of the serine proteases (from the trypsin family); 2) The low-density lipoprotein (LDL) receptor domain; 3) The scavenger receptor cysteine-rich (SRCR) domain; In particular the patmatmotifs reports the position of two (HIS and SER) of the three residues which are part of the catalytic triad of the serine proteases of the trypsin family. Please read the documentation for matching patterns for details about the catalytic triad of the active site. The occurrence of the domains is also annotated in the FT lines of the SwissProt entry (you already retrieved in the previous exercise with the entret program) [output]: 1) LDL receptor domain, from amino acid 72 to amino acid 108 of the sequence. 2) SRCR domain, from amino acid 109 to amino acid 205 of the sequence. 3) The peptidase domain containing the active site, from amino acid 217 to amino acid 449 of the sequence |
| 3 Pairwise sequence alignment |
|
You
can use needle
to perform a global parwise alignment and water
to do a local parwise alignment. |
| 4 Producing a restriction map |
| Use for instance the EMBL sequence AF201380 and run the program restrict [input] [output] or remap [input] [output] . Type '6' in the input option 'Minimum recognition site length'. And in the advance section type '1' and '2' respectively in the 'Minimum cuts per RE' and 'Maximum cuts per RE' options. |
|
5
Translation
|
|
entret
reads and writes the complete sequence entry together with the heading
annotation. The coding sequence of the mRNA is reported in the FT lines
of the entry: from nucletotide 144 to nucleotide 1511 [output]. |
| 6 Designing primers |
|
Use the program
eprimer3 to design the 6 best primers
for the embl sequence AF201380 (type
'6' in the advanced option 'Number of results to return').
If you check the advanced output option 'Explain flag' you will see that
393 primer pairs are considered OK by the program [input] [output]. |
Questions: L. Bordoli (Lorenza.Bordoli@unibas.ch) or L. Falquet (Laurent.Falquet@isb-sib.ch)