Till KTH:s startsida Till KTH:s startsida

How often are domains single-exon?

Exons and domains

The objective with this assignment is to determine the correlation between exons and domains. The working hypothesis is that domains are predominantely evolving within a single exon, thereby facilitating easier "domain exchange" between genes/proteins.

Your assignment is to answer the following questions.

  • How often are domains found within a single exon, and how often do they cross exon boundaries?
  • Are these numbers significant, i.e., are they different from what is expected by chance? Perform a randomization experiment where you ignore each domain's actualposition within a protein and place it randomly.

You should make a genome wide survey, i.e., you must use as many protein coding genes as possible. Exluding data requires serious justification.

Data

Download exon data for human genes from Ensembl's BioMart. There is domain information available at Ensembl, but you might want to use domain definitions from Pfam.

Tools

You might need tools from the HMMER package. These are installed on CSC computers.

Lars Arvestad skapade sidan 2 november 2015

Lärare Lars Arvestad ändrade rättigheterna 30 november 2015

Kan därmed läsas av alla och ändras av lärare.