Till KTH:s startsida Till KTH:s startsida

How often are domains single-exon?

Exons and domains

The objective with this assignment is to determine the correlation between exons and domains. The working hypothesis is that domains are predominantely evolving within a single exon, thereby facilitating easier "domain exchange" between genes/proteins.

Your assignment is to answer the following questions.

  • How often are domains found within a single exon, and how often do they cross exon boundaries?
  • Are these numbers significant, i.e., are they different from what is expected by chance? Perform a randomization experiment where you ignore each domain's actualposition within a protein and place it randomly.

You should make a genome wide survey, i.e., you must use as many protein coding genes as possible. Exluding data requires serious justification.

Data

Download exon data for human genes from Ensembl's BioMart. There is domain information available at Ensembl, but you might want to use domain definitions from Pfam.

Tools

You might need tools from the HMMER package. These are installed on CSC computers.

Lars Arvestad skapade sidan 27 oktober 2016

kommenterade 20 december 2016

Hi!
The links seem to be broken. Also, how many domains should be looked at and how should they be choosed?

Lärare kommenterade 4 januari 2017

Both links have now been fixed. I hope this was not a big concern!

The goal is to study all human genes to see what domains they contain and whether they cross exon boundaries or not.