Probability, Statistics and Neural Networks

Friday, 11 June 2004, 10:00-12:00

Exploration of genome data

Samuel Kaski (University of Helsinki)

Abstract: Current high-throughput measurement technologies are revolutionizing biology. With DNA microarray techniques, the expression of thousands of genes can be routinely measured. The genomes of several organisms have already been sequenced, and individual differences in genome sequences can be mapped with increasing accuracy. I will start by introducing the necessary biological background from the computer science and statistics perspective, and then discuss some of the most pressing data analysis problems. Statistical machine learning methods are being intensively applied to the clustering, visualization, classification, and prediction of genome data. I will discuss some current applications to gene expression data and to sequences of human endogenous retroviruses.

