CUDA Parallel Programming/Anova with CUDA: Difference between revisions

From CS486wiki
Jump to navigationJump to search
Content deleted Content added
Core (talk | contribs)
No edit summary   (change visibility)
Core (talk | contribs)
No edit summary   (change visibility)
Line 4: Line 4:
= Introduction =
= Introduction =


Anova can be used to define and solve the mathematical formula of species like Human, Rabbit etc. Each of those species formed by variables like: Height, Eye Color, Bone thickness, Hair Color etc. or sicknesses like Cancer.
Like to cotton example you gave I want to give an example: I can thing that cotton might be height of each person. There might be of course other variables like eye color; hair color etc. but I will concentrate on one measurement that height. Related to my comparison between DNA sequences I have chance to observe different Gene-pairs between each person. By looking my result I have to conclude that which part of SNPs that (different pairs of DNA) causes the height.

Genome-Wide Association Study: is an examination of many common genetic variants in different to see if any variant is associated with a trait. Genetic Variant: A single-nucleotide polymorphism (SNP).
As a understanding my problem:
Height(It is one of variable that defines Human) is a specific property of each person.

There might be of other variables like eye color; hair color etc. but I will concentrate on one measurement that causing difference that occurs on Gene. Related to my comparison between DNA sequences I have chance to observe different Gene-pairs between each person. By looking my result I have to conclude that which part of SNPs that (different pairs of DNA) causes the height.

<p>Genome-Wide Association Study: is an examination of many common genetic variants in different to see if any variant is associated with a trait. Genetic Variant: A single-nucleotide polymorphism (SNP).


== Problem ==
== Problem ==
Compare the DNA of two groups of patients: people worth disease. SNPs are then considered to mark a region of the human genome which influences the risk of disease, eye color, height etc.
Compare the DNA of two groups of patients: people worth disease. SNPs are then considered to mark a region of the human genome which influences the risk of disease, eye color, height etc.


== My Aim ==
== My Aim: ==
I have to find SNP-pairs that have significant association with a given quantitative phenotype (height or weight). ANOVA tests on all SNP pairs
I have to find SNP-pairs that have significant association with a given quantitative phenotype (height or weight). ANOVA tests on all SNP pairs
Two people, all 3.1 billion molecules of it, is more than 99.9 percent identical but that 0.1 percent accounts for all the genetic differences between people.
Two people, all 3.1 billion molecules of it, is more than 99.9 percent identical but that 0.1 percent accounts for all the genetic differences between people.
Line 19: Line 25:
For example, if 1000 people share the same disease that all these people share – genetic mutations (SNPs) that healthy people don’t have.
For example, if 1000 people share the same disease that all these people share – genetic mutations (SNPs) that healthy people don’t have.


== My Data ==
== My Data: ==
DNA sequence of each person. As factor I can change the number of people that I am experience on. I can’t change the original sequence of DNA. Factor to Change: I can change the person that I am using in my dataset. I can compare healthy and not-healthy persons. I can compare tall and short people. By depending on the sequence of DNA each person has its unique characteristics (eye color, height, any diseases …). So in to my experiment data set I can put specific type of person. For example 99 person blue eyes and 1 person as brown eyes, by comparing all of them I will have chance to result that which part of DNA sequence affect eye color as blue.
DNA sequence of each person. As factor I can change the number of people that I am experience on. I can’t change the original sequence of DNA. Factor to Change: I can change the person that I am using in my dataset. I can compare healthy and not-healthy persons. I can compare tall and short people. By depending on the sequence of DNA each person has its unique characteristics (eye color, height, any diseases …). So in to my experiment data set I can put specific type of person. For example 99 person blue eyes and 1 person as brown eyes, by comparing all of them I will have chance to result that which part of DNA sequence affect eye color as blue.


== Measurement ==
== Measurement: ==
Compare the DNA sequence of 2 or more than 2 people. Found pair differences are my SNP genes. How many SNP genes there are, where they are located.
Compare the DNA sequence of 2 or more than 2 people. Found pair differences are my SNP genes. How many SNP genes there are, where they are located.


== My Goal ==
== My Goal: ==
To find 0.1 difference of DNA sequence between each person. Depending on my result and looking at my characteristics of each person, I can conclude that which part of DNA sequence causing those characteristics.
To find 0.1 difference of DNA sequence between each person. Depending on my result and looking at my characteristics of each person, I can conclude that which part of DNA sequence causing those characteristics.

Revision as of 04:34, 15 May 2012

← Back to project main page

Anova with CUDA

Introduction

Anova can be used to define and solve the mathematical formula of species like Human, Rabbit etc. Each of those species formed by variables like: Height, Eye Color, Bone thickness, Hair Color etc. or sicknesses like Cancer.

As a understanding my problem: Height(It is one of variable that defines Human) is a specific property of each person.

There might be of other variables like eye color; hair color etc. but I will concentrate on one measurement that causing difference that occurs on Gene. Related to my comparison between DNA sequences I have chance to observe different Gene-pairs between each person. By looking my result I have to conclude that which part of SNPs that (different pairs of DNA) causes the height.

Genome-Wide Association Study: is an examination of many common genetic variants in different to see if any variant is associated with a trait. Genetic Variant: A single-nucleotide polymorphism (SNP).

Problem

Compare the DNA of two groups of patients: people worth disease. SNPs are then considered to mark a region of the human genome which influences the risk of disease, eye color, height etc.

My Aim:

I have to find SNP-pairs that have significant association with a given quantitative phenotype (height or weight). ANOVA tests on all SNP pairs Two people, all 3.1 billion molecules of it, is more than 99.9 percent identical but that 0.1 percent accounts for all the genetic differences between people. The difference occurred in people DNA Sequence causes that one person might have blue eyes or lung cancer, or perfect pitch.

Rather than having A-T pair of molecule at a certain spot on the DNA chain, a person might have a G-C pair. On the other hand that difference might now have any effect at all on a person’s health or appearance. These differences also called as SNPs.

For example, if 1000 people share the same disease that all these people share – genetic mutations (SNPs) that healthy people don’t have.

My Data:

DNA sequence of each person. As factor I can change the number of people that I am experience on. I can’t change the original sequence of DNA. Factor to Change: I can change the person that I am using in my dataset. I can compare healthy and not-healthy persons. I can compare tall and short people. By depending on the sequence of DNA each person has its unique characteristics (eye color, height, any diseases …). So in to my experiment data set I can put specific type of person. For example 99 person blue eyes and 1 person as brown eyes, by comparing all of them I will have chance to result that which part of DNA sequence affect eye color as blue.

Measurement:

Compare the DNA sequence of 2 or more than 2 people. Found pair differences are my SNP genes. How many SNP genes there are, where they are located.

My Goal:

To find 0.1 difference of DNA sequence between each person. Depending on my result and looking at my characteristics of each person, I can conclude that which part of DNA sequence causing those characteristics.