Merging Data Sets

Team 2 Data

Team 3 Data

The T2 data has transcript ID, while the T3 data has flybase id and gene. So we need to convert the transcript ids to gene names/flybase ids, or vice versa. We converted transcript id to flybase id because a single gene can have multiple transcript ids.

A python module was written for the conversion and is available at github. 


After the conversion the next step was to check for each gene in the T2 list if that also existed in the T3 list. If they did a row was written to the results file.

At the end the merged file contained the genes that were common across both data sets with their respective averages.

The merged list can be found at github.

Team 1 and 4

R-Script Used to Find Commonalities between the data sets




Team 1 and 4: Finalized Gene Lists + Localization of Genes

Click here to expand the entire drive and see more lists