Changes between Version 6 and Version 7 of BIOS_PreparedData


Timestamp:
Oct 19, 2016 10:47:41 AM (8 years ago)
Author:
rick
Comment:

--

  • BIOS_PreparedData

    v6 v7  
    11= Recommended BIOS datasets for downstream analysis =
    22
    3 == Freeze I ==
    4 = RNAseq data =
     3= Freeze I =
     4== RNAseq data ==
    55=== Data available ===
    66Raw RNA-seq data is available on the grid, see [wiki:BIOS_RnaSeq RNASeq data]. This data has been aligned using the pipeline described at [wiki:BIOS_Pipeline RNAseq alignment and quantification pipeline]; the exon-, transcript- and gene-level count output is described below. Count data is available from the so-called 'Freeze1': these are the 2116 samples from Groningen (N=626), Leiden (N=654), Rotterdam (N=652) and Maastricht (N=184) that passed QC. This is around half of the BIOS RNA-seq data that is used for the first papers; the other half has been measured but is still being aligned and quality-checked. Both raw and TMM-normalized data are available. TMM normalization corrects for differences in library size across subjects; see the attached script for R code, the R package edgeR, and http://genomebiology.com/2010/11/3/r25.
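
As a rough illustration of the TMM step mentioned above, the following sketch applies edgeR to a small made-up count matrix. It is not the attached BIOS script: the toy data, the object names (`counts`, `dge`, `tmm_cpm`) and the choice to report counts per million are assumptions made only for this example.

```r
library(edgeR)

# Toy raw count matrix: genes in rows, samples in columns (values are made up).
set.seed(1)
counts <- matrix(rpois(400, lambda = 50), nrow = 100,
                 dimnames = list(paste0("gene", 1:100), paste0("sample", 1:4)))

# Wrap the counts in a DGEList and estimate TMM scaling factors per sample.
dge <- DGEList(counts = counts)
dge <- calcNormFactors(dge, method = "TMM")

# Counts per million, computed with the TMM-adjusted library sizes.
tmm_cpm <- cpm(dge, log = FALSE)
```

The per-sample scaling factors end up in `dge$samples$norm.factors`; whether the Freeze1 files store CPM values or another TMM-based scale should be checked against the attached script.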
     
    1919Ensembl v.71 for annotation, see [wiki:BIOS_ReferenceFiles Reference and annotation]. To export the data to a tab-delimited text file, use write.table(RNAs, file='yourfile.txt', quote=FALSE, col.names=TRUE, row.names=TRUE, sep='\t').[[BR]]
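
A minimal sketch of that export, including reading the file back in, might look like the following. It assumes `RNAs` behaves like a plain numeric matrix with gene and sample names; the toy matrix and the temporary file stand in for the real object and for 'yourfile.txt'.

```r
# Toy stand-in for the RNAs object (assumed here to be a numeric matrix).
RNAs <- matrix(c(10, 0, 5, 3, 7, 2), nrow = 3,
               dimnames = list(c("geneA", "geneB", "geneC"), c("s1", "s2")))

# Write a tab-delimited file with row and column names, as in the call above.
outfile <- tempfile(fileext = ".txt")
write.table(RNAs, file = outfile, quote = FALSE,
            col.names = TRUE, row.names = TRUE, sep = "\t")

# The header has one field fewer than the data rows, so read.table
# uses the first column as row names automatically.
back <- as.matrix(read.table(outfile, header = TRUE, sep = "\t",
                             check.names = FALSE))
```

Setting `check.names = FALSE` keeps sample identifiers unchanged on import, which matters when they are later matched against other tables.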
    2020
    21 = DNA methylation data =
     21== DNA methylation data ==
    2222
    2323=== Data available ===
     
    3535The Bioconductor/R packages minfi and illuminaio provide reading capabilities for the idat-files.
    3636
    37 = Genotype data =
     37== Genotype data ==
    3838
    3939=== Data available ===
     
    5555Note that the HRC-imputed data is in VCF format, which you may need to convert before use.
    5656
    57 = Phenotype data =
     57== Phenotype data ==
    5858
    5959=== Data available ===
     
    6565These files are available in .RData and .csv file formats.
    6666For column name explanations, see the page [wiki:BIOS_Phenotype Phenotype data]. Phenotype data is not complete yet: we are currently contacting the biobanks to complete their files.
     67
    6768=== Location on VM ===
    6869
     
    7475Link the files to the RNA-seq, genotype or methylation data by mapping the corresponding IDs.
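
The ID mapping could be sketched in base R as follows. The column names `ids` and `rnaseq_id`, the sample labels and both toy tables are hypothetical; the actual column names are explained on the phenotype page.

```r
# Toy phenotype table; "ids" and "rnaseq_id" are hypothetical column names.
pheno <- data.frame(ids = c("P1", "P2", "P3"),
                    rnaseq_id = c("R_b", "R_c", NA),
                    sex = c("F", "M", "F"),
                    stringsAsFactors = FALSE)

# Toy expression matrix with samples in columns.
expr <- matrix(1:6, nrow = 2,
               dimnames = list(c("geneA", "geneB"), c("R_a", "R_b", "R_c")))

# Keep only samples present in both tables, in the same order.
shared <- intersect(colnames(expr), pheno$rnaseq_id)
expr_linked <- expr[, shared, drop = FALSE]
pheno_linked <- pheno[match(shared, pheno$rnaseq_id), ]

# Sanity check: columns of the expression data line up with phenotype rows.
stopifnot(identical(colnames(expr_linked), pheno_linked$rnaseq_id))
```

Samples without a matching ID on either side (here `R_a` and `P3`) are dropped, so it is worth comparing the sample counts before and after linking.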
    7576
    76 == Freeze II ==
     77= Freeze II =
    7778=== Data available ===
    7879=== Location on VM ===