2021-03-24 MAR
Journal Club - 9:30-10:30
Lambert, S.A., Gil, L., Jupp, S. et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat Genet (2021). https://doi.org/10.1038/s41588-021-00783-5
See also:
Chatterjee, N., Shi, J. & García-Closas, Developing and evaluating polygenic risk prediction models for stratified disease prevention. M. Nat. Rev. Genet. 17, 392–406 (2016).
____I (Jonas) just read the white paper for deep sparce learning, which is part of the basis for the Thousand Brains idea. I'd be happy to review it in a future journal club if there is interrest.____
Hackathon 10:30-11:30
Tile server
(Praful)
Full story at MedPath 2021
BigQuery Genomics
https://isb-cgc.appspot.com
(let's take our time on this one)
FAIR onboarding of Stats&Epi
First Box, then BG?Compare your gitter notes on structuring it (Jeya, Daniel, Lorena ...)
Sandbox this Friday?
APIs for large files (test in Box and GCP Buckets)
FAIR on large files in Box from Biowulf by streaming
Lorena's challenge: https://nih.app.box.com/file/751586322923
Calculate summary statistics (mean) on one of the variables.
Serialize json version of the file.
Move to BG and handle the traversal there.