2021-12-08 DEC
Journal Club - FAIR Socrata
FAIR Data
https://episphere.github.io/rdfTable
Note how context aware format does not come with querying approaches beyond AV pairs.
GraphQL
The quest for queryable API middle layers, https://observablehq.com/@jonasalmeida/graphql
http://www.hl7.org/fhir/graphql.html
https://www.topquadrant.com/graphql/graphql-queries.html
https://docs.gdc.cancer.gov/API/Users_Guide/GraphQL_Examples
If the query requires only a subset of the data to be returned, GDC GraphQL may speed up requests as GraphQL queries return only the specified data. This may require less work on the GDC server-side to fulfill those requests. Conversely, if an entire data-set is required for each request, the GDC REST API may be a better fit. No matter which method is used, the data returned by the GDC REST API and the GraphQL query will be identical as they query the same source.
An HL7 FHIR and GraphQL approach for interoperability between heterogeneous Electronic Health Record systems.
Mukhiya SK, Lamo Y.
Health Informatics J. 2021 Jul-Sep;27(3):14604582211043920. doi: 10.1177/14604582211043920.
PMID: 34524029
Cite Share
A model-driven framework for data-driven applications in serverless cloud computing.
Samea F, Azam F, Rashid M, Anwar MW, Haider Butt W, Muzaffar AW.
PLoS One. 2020 Aug 28;15(8):e0237317. doi: 10.1371/journal.pone.0237317. eCollection 2020.
PMID: 32857770
WebAPI / Data entrepĂ´t
... epiVerse
How does graphql fit in the client side?
https://github.com/antoniogarrote/rdfstore-js
Hackathon
BigQuery fest
... Zhuoliang with public data
The Data
https://www.fao.org/faostat/en/#data/QCL
The Model
CREATE OR REPLACE MODEL `cse6242.XGBoost_Classifier`
OPTIONS
(model_type='BOOSTED_TREE_CLASSIFIER' , l2_reg = 0.1, num_parallel_tree = 8, max_tree_depth = 10) AS
# add in new features
SELECT pro_label As label,
Country,
Year,
_1,
_2,
_3,
_4,
_5,
_6,
_7,
_8,
_9,
_10,
_11,
_12,
_121,
_345,
_678,
_910,
_999,
Product
FROM
`cse6242.lab_join_pivot`
WHERE Year BETWEEN 1961 AND 2010 # train 50 years
;
How dumb can autoML be?
... Monjoy, Praful
?
10:30
BIOSTATISTICS BRANCH SEMINAR SERIES PRESENTS
BB seminar:
Wenyi Wang Ph.D.
Professor,
Department of Bioinformatics and Computational Biology
Department of Biostatistics
The University of Texas MD Anderson Cancer Center
Date: Wednesday, December 8th, 2021
Time: 10:30 am to 11:30 am (EST.)
Location: Join Via WebEx
Meeting number: 2317 984 5303
Password: BBSem12.08
Join by phone: 1-650-479-3207 Call-in toll number (US/Canada)
Access code: 2317 984 5303