2021-12-08 DEC

Journal Club - FAIR Socrata

FAIR Data

https://episphere.github.io/rdfTable

Note how context aware format does not come with querying approaches beyond AV pairs.

GraphQL

The quest for queryable API middle layers, https://observablehq.com/@jonasalmeida/graphql

https://graphql.org

http://www.hl7.org/fhir/graphql.html

https://dbpedia.org/sparql

https://www.topquadrant.com/graphql/graphql-queries.html

https://docs.gdc.cancer.gov/API/Users_Guide/GraphQL_Examples

If the query requires only a subset of the data to be returned, GDC GraphQL may speed up requests as GraphQL queries return only the specified data. This may require less work on the GDC server-side to fulfill those requests. Conversely, if an entire data-set is required for each request, the GDC REST API may be a better fit. No matter which method is used, the data returned by the GDC REST API and the GraphQL query will be identical as they query the same source.

An HL7 FHIR and GraphQL approach for interoperability between heterogeneous Electronic Health Record systems.

Mukhiya SK, Lamo Y.

Health Informatics J. 2021 Jul-Sep;27(3):14604582211043920. doi: 10.1177/14604582211043920.

PMID: 34524029

Cite Share

A model-driven framework for data-driven applications in serverless cloud computing.

Samea F, Azam F, Rashid M, Anwar MW, Haider Butt W, Muzaffar AW.

PLoS One. 2020 Aug 28;15(8):e0237317. doi: 10.1371/journal.pone.0237317. eCollection 2020.

PMID: 32857770

WebAPI / Data entrepĂ´t

... epiVerse

How does graphql fit in the client side?

https://github.com/antoniogarrote/rdfstore-js

Hackathon

BigQuery fest

... Zhuoliang with public data

The Data

https://www.fao.org/faostat/en/#data/QCL

The Model

CREATE OR REPLACE MODEL `cse6242.XGBoost_Classifier`

OPTIONS

(model_type='BOOSTED_TREE_CLASSIFIER' , l2_reg = 0.1, num_parallel_tree = 8, max_tree_depth = 10) AS


# add in new features

SELECT pro_label As label,

Country,

Year,

_1,

_2,

_3,

_4,

_5,

_6,

_7,

_8,

_9,

_10,

_11,

_12,

_121,

_345,

_678,

_910,

_999,

Product

FROM

`cse6242.lab_join_pivot`

WHERE Year BETWEEN 1961 AND 2010 # train 50 years

;

How dumb can autoML be?

... Monjoy, Praful

?

10:30

BIOSTATISTICS BRANCH SEMINAR SERIES PRESENTS

BB seminar:

Wenyi Wang Ph.D.

Professor,

Department of Bioinformatics and Computational Biology

Department of Biostatistics

The University of Texas MD Anderson Cancer Center

Date: Wednesday, December 8th, 2021

Time: 10:30 am to 11:30 am (EST.)

Location: Join Via WebEx

Meeting number: 2317 984 5303

Password: BBSem12.08

Join by phone: 1-650-479-3207 Call-in toll number (US/Canada)

Access code: 2317 984 5303