...
lorenasandoval88 @lorenasandoval88 10:01
Nicole noticed that there is data missing in a table pulled from bigquery. The table is complete on GCP and I'm doing a simple "select *" query from R. It might just be my error, but I can't figure out what is going wrong when using the bigrquery package in R. I updated my pbigrquery package and I'm updating my R version today.
There is a bug in the package, that may be related here. r-dbi/bigrquery#311
```######### project
project <- "nih-nci-dceg-connect-dev"
### query
sql <- "SELECT * FROM nih-nci-dceg-connect-dev.Connect.Sanford_recruitment_AM"
tb <- bq_project_query(project, sql)
Sanford_AM = bq_table_download(tb)
```
danielruss @danielruss 10:09
@lorenasandoval88 I get 34 rows, is that correct?
lorenasandoval88 @lorenasandoval88 10:20
Yes! There should be 10 Connect_ID values, but I get 1.
danielruss @danielruss 10:22
@lorenasandoval88 can we talk over teams. These data frames are big and encoded, making it hard to understand.
Actually, maybe we can discuss it in the FAIR clinic in a few minutes.
Bhaumik Patel @bhaumik55231 11:31
@danielruss here is the SQL to cast Connect_ID as string
SELECT
CAST (Connect_ID as STRING) AS Connect_ID
FROM
`nih-nci-dceg-connect-dev.Connect.Sanford_recruitment_AM`
lorenasandoval88 @lorenasandoval88 11:52
Problem fixed! Thanks!
...
Tom is identifying the phenotype source files in Box
Studying it in PLCO first - https://observablehq.com/@episphere/plco