On this page I have tried to put everything we have seen so far together.
Below is the final workflow (as uploaded to the KNIME Pro hub) , and below that you can see the result of its execution.
Let's first talk about the different Data Sources I am using, all of them through a Databricks Workspace:
the first one, in the red box, is the tpcds_sf1 schema, which is a default example schema that comes pre-installed with Databricks. The bar chart on the right uses these data.
the second one, in the blue box, is our airlines Data warehouse on Aiven cloud (see here ) , but accessed through Data bricks. The pie chart on the bottom uses this data source.
the third one, in the green box), is the puertos Excel file we used in the Geospatial example , but here accessed thru Databricks Unity file system. The pie chart on the top left draws from this.
This. I hope, gives you an example on how to use various cloud-based data resources in your data analysis.
On the next page I will have a look at how to make your system more resilient to errors using failover techniques