This is a cleaned and elaborated training dataset extract of the Divvy Bikes Case Study from Google Data Analytics Certificate (Coursera) from here. The set contains 12 .csv files of monthly stats for 2022, data contains 7 basic attributes (user type, bike type, trip duration as a raw digit, and some data divided into numbers). The main goal targeted - to create simplified dataset for data manipulations in R, produce basic exploratory calculations (mean, median, average etc) and build simple visualizations.
It is pretty good for getting a big solid combined table of 12 ingredients made by in R and by R in a clip of time, and making magic impressions on the researcher - it total in up to 5,6M rows. And then dig in, and do your discoveries - that is why the set was published:)
This part represents the original dataset which is allocated here
The dataset was simply exctracted as archive from AWS data bucket in .csv format. Then, after some data examination, the set was simplified till just a few categories for basic exploratory analysis purposes and loaded to Kaggle, but as my product. Fair enough, I suppose...:)
The set contains only seven attributes of a 12-months records of bikes used by clients of some company, so it is not difficult to define the combinations to be elaborated and visualised.
Simple, primitive and ready for beginner's purposes.