Business Analytics

Lectures:

What is Data? (pdf)

Databases, XML and JSON (pdf)

Text Processing - Information Retrieval by Ian Soboroff (pdf)

Introduction to Data Processing by Brendan O’Connor (pdf)

Clustering (pdf)

Useful links:

Big Data

4 Vs https://www.ibmbigdatahub.com/tag/587

10 Vs https://www.123formbuilder.com/blog/big-data-pyramid-from-data-to-wisdom/

42 Vs https://www.elderresearch.com/blog/42-v-of-big-data

Open Data

Open Data Handbook https://opendatahandbook.org/

Kaggle https://www.kaggle.com/

OpenStreetMap https://www.openstreetmap.org/#map=4/-15.13/-53.19

Data

Data and Dataset Types, Semantics, and Tasks: lecture by Munzner/Möller http://vda.univie.ac.at/Teaching/Vis/14s/LectureNotes/04_data_types_and_tasks.pdf

Introduction to Text Processing with nltk

https://likegeeks.com/es/tutorial-de-nlp-con-python-nltk/

Wordcloud generation

https://www.datacamp.com/community/tutorials/wordcloud-python

Software:

Projection Explorer (PEx) http://vis.icmc.usp.br/vicg/tool/1/projection-explorer-pex

Twint library https://github.com/twintproject/twint

Datasets:

News from AP_BBC_CNN_Reuters from VICG-USP (zip)

Trump_Johnson (zip)

Johnson.db (zip)

Code:

Tweet Collector (zip)

Text Processing (zip). Launch the notebook with a raised output rate limit: jupyter notebook --NotebookApp.iopub_data_rate_limit=10000000000