Welcome to the site for the Master's Course CSE 6514: Big Data Analytics. This course provides an in-depth introduction to the principles, techniques, and tools used in big data analytics. It begins by defining big data and examining its core challenges, followed by essential data preprocessing and data-warehouse integration concepts. The course also covers modern data management frameworks, including NoSQL systems such as key-value and document stores. Students will gain practical experience with large-scale data processing through Apache Hadoop, MapReduce, Apache Spark, and Spark programming using Python and PySpark. Key topics include distributed analytics, code optimization, cluster configuration, and distributed file storage systems. By the end of the course, students will be equipped with the foundational skills required to design and implement scalable data-driven solutions in distributed computing environments.
We will mainly study the cutting edge research published in recent conferences. This course is expected to be a research-based course. Research work will be more emphasized than traditional lectures and examinations.
Muhammad Abdullah Adnan, PhD (UC San Diego, USA)
Professor
Department of Computer Science and Engineering,
Bangladesh University of Engineering and Technology (BUET),
Dhaka-1000, Bangladesh.
Email: abdullah.adnan@gmail.com, adnan@cse.buet.ac.bd
Web: https://sites.google.com/site/abdullahadnan
Cell: +880 1552 336926
Saturday (5:00 pm - 8:00 pm, BD time)
Room 504,
ECE Building, West Palashi,
BUET, Dhaka.