Operating Systems
Indian Institute of Information Technology Dharwad
Data Science and Intelligent Systems
Course Objectives:
● To understand the importance of Operating Systems.
● To use OS algorithms for solving problems, which are related to the real-world or IT industry.
● To extend the use of OS to the extent of other domains and fields.
https://github.com/animesh88/Operating-System
Project: Establish the Hadoop cluster and Spark cluster. Chose any open-source data. Run Spark ML-Lib algorithms. Then, report Hadoop cluster installation (5 marks), Spark+Hadoop Cluster Demo (5 marks), Annotations (4 marks), PPT (6 marks), Write-up (10 marks), Video (10 marks).
References:
Mythili Vutukur. Lectures on Operating Systems, Department of Computer Science and Engineering, IIT Bombay, https://www.cse.iitb.ac.in/~mythili/os/
Avi Silberschatz, Peter Baer Galvin, and Greg Gagne. Operating System Concepts (Tenth Edition). https://www.os-book.com/OS10/slide-dir/index.html
Online textbook Operating Systems: Three Easy Pieces (OSTEP)
William Stallings. “Computer Organization and Architecture”. 10th Edition
Fox Armando, Rean Griffith, A. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, and I. Stoica. "Above the clouds: A Berkeley view of cloud computing." Dept. Electrical Eng. and Comput. Sciences, University of California, Berkeley, Rep. UCB/EECS 28 (2009): 13.
Creeger, Mache. "Cloud Computing: An Overview." ACM Queue 7.5 (2009): 2.
http://www.w3.org/TR/2004/NOTE-ws-gloss-20040211
http://en.wikipedia.org/wiki/Web_Services_Description_Language,
http://en.wikipedia.org/wiki/Cloud_computing
http://aws.amazon.com/what-is-cloud-computing/
http://en.wikipedia.org/wiki/Data_center
https://www.microsoft.com/en-us/research/blog/leslie-lamport-receives-turing-award/
https://en.wikipedia.org/wiki/Hypervisor
Cloud Computing: Past, Present, and Future, Professor Anthony D. Joseph, UC Berkeley Reliable Adaptive Distributed systems Lab (RAD lab) UC Berkley http://abovetheclouds.cs.berkeley.edu/
https://en.wikipedia.org/wiki/Google_File_System
https://sites.google.com/site/gfsassignmentwiki/home
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. "The Google file system." Proceedings of the nineteenth ACM symposium on Operating systems principles. 2003.
Jeffrey Dean, and Sanjay Ghemawat. "MapReduce: simplified data processing on large clusters." Communications of the ACM 51.1 (2008): 107-113.
Matei Zaharia, et al. "Apache spark: a unified engine for big data processing." Communications of the ACM 59.11 (2016): 56-65.
Matei Zaharia, et al. "Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing." Presented as part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 2012.
https://spark.apache.org/docs/latest/cluster-overview.html
https://spark.apache.org/
https://spark.apache.org/docs/3.3.1/ml-frequent-pattern-mining.html
Rakesh Agrawal, Tomasz Imieliński, and Arun Swami. "Mining association rules between sets of items in large databases." SIG-MOD. 1993.
Ramakrishnan Srikant, and Rakesh Agrawal. "Mining Generalized Association Rules.“ VLDB 1995.
Han Jiawei, Jian Pei, and Yiwen Yin. "Mining frequent patterns without candidate generation." ACM SIGMOD Record 29.2 (2000): 1-12.
Haoyuan Li, et al. "PFP: Parallel FP-Growth for query recommendation." Proceedings of the 2008 ACM Conference on Recommender systems. 2008.
Weiser, Mark. "The computer for the 21st century." ACM SIGMOBILE Mobile Computing and Communications Review 3.3 (1999): 3-11.
https://en.wikipedia.org/wiki/Ubiquitous_computing
https://en.wikipedia.org/wiki/Edge_computing
https://en.wikipedia.org/wiki/Cloudlet
IEEE Standard Association. "IEEE 1934-2018-IEEE Standard for adoption of OpenFog reference architecture for fog computing." (2018).
https://en.wikipedia.org/wiki/Fog_computing
Course outcome:
CO1. Execute Spark based Map-Reduce algorithms using existing theories and knowledge.
CO2. Learning Distributed File Systems and Sparks Distributed Computing skills to use, create, and develop systems.
CO3. Test, evaluate, and assess a project's capability to work for Web and Data Science technologies.
CO4. Implementing a part of a project based on the Spark based algorithms.
CO5. Applying the learned theories and Distributed Systems algorithms for real world applications, projects, and systems.
Evaluation Method
Item, Weightage (%)
Assignment 1 Write-up of the project: 10
Project Technical work: 20
Assignment 2 Presentation of Project: 10
Quiz or Viva: 10
Mid Term: 20
End Term: 30
Prepared by: Dr. Animesh Chaturvedi