Introduction
Microsoft Excel (MS Excel) is a spreadsheet application developed by Microsoft that allows users to store, organize, analyze, and visualize data using tables, formulas, and various built-in tools. It is widely used in businesses, education, and personal projects for tasks such as:
Spreadsheet Structure: Data is arranged in rows and columns within worksheets.
Formulas & Functions: Users can perform calculations using built-in functions (SUM, AVERAGE, IF, VLOOKUP, etc.).
Charts & Graphs: Helps visualize data using bar charts, pie charts, line graphs, etc.
Pivot Tables: Summarizes and analyzes large datasets efficiently.
Conditional Formatting: Highlights cells based on conditions (e.g., color-coding values).
Data Sorting & Filtering: Helps organize and find specific information quickly.
Macros & Automation: Automates repetitive tasks using VBA (Visual Basic for Applications).
Data analysis is the process of collecting, organizing, cleaning, analyzing, and interpreting data to extract useful information, identify patterns, and make informed decisions.
Processes involved in data analysis:ย
Data collection
The first step is gathering data from different sources. The data can be:
โ Primary Data โ Collected directly (e.g., surveys, experiments, observations).
โ Secondary Data โ Obtained from existing sources (e.g., reports, websites, databases).
Example: A teacher collects students' marks from exams.
Data cleaning
Raw data often contains errors, duplicates, missing values, or inconsistencies. Data cleaning helps ensure accuracy.ย
Key Cleaning Techniques in Excel:
โ Conditional formatting: Used for highlighting data or cell according the condition given.
โ Sort & Filter: Used for organizing data to find errors, duplicates, or missing values.ย
โ Data validation: Used for restricting incorrect entries.
Analysing using statistical tools and charts .
Statistical Tools
โ Function: A function is a predefined formula in Excel that performs a specific task. Functions simplify complex calculations by following a structured format.
Example:
โ MAX: =MAX(A1:A10) โ Returns the highest value in the range A1 to A10.
โ MIN: =MIN(A1:A10) โ Returns the lowest value in the range A1 to A10.
โ AVERAGE: =AVERAGE(A1:A10) โ Finds the mean (average) of the values in A1 to A10.
โ MEDIAN: =MEDIAN(A1:A10) โ Returns the middle value in the range A1 to A10.
โ MODE: =MODE.SNGL(A1:A10) โ Returns the most frequently occurring value in A1 to A10.
โ IF: =IF(A1>=40, "Pass", "Fail") โ Returns "Pass" if A1 is 40 or more, otherwise returns "Fail".
โ IFS: =IFS(A1>=80, "Excellent", A1>=60, "Good", A1>=40, "Average", A1<40, "Fail") โ Evaluates multiple conditions and returns the corresponding result.
โ SUM: =SUM(A1:A10) โ Adds all the values in the range A1 to A10.
โ SUMIF: =SUMIF(A1:A10, ">50") โ Returns the sum of values in A1:A10 that are greater than 50.
โ SUMIFS: =SUMIFS(A1:A10, B1:B10, ">50", C1:C10, "<30") โ Returns the sum of values in A1:A10 where B1:B10 is greater than 50 and C1:C10 is less than 30.
โ COUNT: =COUNT(A1:A10) โ Counts the number of numeric values in A1 to A10.
โ COUNTA: =COUNTA(A1:A10) โ Counts the number of non-empty cells in A1 to A10 (including text and numbers).
โ COUNTIF: =COUNTIF(A1:A10, ">50") โ Counts the number of values in A1:A10 that are greater than 50.
โ COUNTIFS: =COUNTIFS(A1:A10, ">50", B1:B10, "<30") โ Counts the number of values in A1:A10 that are greater than 50 and in B1:B10 that are less than 30.
โ RANK: =RANK(A1, A1:A10, 0) โ Returns the rank of A1 within the range A1 to A10 (0 for descending order, 1 for ascending order).
โ VLOOKUP: =VLOOKUP(75, A1:B10, 2, FALSE) โ Searches for 75 in column A and returns the corresponding value from column B.
โ HLOOKUP: =HLOOKUP(75, A1:J2, 2, FALSE) โ Searches for 75 in row 1 and returns the corresponding value from row 2.
โ CONCATENATE (Old Method): =CONCATENATE(A1, " ", B1) โ Joins values from A1 and B1 with a space in between.
โ TEXTJOIN (Newer Method): =TEXTJOIN(", ", TRUE, A1:A10) โ Joins values from A1 to A10, separated by commas.
โ Formula: A formula is a user-defined mathematical expression that performs calculations using operators (e.g., +, -, *, /). It can include cell references, numbers, and functions.
Example:
โ =A1+A2+A3+A4+A5 โ Adds the values in A1, A2, A3, A4, and A5 manually.
โ =A1+A2*B1 โ Adds A1 to the product of A2 and B1.
โ =(A1+A2)/2 โ Adds A1 and A2, then divides by 2.ย
3. Data Visualization Using Chartsย
Data visualization is the graphical representation of information to help in analysis, decision-making, and communication. Charts and graphs make complex data easy to understand.
๐น Purpose of Data Visualizationย
โ
Quick Insights โ Helps in identifying patterns, trends, and outliers in data.
โ
Better Decision-Making โ Simplifies complex data for informed decision-making.
โ
Comparing Data โ Allows for side-by-side comparisons using different chart types.
โ
Enhanced Communication โ Presents data visually for better understanding in reports.
๐น Types of Charts & Their Elements
โ Column Chart โ Best for comparing values across categories.
โ Bar Chart โ Similar to column charts but displayed horizontally.
โ
Example: Comparing monthly sales for different products.
โ Best for showing trends or progressions over time.
โ
Example: Tracking student performance over a semester.
โ Shows percentages or proportions of a whole.
โ
Example: Market share of different companies.
โ Used to analyze relationships between two numerical variables.
โ
Example: Relationship between study time and test scores.
โ A mix of two chart types, like bars and lines together.
โ
Example: Revenue (column) vs. profit percentage (line).
In MS Excel, cell references are used in formulas to refer to data from other cells. There are three main types of cell referencing styles:ย
Relative Referencing: Changes when copied to another cell (default behavior).
โ
Example:
If cell B2 contains =A2+10 and you copy it to B3, it changes to =A3+10.
Absolute Referencing ($A$1): Does NOT change when copied. The dollar sign ($) locks the row and column.
โ Example:
If C2 has =$A$1+10 and you copy it to C3, it remains =$A$1+10.
Mixed Referencing ($A1 or A$1): Locks either the row OR the column when copied.
โ Example:
$A1 โ The column (A) is fixed, but the row changes when copied.
A$1 โ The row (1) is fixed, but the column changes when copied.
Formula in B2 Copied to C3 Explanation
=A2+10 =B3+10 Relative (both row & column change)
=$A$2+10 =$A$2+10 Absolute (does not change)
=$A2+10 =$A3+10 Mixed (column A is fixed, row changes)
=A$2+10 =B$2+10 Mixed (row 2 is fixed, column changes)
Google Slide
Google Sheets is a free, web-based spreadsheet application developed by Google. It works similarly to Microsoft Excel, but it runs entirely in your web browser. Hereโs a quick breakdown:
Create and edit spreadsheets online โ You can use it from any device with internet access.
Real-time collaboration โ Multiple people can work on the same sheet at the same time.
Autosave โ All changes are automatically saved to your Google Drive.
Built-in functions and formulas โ Like SUM, AVERAGE, IF, VLOOKUP, and more.
Charts and graphs โ You can visualize your data easily.
Conditional formatting โ Highlight cells based on specific rules.
Google Apps Script โ Automate tasks or create custom functions.
Integration โ Connects easily with other Google Workspace tools like Docs, Forms, Slides, etc.
Google Slide vs. Microsoft Excel
Google Sheets Microsoft Excel
Free to use with a Google account Requires Microsoft 365 subscription
Web-based; works in browser Desktop-based (also has web version)
Auto-saves to Google Drive Auto-saves with OneDrive (if enabled)
Real-time collaboration built-in Collaboration available in Excel Online
Limited advanced features Powerful features for data analysis
Uses Google Apps Script for automation Uses VBA (Visual Basic for Applications)
Accessible from any device with internet Best experience on desktop
Good for small to medium data sets Handles large data sets efficiently
Easy sharing via link or email Can share via OneDrive or email
Ideal for team projects and cloud use Ideal for complex calculations and offline use