<?xml version="1.0" encoding="UTF-8"?>
<Item xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" id="X-b126_3" TextType="CompleteItem" SchemaVersion="2.0" PageStartNumber="0" Template="Generic_A4_Unnumbered" DiscussionAlias="Discussion" SecondColour="None" ThirdColour="None" FourthColour="None" Logo="colour" Rendering="OpenLearn" x_oucontentversion="2025101000">
    <meta name="aaaf:olink_server" content="http://www.open.edu/openlearn/ocw"/>
    <meta name="vle:osep" content="false"/>
    <meta name="equations" content="mathjax"/>
    <CourseCode>B126_3</CourseCode>
    <CourseTitle/>
    <ItemID/>
    <ItemTitle>Data analysis: visualisations in Excel </ItemTitle>
    <FrontMatter>
        <Imprint>
            <Standard>
                <GeneralInfo>
                    <Paragraph><b>About this free course</b></Paragraph>
                    <Paragraph>This free course is an adapted extract from the Open University course B126 <i>Business data analytics and decision making</i> - <a href="https://www.open.ac.uk/courses/modules/b126https://www.open.ac.uk/courses/modules/b126https://www.open.ac.uk/courses/modules/b126?LKCAMPAIGN=ebook_&amp;amp;MEDIA=ou">www.open.ac.uk/courses/modules/b126</a><!--[MODULE code] [Module title- Italics] THEN LINK to Study @ OU page for module. Text to be page URL without http;// but make sure href includes http:// (e.g. <a href="http://www3.open.ac.uk/study/undergraduate/course/b190.htm">www3.open.ac.uk/study/undergraduate/course/b190?LKCAMPAIGN=ebook_&amp;amp;MEDIA=ou</a>)] -->.</Paragraph>
                    <Paragraph>This version of the content may include video, images and interactive content that may not be optimised for your device. </Paragraph>
                    <Paragraph>You can experience this free course as it was originally designed on OpenLearn, the home of free learning from The Open University –</Paragraph>
                    <!--[course name] hyperlink to page URL make sure href includes http:// with trackingcode added <Paragraph><a href="http://www.open.edu/openlearn/money-management/introduction-bookkeeping-and-accounting/content-section-0?LKCAMPAIGN=ebook_&amp;amp;MEDIA=ol">www.open.edu/openlearn/money-management/introduction-bookkeeping-and-accounting/content-section-0</a>. </Paragraph>-->
                    <Paragraph>There you’ll also be able to track your progress via your activity record, which you can use to demonstrate your learning.</Paragraph>
                </GeneralInfo>
                <Address>
                    <AddressLine/>
                    <AddressLine/>
                </Address>
                <FirstPublished>
                    <Paragraph><?oxy_custom_start type="oxy_content_highlight" color="140,255,140"?>First published 2025.<?oxy_custom_end?></Paragraph>
                </FirstPublished>
                <Copyright>
                    <Paragraph>Unless otherwise stated, copyright © 2025 The Open University, all rights reserved.</Paragraph>
                </Copyright>
                <Rights>
                    <Paragraph/>
                    <Paragraph><b>Intellectual property</b></Paragraph>
                    <Paragraph>Unless otherwise stated, this resource is released under the terms of the Creative Commons Licence v4.0 <a href="http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en">http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en</a>. Within that The Open University interprets this licence in the following way: <a href="http://www.open.edu/openlearn/about-openlearn/frequently-asked-questions-on-openlearn">www.open.edu/openlearn/about-openlearn/frequently-asked-questions-on-openlearn</a>. Copyright and rights falling outside the terms of the Creative Commons Licence are retained or controlled by The Open University. Please read the full text before using any of the content. </Paragraph>
                    <Paragraph>We believe the primary barrier to accessing high-quality educational experiences is cost, which is why we aim to publish as much free content as possible under an open licence. If it proves difficult to release content under our preferred Creative Commons licence (e.g. because we can’t afford or gain the clearances or find suitable alternatives), we will still release the materials for free under a personal end-user licence. </Paragraph>
                    <Paragraph>This is because the learning experience will always be the same high quality offering and that should always be seen as positive – even if at times the licensing is different to Creative Commons. </Paragraph>
                    <Paragraph>When using the content you must attribute us (The Open University) (the OU) and any identified author in accordance with the terms of the Creative Commons Licence.</Paragraph>
                    <Paragraph>The Acknowledgements section is used to list, amongst other things, third party (Proprietary), licensed content which is not subject to Creative Commons licensing. Proprietary content must be used (retained) intact and in context to the content at all times.</Paragraph>
                    <Paragraph>The Acknowledgements section is also used to bring to your attention any other Special Restrictions which may apply to the content. For example there may be times when the Creative Commons Non-Commercial Sharealike licence does not apply to any of the content even if owned by us (The Open University). In these instances, unless stated otherwise, the content may be used for personal and non-commercial use.</Paragraph>
                    <Paragraph>We have also identified as Proprietary other material included in the content which is not subject to Creative Commons Licence. These are OU logos, trading names and may extend to certain photographic and video images and sound recordings and any other material as may be brought to your attention.</Paragraph>
                    <Paragraph>Unauthorised use of any of the content may constitute a breach of the terms and conditions and/or intellectual property laws.</Paragraph>
                    <Paragraph>We reserve the right to alter, amend or bring to an end any terms and conditions provided here without notice.</Paragraph>
                    <Paragraph>All rights falling outside the terms of the Creative Commons licence are retained or controlled by The Open University.</Paragraph>
                    <Paragraph>Head of Intellectual Property, The Open University</Paragraph>
                </Rights>
                <Edited>
                    <Paragraph/>
                </Edited>
                <Printed>
                    <Paragraph/>
                </Printed>
                <ISBN><!--INSERT EPUB ISBN WHEN AVAILABLE (.kdl)-->
        <!--INSERT KDL ISBN WHEN AVAILABLE (.epub)--></ISBN>
                <Edition/>
            </Standard>
        </Imprint>
        <Covers>
            <Cover template="false" type="ebook" src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_1_ebook_cover.jpg"/>
            <Cover template="false" type="A4" src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_1_pdfimage_19x12-6_300d.jpg"/>
        </Covers>
    </FrontMatter>
    <Unit>
        <UnitID><!--leave blank--></UnitID>
        <UnitTitle><!--leave blank--></UnitTitle>
        <Session>
            <Title>Introduction</Title>
            <Paragraph>The objectives of this course are to explore ways to visualise data sets in Excel, familiarise yourself with Excel functions and explore the relationship(s) between variables. </Paragraph>
            <Figure>
                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f01.tif" src_uri="file:////openuniv.sharepoint.com@SSL/DavWWWRoot/sites/bmodules/b126/lmimages/b126_u01_s02_f01.tif" width="100%" x_printonly="y" x_folderhash="60bd249e" x_contenthash="3c990ad8" x_imagesrc="b126_u01_s02_f01.tif.jpg" x_imagewidth="450" x_imageheight="385"><!--Asset 448301--></Image>
                <Caption><b>Figure 1</b> Making sense of visualising and exploring data</Caption>
                <Alternative>A graphical illustration of different types of charts (e.g. pie chart, line graph).</Alternative>
                <Description>A colourful graphical illustration of different types of charts (e.g. pie chart, line graph). In the foreground is a histogram with a magnifying glass hovering over a section of the graph.</Description>
            </Figure>
            <Paragraph>Please note: this course will require the use Microsoft Excel or a similar program. </Paragraph>
            <Paragraph>This OpenLearn course is an adapted extract from the Open University course <a href="https://www.open.ac.uk/courses/modules/b126">B126 <i>Business data analytics and decision making</i></a>.</Paragraph>
        </Session>
        <Session>
            <Title>Learning outcomes</Title>
            <Paragraph>After studying this course, you should be able to:</Paragraph>
            <BulletedList>
                <ListItem>explore the functionalities of Excel that are used for problem solving in a business context</ListItem>
                <ListItem>demonstrate the numeracy skills required for gathering and organising data for decision making related to a specific problem</ListItem>
                <ListItem>use graphical techniques (histograms and scatter diagrams) to provide a visual summary of available data</ListItem>
                <ListItem>recognise data presentation and communication techniques used in a range of traditional and electronic media</ListItem>
                <ListItem>describe the relationship between two variables (independent and dependent variables).</ListItem>
            </BulletedList>
        </Session>
        <Session>
            <Title>1 Excel spreadsheets: a tool for organising and starting to process data</Title>
            <Paragraph>Before making any business decision, managers need to see a clear picture of their data, which provides them with all the relevant information. Therefore, it is good practice to organise and present data in a way that makes it useful for decision making and problem solving. Microsoft Excel is the most attractive and useful tool for data analysis. Researchers and analysts alike use this tool for various applications in the real world, such as in business, medicine, academia, logistics, operations, transportation, tax and auditing, marketing, accounting and finance. Moreover, it is flexible enough to be used with all types of data, irrespective of whether the data is qualitative or quantitative. </Paragraph>
            <Paragraph>In this section, you will make extensive use of Excel spreadsheets. Therefore, your task is to learn and familiarise yourself with the basics of using Excel. This will enhance your analytical skills as well as your employability skills. </Paragraph>
            <Paragraph>This section briefly explains the various features and functions of Excel that are used by researchers and data analysts to explore, organise and analyse data. </Paragraph>
            <Box>
                <Heading>Excel for OU students</Heading>
                <Paragraph>If you are currently studying with the OU as a fee-paying student, you have free access to Microsoft Office 365. This includes the spreadsheet software Excel. For this you need to go to the OU Computing Guide and scroll down to ‘Microsoft Office 365’. Click on the link. If you have not done so, then you should follow the instructions to sign up to get access for your free version of the software. If you have already installed Microsoft Excel on your laptop, then you may prefer to use your own version. Although earlier versions of Excel are not significantly different, the layout of some tabs and menus may vary slightly. </Paragraph>
            </Box>
            <Section>
                <Title>1.1 Accessing Microsoft Excel</Title>
                <Paragraph><?oxy_custom_start type="oxy_content_highlight" color="255,215,0"?>There are many other options, of course, but being an OU student you have free access to Microsoft 365 that includes the spreadsheet software Excel. If you have already installed Microsoft Excel on your laptop, then you may prefer to use your own version. Although earlier versions of Excel are not significantly different, the layout of some tabs and menus may vary slightly. <?oxy_custom_end?></Paragraph>
                <BulletedList>
                    <ListItem>You can run Excel by double-clicking on the Excel icon on your desktop or laptop (or select <b>Excel</b> from your list of programs). </ListItem>
                    <ListItem>Excel will open with a clean, new worksheet called <b>Book 1</b> that contains only one worksheet (<b>Sheet1</b>). You can add more sheets by clicking on the plus sign at the bottom of the worksheet or spreadsheet in case one sheet becomes too small or too cluttered. This will help you organise and manage your large sets of data, or a variety of data. </ListItem>
                    <ListItem>Add a new worksheet by clicking on the plus sign.</ListItem>
                    <ListItem>The new sheet <b>Sheet2</b> looks exactly like <b>Sheet1</b>. Each individual cell is labelled according to the <b>A1</b> reference style. This means that columns are labelled with letters from A to XFD and rows with number 1 to 1048576. </ListItem>
                </BulletedList>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig2.png" x_folderhash="9cb7b1d8" x_contenthash="540be868" x_imagesrc="b126_3_fig2.png" x_imagewidth="1927" x_imageheight="1048"><!--Asset 420001--></Image>
                    <Caption><b>Figure 2</b> An empty worksheet in Excel</Caption>
                    <Alternative>A blank Excel worksheet with the different editing tools at the top</Alternative>
                    <Description>A blank Excel worksheet with the different editing tools at the top; it shows columns labelled A to O and rows numbered 1 to 19. The tabs running along the very top of the sheet are: File, Home, Insert, Page layout, Formulas, Data, Review View, Automate, Developer, Help. Home, Formulas and Data are circled in black.</Description>
                </Figure>
                <Paragraph>The latest versions of Excel are organised using a <b>ribbon</b> interface, which consists of a collection of icons for each tab. The screenshot above shows the <b>Home</b> ribbon with several icons for general editing such as font size, text alignment or cell styles. In this session, you will use the <b>Home</b>,<b> Formulas</b> and <b>Data</b> tabs in Excel. You may click on each tab and familiarise yourself with the large main icons. </Paragraph>
            </Section>
            <Section>
                <Title>1.2 Opening an Excel file</Title>
                <Paragraph>Excel stores the data in files that contain one or multiple worksheets. When you open a worksheet by double-clicking the Excel icon, it will usually display the data contained in the last active sheet. </Paragraph>
                <BulletedList>
                    <ListItem>Download the file: <olink targetdoc="JC Electrics spreadsheet - OpenLearn">JC Electrics template</olink>. JC Electrics is a company manufacturing heavy machinery, such as generators, electric motors and transformers. </ListItem>
                    <ListItem>Click on <b>File</b>, then <b>Open</b> and navigate to the folder in which you saved this file and open the file by double-clicking it. </ListItem>
                    <ListItem>Navigate to the first sheet, labelled ‘JC Electrics 1’.</ListItem>
                </BulletedList>
                <Paragraph>You should now see quarterly data of units sold for JC Electrics in four columns. The screenshot below shows a spreadsheet with these columns labelled as: ‘Quarter’, ‘Generators’, ‘Transformers’ and ‘Electric Motors’. </Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f03.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f03.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="461c8012" x_imagesrc="b126_u01_s02_f03.tif.jpg" x_imagewidth="377" x_imageheight="555"><!--Asset 420003--></Image>
                    <Caption><b>Figure 3</b> Data in Excel</Caption>
                    <Alternative>A spreadsheet that displays the quarterly data of units sold for JC Electrics</Alternative>
                    <Description>A screenshot of a spreadsheet that displays the quarterly data of units sold for JC Electrics, which manufactures three products: generators, electric motors and transformers. This spreadsheet consists of four columns and each column is labelled as: Quarter, Generators, Electric Motors, and Transformer.</Description>
                </Figure>
            </Section>
            <Section>
                <Title>1.3 Adding the Data Analysis ToolPak in Excel </Title>
                <Paragraph>In the next section of this unit, you are going to learn how to add the Data Analysis ToolPak in Excel and what its purpose is. </Paragraph>
                <Paragraph>Data Analysis ToolPak is a Microsoft Excel add-in function that provides you with a set of statistical tools for analysing data. This add-in comes with Excel during installation. However, if it has not already been loaded or activated on the machine you are using, you need to do so to proceed further. </Paragraph>
                <Paragraph>To load the Data Analysis ToolPak:</Paragraph>
                <BulletedList>
                    <ListItem>Open the Excel worksheet (this will already be opened if you have opened ‘JC Electrics template’ file).</ListItem>
                    <ListItem>Click on the <b>File</b> tab, then click on <b>Options</b>, and select <b>Add-Ins</b>. </ListItem>
                    <ListItem>In the <b>Manage</b> box, select Excel <b>Add-ins</b> as shown in the screenshot below and click <b>Go (see Figure 4).</b></ListItem>
                    <ListItem>A new box will appear, and in the <b>Add-Ins</b> dialog box, select the <b>Analysis Tool Pak</b> check box, and then click <b>OK</b>. </ListItem>
                </BulletedList>
                <Paragraph>The Data Analysis icon should now be available on the <b>Data</b> ribbon in the <b>Analysis</b> group. Start of Figure</Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig4.png" width="100%" x_folderhash="60bd249e" x_contenthash="196c79d0" x_imagesrc="b126_3_fig4.png" x_imagewidth="845" x_imageheight="691"><!--Asset 420007--></Image>
                    <Caption><b>Figure 4</b> Activating the Data Analysis ToolPak </Caption>
                    <Alternative>‘Add Ins’ dialogue box</Alternative>
                    <Description>This figure shows the ‘Add Ins’ dialogue box to activate the ‘DataAnalysis Toolpack’ in Excel.</Description>
                </Figure>
                <Paragraph>The Data Analysis icon should now be available on the <b>Data</b> ribbon in the <b>Analysis</b> group. </Paragraph>
            </Section>
            <Section>
                <Title>1.4 Decimal points and dates </Title>
                <Paragraph>If you are using a version of Excel other than the English (UK) one, you may have noticed the different date format and decimal point. In addition, one of the Excel’s habits of automatically adjusting the format of cell contents can sometimes produce unwanted or incorrect results. Therefore, it is useful to learn how things can be corrected and adjusted in Excel. </Paragraph>
                <Paragraph>In Excel spreadsheet, click on <b>File, </b>select <b>Options</b> and then select the <b>Advanced tab</b>; the results of this are shown below. </Paragraph>
                <Paragraph>If you untick the box ‘Use system separators’, you can enter alternative symbols to use instead.</Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig5.png" width="100%" x_folderhash="60bd249e" x_contenthash="19703011" x_imagesrc="b126_3_fig5.png" x_imagewidth="843" x_imageheight="691"><!--Asset 420004--></Image>
                    <Caption><b>Figure 5</b> Advanced options in Excel </Caption>
                    <Alternative>Screenshot showing how to select the advanced option in Excel</Alternative>
                    <Description>A screenshot showing how to select the advanced option in Excel to format the date. For this click on ‘File’, select ‘Options’, and then select the ‘Advanced’ tab. The section on the Use system separators is unselected and framed in a red box. </Description>
                </Figure>
                <Paragraph>You can adjust the date format by selecting a cell that contains a date, then clicking on <b>Format</b> in the <b>Home</b> ribbon under <b>Cells</b> and <b>Format Cells …</b> or by right-clicking on the cells and selecting <b>Format Cells …</b> from the context menu. </Paragraph>
                <Paragraph>The figure below shows you the ‘Format’ dialog for an English version of Excel installed on a computer. You can produce the desired date format; for example, <b>DD/MM/YYYY </b>by selecting the option and then clicking <b>OK</b>. </Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig6.png" width="100%" x_folderhash="60bd249e" x_contenthash="96db29cc" x_imagesrc="b126_3_fig6.png" x_imagewidth="524" x_imageheight="518"><!--Asset 420005--></Image>
                    <Caption><b>Figure 6</b> Changing the date format</Caption>
                    <Alternative>Format Cells window in Excel</Alternative>
                    <Description>Figure 6 shows the Format Cells window in Excel. The Number tab is selected; under the Category menu on the left, Date is highlighted, and on the right is the Type menu showing various different date formatting options. At the bottom of the window are the OK and Cancel buttons.</Description>
                </Figure>
            </Section>
            <Section>
                <Title>1.5 Using shortcut keys in Excel</Title>
                <Paragraph>One useful task in Excel is to learn to use shortcut keys. This will speed up your Excel projects, especially when working under pressure in a professional environment. </Paragraph>
                <Paragraph>Some shortcut keys are listed below. Try some of these out while reading the list.</Paragraph>
                <Table class="normal" style="topbottomrules">
                    <TableHead>Table 1 Navigating inside and between worksheets</TableHead>
                    <tbody>
                        <tr>
                            <td>Arrow keys </td>
                            <td>Move around the spreadsheet.</td>
                        </tr>
                        <tr>
                            <td>Page Down/Page Up</td>
                            <td>Move screen down or up.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + Arrow keys</td>
                            <td>Move to the edge of a region. This is useful for navigating large blocks of data, particularly with the Ctrl + Shift + Arrow selection functionality. </td>
                        </tr>
                        <tr>
                            <td>Home</td>
                            <td>Move to the beginning of a row.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + Home</td>
                            <td>Move to the beginning of a worksheet. This is most useful if you have multiple worksheets and want to prepare a nice-looking workbook, by cycling through all worksheets pressing Ctrl + Page Down and Ctrl + Home for each sheet, which quickly puts the cursor in the upper-left corner.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                    </tbody>
                </Table>
                <Table class="normal" style="topbottomrules">
                    <TableHead>Table 2 Data selection</TableHead>
                    <tbody>
                        <tr>
                            <td>Shift + Space</td>
                            <td> Select the entire row at the cursor position.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td>Ctrl + A</td>
                            <td>Select the entire worksheet or the data-containing area. Press Ctrl + A a second time to select the entire worksheet.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td>Shift + Arrow key</td>
                            <td>Extend the selection by one cell. This is one of the most useful shortcuts.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + Shift + Arrow key</td>
                            <td>Extend the selection to the last cell with content in row or column. You can do this with the Page Up/Down keys.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td>Esc</td>
                            <td>Cancel selection.</td>
                        </tr>
                    </tbody>
                </Table>
                <Table class="normal" style="topbottomrules">
                    <TableHead>Table 3  Editing</TableHead>
                    <tbody>
                        <tr>
                            <td>Ctrl + C</td>
                            <td>Copy active selection.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + X</td>
                            <td>Cut active selection. Think carefully about whether you want to copy or cut a selection before pasting in each situation, because cell references in copied selections will point to other cells and not the original references when pasted.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + V</td>
                            <td>Paste active selection.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + Z</td>
                            <td>Undo last action.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + Y</td>
                            <td>Redo last action.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td>Alt + Enter</td>
                            <td>Start a new line in the same cell when entering text.</td>
                        </tr>
                        <tr>
                            <td>Ctrl + D</td>
                            <td>Copy above cell down.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                    </tbody>
                </Table>
                <Table class="normal" style="topbottomrules">
                    <TableHead>Table 4 Formulas and special functions</TableHead>
                    <tbody>
                        <tr>
                            <td>Ctrl + Shift + Enter</td>
                            <td>Enter an array formula. Must have a range selected first. (This is shown here only for reference and will be explained later.)</td>
                        </tr>
                        <tr>
                            <td>Shift + F3</td>
                            <td>Display the ‘Insert Function’ dialog.</td>
                        </tr>
                        <tr>
                            <td>F4</td>
                            <td>When editing a cell reference (e.g. ‘H5’), pressing F4 makes this reference absolute (e.g., ‘$H$5’). Pressing F4 repeatedly makes only row or column absolute.</td>
                        </tr>
                        <tr>
                            <td>F9</td>
                            <td>Force re-calculation of worksheets. It can be used to calculate part of a formula, when selecting part of formula and pressing F9, this evaluates the selected part.</td>
                        </tr>
                        <tr>
                            <td>Shift + F9 </td>
                            <td>Calculate the active worksheet.</td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                        <tr>
                            <td>Ctrl + S</td>
                            <td>Save the current workbook. Extremely useful for the occasional power outage or computer crash. </td>
                        </tr>
                        <tr>
                            <td/>
                            <td/>
                        </tr>
                    </tbody>
                </Table>
            </Section>
            <Section>
                <Title>1.6 Use of Excel spreadsheets</Title>
                <Paragraph>Excel spreadsheets are used to manage, organise and present data in a systematic way. They can be used in many ways in different fields; for example, in finance, they are mostly used to present and analyse data such as accounting transactions (e.g., sales, payments, cost, forecasting and budgeting). Spreadsheets are also used to design templates of financial statements and survey results. </Paragraph>
                <Paragraph>Some other uses are listed below:</Paragraph>
                <BulletedList>
                    <ListItem>In schools and universities, spreadsheets are often used to manage student data in areas such as their grade performance, attendance or personal biography. </ListItem>
                    <ListItem>In hospitals, spreadsheets are used to manage patient data, like their personal information, details of their illness or details of the medicines they use. </ListItem>
                    <ListItem>Data is often exported from more complicated computer systems, such as manufacturing, financial or marketing systems, to allow managers or analysts to manipulate the data once it has been created, and to carry out forecasts, simulations and ‘what-if’ scenarios. </ListItem>
                </BulletedList>
                <Paragraph>Excel also has many formatting options (borders, colour highlighting), to allow you to draw attention to aspects of your figures and tables. </Paragraph>
                <Activity>
                    <Heading>Activity 1: Test your Excel knowledge</Heading>
                    <Timing>Allow approximately 20 minutes to complete this activity</Timing>
                    <Multipart>
                        <?oxy_insert_start author="sm36828" timestamp="20251103T090424+0000"?>
                        <Paragraph>Choose an answer to each of the questions below, and then hit ‘Next’ to move on to the next section. You can check your answer to each question as you go. </Paragraph>
                        <?oxy_insert_end?>
                        <?oxy_delete author="sm36828" timestamp="20251103T090427+0000" content="&lt;Part&gt;&lt;Question&gt;&lt;Paragraph&gt;Choose an answer to each of the questions below, and then hit ‘Next’ to move on to the next section. You can check your answer to each question as you go. &lt;/Paragraph&gt;&lt;Paragraph&gt;Which of the following functions counts all cells?&lt;/Paragraph&gt;&lt;/Question&gt;&lt;Interaction&gt;&lt;MultipleChoice&gt;&lt;Wrong&gt;&lt;Paragraph&gt;a) SUMIF&lt;/Paragraph&gt;&lt;/Wrong&gt;&lt;Wrong&gt;&lt;Paragraph&gt;b) COUNTIF&lt;/Paragraph&gt;&lt;/Wrong&gt;&lt;Wrong&gt;&lt;Paragraph&gt;c) AVERAGE&lt;/Paragraph&gt;&lt;/Wrong&gt;&lt;Right&gt;&lt;Paragraph&gt;d) COUNT&lt;/Paragraph&gt;&lt;/Right&gt;&lt;/MultipleChoice&gt;&lt;/Interaction&gt;&lt;/Part&gt;"?>
                        <Part>
                            <Question>
                                <Paragraph>What is the shortcut key to copy a cell in an Excel spreadsheet?</Paragraph>
                            </Question>
                            <Interaction>
                                <MultipleChoice>
                                    <Right>
                                        <Paragraph>a) Ctrl + C</Paragraph>
                                    </Right>
                                    <Wrong>
                                        <Paragraph>b) Ctrl + F</Paragraph>
                                    </Wrong>
                                    <Wrong>
                                        <Paragraph>c) Shift + F3</Paragraph>
                                    </Wrong>
                                </MultipleChoice>
                            </Interaction>
                        </Part>
                        <Part>
                            <Question>
                                <Paragraph>What is the shortcut key to save the work in an Excel spreadsheet?</Paragraph>
                            </Question>
                            <Interaction>
                                <MultipleChoice>
                                    <Wrong>
                                        <Paragraph>a) Ctrl + V</Paragraph>
                                    </Wrong>
                                    <Wrong>
                                        <Paragraph>b) Ctrl + C</Paragraph>
                                    </Wrong>
                                    <Right>
                                        <Paragraph>c) Ctrl + S</Paragraph>
                                    </Right>
                                </MultipleChoice>
                            </Interaction>
                        </Part>
                        <Part>
                            <Question>
                                <Paragraph>What is the shortcut key to cancel the selection in the sheet or cell?</Paragraph>
                            </Question>
                            <Interaction>
                                <MultipleChoice>
                                    <Wrong>
                                        <Paragraph>a) Ctrl + Alt + Delete</Paragraph>
                                    </Wrong>
                                    <Right>
                                        <Paragraph>b) Esc</Paragraph>
                                    </Right>
                                    <Wrong>
                                        <Paragraph>c) F12</Paragraph>
                                    </Wrong>
                                </MultipleChoice>
                            </Interaction>
                        </Part>
                        <Part>
                            <Question>
                                <Paragraph>What is the shortcut key to display the insert function dialog?</Paragraph>
                            </Question>
                            <Interaction>
                                <MultipleChoice>
                                    <Right>
                                        <Paragraph>a) Shift + F3</Paragraph>
                                    </Right>
                                    <Wrong>
                                        <Paragraph>b) Ctrl + Z</Paragraph>
                                    </Wrong>
                                    <Wrong>
                                        <Paragraph>c) Shift + F2</Paragraph>
                                    </Wrong>
                                </MultipleChoice>
                            </Interaction>
                        </Part>
                    </Multipart>
                </Activity>
                <Paragraph>In the next section, you will learn how to present and summarise a univariate (one variable only) data set in a table and graphical form.</Paragraph>
            </Section>
        </Session>
        <Session>
            <Title>2 Univariate data visualisation</Title>
            <Paragraph>In practice, there are two ways to visualise data in Excel. These are:</Paragraph>
            <BulletedList>
                <ListItem>tabular form </ListItem>
                <ListItem>graphical form.</ListItem>
            </BulletedList>
            <Paragraph>While presenting data in Excel, it is important to know the features of data. If your data is <b>univariate</b> (that is, the data consists of many observations for only one variable) then you can either use a frequency table or a histogram to present the data and get an idea of its features. In the JC Electrics example, if the analysis is only carried out for ‘Generators’ (therefore only for one variable), this will be seen as a univariate analysis. </Paragraph>
            <Paragraph>However, if your data is <b>bivariate (</b>that is, the data consists of two variables (an independent variable and a dependent variable)) and you need to know the relationship between these two variables, then, for example, you can use either a contingency table or scatter diagram to present the data and get an idea of its structure. You will learn about bivariate data visualisation later in this session. </Paragraph>
            <Paragraph>The next section will briefly explain frequency tables.</Paragraph>
            <Section>
                <Title>2.1 Frequency tables</Title>
                <Paragraph>Before learning how to make a frequency tables and histograms in Excel, you first need to know what frequency distribution is, and why we need histograms. </Paragraph>
                <Paragraph>The frequency of a value is the number of times that value appears in a data set. A frequency distribution table displays the pattern of frequencies of a variable in a tabular form. It gives the information of how many times each value of a variable occurs in a data set. A frequency distribution table is an effective way to present and organise the collected raw data so that all its features are summarised in a tabular form. The first step that a researcher or analyst must take with collected raw data is to organise and present the data in a way that is more meaningful, and easy to digest. </Paragraph>
                <Paragraph>Frequency distribution tables are also called frequency tables, and in practice both terms are used interchangeably. In short, a frequency table gives you a snapshot of how your data is distributed and spread out. </Paragraph>
                <Paragraph>A frequency distribution table has two columns: Column A and Column B. Column A presents the outcome of the values and Column B presents the frequency of the outcomes. We can understand this better with the example below. </Paragraph>
                <Paragraph>Anna is an analyst, and she works Monday and Tuesday in a hospital. On Wednesday and Thursday, she works in a small accounting firm. On Friday, Saturday and Sunday she works in a bank. </Paragraph>
                <Paragraph>Now you can display this data through a frequency distribution table, as shown in Figure 7. </Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f07.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f07.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="45146a5f" x_imagesrc="b126_u01_s02_f07.tif.jpg" x_imagewidth="280" x_imageheight="126"><!--Asset 447245--></Image>
                    <Caption><b>Figure 7</b> Displaying the data in the frequency distribution table</Caption>
                    <Alternative>Table displaying the data in the frequency distribution table</Alternative>
                    <Description>This figure displays a table and describes clearly how many days in a week Anna, who is an analyst, works in these three different organisations: a hospital, an accounting firm and a bank. This figure consists of two columns. The first column is labelled as Column A, which describes the types of organisations (hospital, accounting firm and bank) in which Anna works each week. The second column is labelled as Column B, which describes the number of days Anna works in the above mentioned organisations: 2 in hospital, 2 in accounting firm, 3 in bank).</Description>
                </Figure>
                <Paragraph>This table gives you an idea of how many days Anna works in each different organisation.</Paragraph>
                <Paragraph>In the next section, you will learn various types of frequency distribution table.</Paragraph>
                <Paragraph>In the next section, you will learn about various types of frequency distribution table.</Paragraph>
            </Section>
            <Section>
                <Title>2.2 Types of frequency distribution</Title>
                <Paragraph>There are four types of frequency distribution table:</Paragraph>
                <BulletedList>
                    <ListItem>Ungrouped frequency distribution tables</ListItem>
                    <ListItem>Grouped frequency distribution tables</ListItem>
                    <ListItem>Relative frequency distribution tables</ListItem>
                    <ListItem>Cumulative frequency distribution tables</ListItem>
                </BulletedList>
                <Paragraph>Before describing each type of frequency distribution table, you need to know the difference between <b>ungrouped data</b> and <b>grouped data</b>. </Paragraph>
                <Paragraph>In simple terms, <b>ungrouped data</b> is raw data that has not been categorised or grouped in categories. For example, a manager in a firm knows that 100 employees work in their firm; this is raw data because it does not tell you how many employees work in each department (e.g. buying, production, sales, distribution). However, if you have raw data that is categorised, it is defined as <b>grouped data</b>. For example, if this manager knows that 20 employees work in buying department, 30 in production and 40 in sales and 10 in distribution, then it means that the data is organised in such a way that it provides a clear indication of how many employees work in each department. </Paragraph>
                <SubSection>
                    <Title>2.2.1 Concepts involved in frequency tables</Title>
                    <Paragraph>The following terms are frequently used in frequency distribution:</Paragraph>
                    <Paragraph><b>Class interval or class limit</b>: The lowest and the highest value defined for a class or group are called class limits. The lowest value is called the lower class limit and the highest value is called the upper class limit of that class. In the example below (see Table 5), the lower class limits are 7, 9, 11, 13, 15, and the upper limits are 8, 10, 12, 14 and 16. </Paragraph>
                    <Table>
                        <TableHead>Table 5 Class intervals for JC Electrics<?oxy_delete author="sm36828" timestamp="20251103T090506+0000" content=","?> <?oxy_insert_start author="sm36828" timestamp="20251103T090507+0000"?>(<?oxy_insert_end?>Generators<?oxy_insert_start author="sm36828" timestamp="20251103T090510+0000"?>)<?oxy_insert_end?></TableHead>
                        <tbody>
                            <tr>
                                <th borderbottom="true" borderleft="true" borderright="true" bordertop="true"><b>Class intervals</b></th>
                                <th borderbottom="true" borderleft="true" borderright="true" bordertop="true"/>
                            </tr>
                            <tr>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true">7–8</td>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true"/>
                            </tr>
                            <tr>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true">9–10</td>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true"/>
                            </tr>
                            <tr>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true">11–12</td>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true"/>
                            </tr>
                            <tr>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true">13–14</td>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true"/>
                            </tr>
                            <tr>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true">15–16</td>
                                <td borderbottom="true" borderleft="true" borderright="true" bordertop="true"/>
                            </tr>
                        </tbody>
                    </Table>
                    <Paragraph><b>Midpoint or class mark</b>: This is the average of a class interval, and is obtained by dividing the sum of upper and lower class limits by 2. Thus, the class mark of the interval 7– 8 is 7.5, as</Paragraph>
                    <Equation>
                        <MathML>
                            <math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                <mrow>
                                    <mi>(7+8)/2=7.5</mi>
                                </mrow>
                            </math>
                        </MathML>
                    </Equation>
                    <Paragraph><b>Range</b>: this is the difference between the maximum value and the minimum value of the data set. For example, the data set of a student’s score in a maths quiz contains the maximum value 30 and the minimum value 8. Hence, to calculate the range, you must calculate <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline" scriptlevel="0" displaystyle="false">
                                    <mrow>
                                        <mi>30-8=22</mi>
                                    </mrow>
                                </math>
</MathML></InlineEquation>. In the JC Electrics, Generators example, the range is the difference between 15 (as the maximum value sold) and 7 (as the minimum value sold), hence the range is 8. </Paragraph>
                </SubSection>
                <SubSection>
                    <Title>2.2.2 Ungrouped frequency distribution tables</Title>
                    <Paragraph>When you are summarising small amounts of data, then it is better to organise and represent it in an ungrouped frequency distribution table. This is a type of distribution that shows how many times each individual value occurs in a data set. </Paragraph>
                    <Paragraph>For example, say you are interested to know how many plants people have in their homes. A survey gives the following figures as number of plants that 18 different people have in their homes: </Paragraph>
                    <Paragraph>Number of plants = 1, 5, 2, 2, 3, 3, 5, 5, 1, 1, 1, 3, 4, 4, 2, 3, 3, 3 </Paragraph>
                    <Paragraph>To answer the question, first you need to see the frequency of each value in the data. Value 1 occur 4 times, so you can describe it as four people having one plant. Then, you do the same for the rest of the values, so: three people have two plants, six people have three plants, two people have four plants, and three people have five plants. </Paragraph>
                    <Paragraph>In the following activity, you will learn how to make ungrouped frequency tables in Excel.</Paragraph>
                    <Activity>
                        <Heading>Activity 2: How to make an ungrouped frequency table in Excel and a Bar chart</Heading>
                        <Timing>Allow approximately 55 minutes to complete this activity</Timing>
                        <Question>
                            <Paragraph>In this activity, you will learn how to make an ungrouped frequency table in Excel, by following the instructions that are given below. Once you have produced the ungrouped frequency table in Excel, you may need to compare it with our final output by clicking ‘Reveal discussion’. This will help you to see whether you have produced the accurate ungrouped frequency table or not. </Paragraph>
                            <BulletedList>
                                <ListItem>Open the Excel file <olink targetdoc="JC Electrics spreadsheet - OpenLearn">JC Electrics</olink> <?oxy_insert_start author="sm36828" timestamp="20251103T090526+0000"?>(<?oxy_insert_end?>template<?oxy_insert_start author="sm36828" timestamp="20251103T090529+0000"?>)<?oxy_insert_end?>. Select the worksheet named ‘Ungrouped frequency table’. This sheet contains only the quarterly data of numbers of generators sold from the raw data in worksheet ‘JC Electrics 1’.</ListItem>
                                <ListItem>Label <b>Column B</b> using the variable name (such as ‘Generators’) and label <b>Column C</b> ‘Frequency’. </ListItem>
                                <ListItem>Find the minimum and maximum value in the data. In this example <b>=MAX(A5:A28)</b>, which is 15, and <b>=MIN(A5:A28)</b>, which is 7. </ListItem>
                                <ListItem>Calculate the range: <b>(MAX – MIN)</b>, so, which is 8. </ListItem>
                                <ListItem>Arrange the values in Column A from Smallest to Largest order. Select the values <b>(A5:A28)</b> in Column A, click <b>Data</b> in the toolbar and then click <b>Sort</b>, select <b>Continue with current selection</b> and <b>press Enter</b>. See below Figure 8, which shows how your information should be displayed. </ListItem>
                            </BulletedList>
                            <Figure>
                                <?oxy_attributes src="&lt;change type=&quot;modified&quot; oldValue=&quot;\\dog\printlive\nonCourse\OpenLearn\Courses\B126\b126_1\b126_1_fig8_data_ascending_to_descending.tif&quot; author=&quot;sm36828&quot; timestamp=&quot;20251103T090912+0000&quot; /&gt;" src_uri="&lt;change type=&quot;removed&quot; oldValue=&quot;file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/b126_1_fig8_data_ascending_to_descending.tif&quot; author=&quot;sm36828&quot; timestamp=&quot;20251103T090914+0000&quot; /&gt;"?>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig8.png" x_folderhash="9cb7b1d8" x_contenthash="c159da6c" x_imagesrc="b126_3_fig8.png" x_imagewidth="825" x_imageheight="938"><!--Asset 420011--></Image>
                                <Caption><b>Figure 8 </b>Arranging the data from ascending to descending order</Caption>
                                <Alternative>JC Electrics Excel sheet</Alternative>
                                <Description>This figure shows the JC Electrics Excel sheet Generators column values on the left, which are: 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. Next to this column are two empty columns labelled Value and Frequency. All three columns are under the headings ‘JC Electrics’ followed by (Quarterly data of units sold).</Description>
                            </Figure>
                            <BulletedList>
                                <ListItem>To count the number of quarters in which 7 units of Generators were sold, you need to calculate the frequency in Column C. Type <b>=COUNTIF (Range, value)</b>. For example, <b>=COUNTIF (A5:A28,7</b><b>). </b>This calculation will return a value of 4, therefore there are 4 quarters when only 7 units of generators were sold.<b/></ListItem>
                                <ListItem>In Column B insert the different values of units sold such as: 7, 8, 9, 10, 11, 12, 13, 14 and 15 as they appear in Column A (see Figure 8). </ListItem>
                                <ListItem>To continue calculating the frequency in Column C, the formula in cell C5 can now be adjusted to =COUNTIF($A$5:$A$28, B5). This is the same formula as above; however, this is now presented in a format that will allow you to copy the formula up to Cell C13. As noted previously (see Table 4), to lock a cell or a group of cells you can use function F4. In your formula, this will show the $ in front of the rows or columns you prefer to keep locked in your calculations. </ListItem>
                                <ListItem>The copied formula in Cell C13, should be =COUNTIF($A$5:$A$28, B13). In column C, you will now have the calculated frequencies for all units sold. </ListItem>
                            </BulletedList>
                            <Figure>
                                <?oxy_attributes src="&lt;change type=&quot;modified&quot; oldValue=&quot;\\dog\printlive\nonCourse\OpenLearn\Courses\B126\b126_1\b126_1_fig9_frequency_ungroup_data.tif&quot; author=&quot;sm36828&quot; timestamp=&quot;20251103T091048+0000&quot; /&gt;" src_uri="&lt;change type=&quot;removed&quot; oldValue=&quot;file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/b126_1_fig9_frequency_ungroup_data.tif&quot; author=&quot;sm36828&quot; timestamp=&quot;20251103T091049+0000&quot; /&gt;"?>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig9.png" x_folderhash="9cb7b1d8" x_contenthash="7dd3866d" x_imagesrc="b126_3_fig9.png" x_imagewidth="668" x_imageheight="949"><!--Asset 420013--></Image>
                                <Caption><b>Figure 9</b> Calculating the frequency of ungroup data </Caption>
                                <Alternative>JC Electrics Excel sheet</Alternative>
                                <Description>This figure shows the JC Electrics Excel sheet Generators column values on the left, which are: 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10. Next to this column are two further columns, with the first labelled as Value and the second labelled as Frequency. The Value column contains the values 7, 8, 9, 10, 11, 12, 13, 14, 15. The Frequency column in the row Generator:7, Value 7 shows the formula: =COUNTIF(A5:A28,7).</Description>
                            </Figure>
                            <BulletedList>
                                <ListItem>Check your answers in the Excel file: <olink targetdoc="JC Electrics spreadsheet - solved">JC Electrics (solved)</olink>. </ListItem>
                                <ListItem>Optional: As further practice, you can now use the data for the other two products from JC Electrics 1 (Electric motors and Transformers) to identify the frequency of their products sold.</ListItem>
                                <ListItem>You can check your answers in for solved files: <olink targetdoc="JC Electrics motors spreadsheet - solved">JC Electrics (solved) Electric motors</olink> and <olink targetdoc="JC Electrics transformers spreadsheet - solved">JC Electrics (solved) Transformers</olink>.</ListItem>
                            </BulletedList>
                        </Question>
                        <Discussion>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_1_fig10_ungrouped_frequency_table.tif" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/b126_1_fig10_ungrouped_frequency_table.tif" width="100%" x_printonly="y" x_folderhash="8f25bc15" x_contenthash="18cfd99a" x_imagesrc="b126_1_fig10_ungrouped_frequency_table.tif.jpg" x_imagewidth="512" x_imageheight="582"><!--Asset 493719--></Image>
                                <Caption><b>Figure 10</b> Ungrouped frequency table </Caption>
                                <Alternative>Excel spreadsheet for JC Electrics</Alternative>
                                <Description>This figure shows an Excel spreadhseet for JC Electrics (Quarterly data of units sold). The first column, labelled Values, lists the values 7, 8, 9, 10, 11, 12, 13, 14, 15 The second column, labelled Frequency, contains the values 4, 2, 5, 2, 3, 3, 2, 2, 1.</Description>
                            </Figure>
                            <Paragraph>Producing the correct ungrouped frequency table provides you with information about the frequency of Generators sold with a particular value for the units sold. For example, 7 Generators are sold during 4 quarters, 8 Generators are sold during 2 quarters, 9 generators are sold during 5 quarters and so on... </Paragraph>
                        </Discussion>
                    </Activity>
                    <Paragraph>As is mentioned above, ungrouped frequency tables are useful when you have a small set of data, and you want to easily observe the frequency of each value in the data set. However, if you have a large data set then a grouped frequency distribution table is the best option. You will learn about these in the next section. </Paragraph>
                </SubSection>
                <SubSection>
                    <Title>2.2.3 Grouped frequency distribution tables</Title>
                    <Paragraph>When you are summarising large masses of raw data, it is useful to represent the data in groups. The groups are commonly known as classes or class intervals. You might then want to determine the number of values belonging to each class or class interval; this is called <i>class frequency.</i> A tabular arrangement of data by class together with the corresponding class frequencies is called a <i>grouped frequency distribution table</i>. This is a more efficient way to find the trends within the data, but there is a possibility that the grouping process may sacrifice some of the original detail of the data. </Paragraph>
                    <Paragraph>In the following activity, you will learn how to make a grouped frequency table in Excel.</Paragraph>
                    <Activity>
                        <Heading>Activity 3: How to make a grouped frequency distribution table in Excel</Heading>
                        <Timing>Allow approximately 45 minutes to complete this activity</Timing>
                        <Question>
                            <Paragraph>In this activity, you will produce a grouped frequency table in Excel by following the instructions given below. Once you have produced the grouped frequency table in Excel, you can check your answer by clicking ‘Reveal discussion’. </Paragraph>
                            <BulletedList>
                                <ListItem>Open the Excel file <olink targetdoc="JC Electrics spreadsheet - OpenLearn">JC Electrics template</olink>.</ListItem>
                                <ListItem>Open the worksheet named ‘Grouped frequency table’. This is the third worksheet in the file JC Electrics template. </ListItem>
                                <ListItem>In Column A you already have the data for Generators sorted. </ListItem>
                                <ListItem>The tables for the following column are indicated as: Column B: Class intervals; Column C: Lower limit of class intervals, Column D: Upper limit of class intervals, Column E: Frequency and last column, </ListItem>
                                <ListItem>Calculate the range. As mentioned before, this is the difference between the maximum and minimum value in the data set. For Generators, you can do this either by entering the formula <b>=MAX (</b>A5:A28<b>)-MIN (</b>A5:A28<b>), </b>or by calculating this on a separate section in the spreadsheet. For example, insert the formula =MAX(A5:A28) in cell J4. Insert formula =MIN(A5:A28) in cell J5. Calculate the rage in cell J6 by inserting the formula =J4-J5. </ListItem>
                            </BulletedList>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig11.png" x_folderhash="9cb7b1d8" x_contenthash="10316fb0" x_imagesrc="b126_3_fig11.png" x_imagewidth="1637" x_imageheight="821"><!--Asset 420017--></Image>
                                <Caption><b>Figure 11 </b>Calculating the range </Caption>
                                <Alternative>JC Electrics Excel sheet</Alternative>
                                <Description>This figure shows the JC Electrics Excel sheet Generators column values on the left, which are: 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. Next to this column are five empty columns, with the first also labelled as ‘Generators’ and the second labelled as ‘Frequency’. All three columns are under the headings ‘JC Electrics’ followed by ‘(Quarterly data of units sold)’. To the right of this table are three cells labelled, from top to bottom, max, min, range, sample size and width. The max cell shows a value of 15, the min cell shows a value of 7, and the range cell contains the formula = J4-J5. </Description>
                            </Figure>
                            <BulletedList>
                                <ListItem>Decide the class interval width. There are no firm rules on how to choose the width. However, the following formula is the most common method to calculate the width: </ListItem>
                            </BulletedList>
                            <Equation>
                                <MathML>
                                    <?oxy_custom_start type="oxy_content_highlight" color="255,215,0"?>
                                    <math xmlns="http://www.w3.org/1998/Math/MathML" displaystyle="true" scriptlevel="0" display="block">
                                        <mrow>
                                            <mtext>Width</mtext>
                                            <mspace depth="0.0ex" height="0.0ex" width="0.3em"/>
                                            <mtext>=</mtext>
                                            <mspace depth="0.0ex" height="0.0ex" width="0.3em"/>
                                            <mfrac>
                                                <mrow>
                                                  <mtext>Range</mtext>
                                                </mrow>
                                                <mrow>
                                                  <msqrt>
                                                  <mrow>
                                                  <mtext>SQRT (Sample size)</mtext>
                                                  </mrow>
                                                  </msqrt>
                                                </mrow>
                                            </mfrac>
                                        </mrow>
                                    </math><?oxy_custom_end?>
                                </MathML>
                                <Alternative>width equals range over square root of sample size</Alternative>
                            </Equation>
                            <BulletedList>
                                <ListItem>You can round this value to a whole number or a number that is convenient to add (such as multiple of 10). For example, the width calculated in the given data set is 1.63, so will be taken as <b>2</b> (see Figure 12). The =SQRT(number) is the formula in Excel for calculating a square root. </ListItem>
                            </BulletedList>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig12.png" x_folderhash="9cb7b1d8" x_contenthash="e0783f6a" x_imagesrc="b126_3_fig12.png" x_imagewidth="663" x_imageheight="514"><!--Asset 447263--></Image>
                                <Caption><b>Figure 12</b> Calculating the width </Caption>
                                <Alternative>Table arranged in rows showing max, min, range, sample size and width</Alternative>
                                <Description>This figure shows a table arranged in rows as follows: max: 15; min: 7; range: 8; sample size: 24; width: =1.63 and a red square indicating the formula used in cell J8 as J6/SQRT(J7). </Description>
                            </Figure>
                            <BulletedList>
                                <ListItem>Decide the number of groups or class intervals into which data is to be distributed. Each class interval is defined by a lower limit and an upper limit. The lower limit of first class interval is the lowest value in the data set, in our case 7. Add the class interval width to find the lower limit of the next class interval. Keep adding the interval width to calculate more class intervals until you exceed the highest value. For example, in the given data set, you determined the class intervals width equals 2, so you should make the class intervals as 7_8, 9_10, 11_12, 13_14, 15_16. </ListItem>
                                <ListItem>This means that the first class interval has lower limit of 7 and upper limit of 8. See Figure 13 below.</ListItem>
                            </BulletedList>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f13.tif" src_uri="\\dog\printlive\nonCourse\OpenLearn\Courses\B126\b126_1\Thumbnails\resized\b126_u01_s02_f13.tif.jpg" webthumbnail="true" x_printonly="y" x_folderhash="693cb533" x_contenthash="bd122567" x_imagesrc="b126_u01_s02_f13.tif.jpg" x_imagewidth="800" x_imageheight="667" x_smallsrc="b126_u01_s02_f13.tif.small.jpg" x_smallfullsrc="\\dog\printlive\nonCourse\OpenLearn\Courses\B126\b126_1\Thumbnails\resized\b126_u01_s02_f13.tif.small.jpg" x_smallwidth="512" x_smallheight="433"><!--Asset 420018--></Image>
                                <Caption><b>Figure 13</b> Making the class intervals </Caption>
                                <Alternative>Excel spreadsheet for JC Electrics</Alternative>
                                <Description>This figure shows the JC Electrics Quarterly data of units sold, with columns labelled from left to right as: Generators, Class intervals, Lower limit of class interval, Upper limit of class interval. From top to bottom, the Generators column contains the values 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. The Class intervals column contains the values 7_8, 9_10, 11_12, 13_14, 15_16. The Lower limit of class interval contains the values 7, 9, 11, 13, 15; and the Upper limit of class interval contains the values 8, 10, 12, 14, 16.</Description>
                            </Figure>
                            <BulletedList>
                                <ListItem>Next step is to calculate the frequency. This will be displayed in the section E5:E9. In cell E5 insert the formula =FREQUENCY(A5:A28,D5:D9). This will return the values 6 in the first cell followed by 7, 6, 4, 1 and 0. </ListItem>
                                <ListItem>As you see, the Upper limit of class interval is selected here, therefore the returned value 6 indicates, that 6 times up to 8 units of Generators were sold. The next calculation in cell E6 shows a value of 7, therefore there 7 times Generators were sold with up to 10 units, but above 8 and so on. </ListItem>
                                <ListItem>Check your answers in the Excel file: <olink targetdoc="JC Electrics spreadsheet - solved">JC Electrics (solved)</olink>. </ListItem>
                                <ListItem>Optional: As further practice, you can now use the data for the other two products from JC Electrics 1 (Electric motors and Transformers) to identify the class intervals, lower limit of class interval, upper limit of class interval and frequency.</ListItem>
                                <ListItem>You can check your answers in for solved files: <olink targetdoc="JC Electrics motors spreadsheet - solved">JC Electrics (solved) Electric motors</olink> and <olink targetdoc="JC Electrics transformers spreadsheet - solved">JC Electrics (solved) Transformers</olink>. </ListItem>
                            </BulletedList>
                        </Question>
                        <Discussion>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig14.png" x_folderhash="9cb7b1d8" x_contenthash="daaf2652" x_imagesrc="b126_3_fig14.png" x_imagewidth="1184" x_imageheight="766"><!--Asset 420019--></Image>
                                <Caption><b>Figure 14</b> Calculating the frequency by using an array formula </Caption>
                                <Alternative>Excel spreadsheet for JC Electrics</Alternative>
                                <Description>This figure shows the JC Electrics Quarterly data of units sold, with columns labelled from left to right as: Generators, Class intervals, Lower limit of class interval, Upper limit of class interval, Frequency and Cumulative frequency. From top to bottom, the Generators column contains the values 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. The Class intervals column contains the values 7_8, 9_10, 11_12, 13_14, 15_16. The Lower limit of class interval contains the values 7, 9, 11, 13, 15; and the Upper limit of class interval contains the values 8, 10, 12, 14, 16. Finally, the Frequency column contains the values 6 7, 6, 4, 1, 0. </Description>
                            </Figure>
                            <Paragraph>Figure 14 shows the results of using the array formula to calculate frequency. It is important to be aware than any error entered may result in an incorrect grouped frequency table and provide false information about the business. </Paragraph>
                        </Discussion>
                    </Activity>
                    <Paragraph>As mentioned above, a grouped frequency table is the best option to visualise the frequency of values in a large data set. However, if you are interested to know the proportion of a particular value in relation to the total number of values in the data set, then a relative frequency distribution table is the better option. In the next session, you will learn how to produce a relative frequency table in Excel. </Paragraph>
                </SubSection>
                <SubSection>
                    <Title>2.2.4 Relative frequency distribution tables</Title>
                    <Paragraph>Relative frequency distribution is another type of frequency distribution. This type of distribution tells us the proportion of each value or class interval of a variable in a sample. In other words, relative frequency distribution describes the number of times a particular value occurs in relation to the total number of values. You can use this type of frequency distribution for any type of variable when you are more interested in comparing frequencies than the actual number of observations. </Paragraph>
                    <Paragraph>For example, Team A has won 6 football games from a total of 12 football games played. The frequency of winning is 6 and the relative frequency of winning is 50% (i.e. 60/12=0.5).</Paragraph>
                    <Paragraph>You will learn how to make a relative frequency table in Excel in the following activity.</Paragraph>
                    <Activity>
                        <Heading>Activity 4: How to make a relative frequency table in Excel</Heading>
                        <Timing>Allow approximately 45 minutes to complete this activity</Timing>
                        <Question>
                            <Paragraph>In this activity, you will build a relative frequency table using the ungrouped frequency distribution table from Activity 2. Use the ungrouped frequency table that you made in the previous activities. </Paragraph>
                            <Paragraph>Use the file JC Electrics template, sheet Relative frequency table. In Column D insert the name Relative Frequency. Data in Columns A, B and C is the same as calculated for the Ungrouped frequency table. In Column C, Cell C14 insert the formula =SUM(C5:C13). This will calculate the sum of all frequencies calculated. The returned value is 24. </Paragraph>
                            <Paragraph>In Column D, Cell D5 insert formula =C5/$C$14. The returned value is 0.166666667. If you prefer to see this value in percentages, you can select the cells in column D, right mouse click, select Format Cells, followed by Percentages. For a decimal place of value 2, it will return a value for the relative frequency as 16.66%. If this value is rounded up, the relative frequency in this case is 17%. </Paragraph>
                            <Paragraph>You can use the same formula to continue calculate the relative frequency. </Paragraph>
                            <Paragraph>Once you have made the relative frequency distribution table in Excel, check your answer by clicking ‘Reveal discussion’ below. </Paragraph>
                            <Paragraph>The ungrouped frequency distribution table consists of three columns. Column A is labelled ‘Generators’, Column B is labelled ‘Generators’ and Column C is labelled ‘Frequency’. Add a fourth column to the table for the Relative Frequency. </Paragraph>
                            <Paragraph>To calculate the relative frequencies, you need to divide each frequency by the sample size (frequency / sample size). You can calculate the sample size by taking the sum of all the frequencies in Column C, which is 24. </Paragraph>
                            <BulletedList>
                                <ListItem>Check your answers in the Excel file JC Electrics (solved). </ListItem>
                            </BulletedList>
                            <Paragraph>Optional: As further practice, you can now use the data for the other two products from JC Electrics 1 (Electric motors and Transformers) to identify the relative frequency of products sold. You can check your answer in the Excel files: <olink targetdoc="JC Electrics motors spreadsheet - solved">JC Electrics (solved) Electric motors</olink> and <olink targetdoc="JC Electrics transformers spreadsheet - solved">JC Electrics (solved) Transformers</olink>. </Paragraph>
                        </Question>
                        <Discussion>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig15.png" x_folderhash="9cb7b1d8" x_contenthash="5bfed539" x_imagesrc="b126_3_fig15.png" x_imagewidth="700" x_imageheight="784"><!--Asset 493720 --></Image>
                                <Caption><b>Figure 15</b> Calculating the relative frequency of ungrouped data </Caption>
                                <Alternative>Excel spreadsheet for JC Electrics</Alternative>
                                <Description>This figure shows the JC Electrics Quarterly data of units sold, with columns labelled from left to right as: ‘Generators’, ‘Generators’, ‘Frequency’ and ‘Relative frequency’. From top to bottom, the first Generators column contains the values 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. The second ‘Generators’ column contains the values 7, 8, 9, 10, 11, 12, 13, 14, 15 and Sample size. The Frequency column contains the values 4, 2, 5, 2, 3, 3, 2, 2, 1, and in red, 24. The Relative frequency column contains the values 0.166666667, 0.083333333, 0.208333333, 0.083333333, 0.125, 0.125, 0.083333333, 0.083333333, 0.041666667</Description>
                            </Figure>
                            <Paragraph>We calculate the relative frequency table whenever we are interested in knowing what proportion or percentage of the observations in a data set take on a certain value or fall within a range of values. Any error in the calculation may lead a business to make the wrong decision based on faulty information. </Paragraph>
                        </Discussion>
                    </Activity>
                    <Paragraph>In the next section, you will learn how to make cumulative frequency distribution tables in Excel.</Paragraph>
                </SubSection>
                <SubSection>
                    <Title>2.2.5 Cumulative frequency distribution tables</Title>
                    <Paragraph>Cumulative frequency distribution is the fourth type of frequency distribution table. This type of frequency distribution can be used for <b>ordinal</b> or <b>quantitative variables</b>, especially when you want to understand how often observations fall below certain values. </Paragraph>
                    <Paragraph>For example, Company A sells 250 books in the first week, 150 books in the second week and 400 books in the third week. The cumulative number of books sold in the second week by Company A is 400 books (250 books in the first week + 150 books in the second week). The cumulative number in the third week is 800 books (250 books in the first week + 150 books in the second week + 400 books in the third week). </Paragraph>
                    <Paragraph>In the following activity, you will learn how to build a cumulative frequency distribution table in Excel.</Paragraph>
                    <Activity>
                        <Heading>Activity 5: How to make a cumulative frequency distribution table in Excel</Heading>
                        <Timing>Allow approximately 45 minutes to complete this activity</Timing>
                        <Question>
                            <Paragraph>In this activity, you will build a cumulative frequency distribution table using the grouped frequency distribution table in Activity 3. Once you have built the cumulative frequency distribution table, you can check your answer by clicking ‘Reveal discussion’ below. </Paragraph>
                            <Paragraph>Use the data from the grouped frequency distribution table from Activity 3 in the template provided in JC Electrics template file. This table consists of six columns. Column A is labelled Generators, Column B is labelled Class intervals, Column C is labelled Lower limit of class interval, Column D is labelled Upper limit of class interval and Column E is labelled Frequency. Column F considers the cumulative frequency. The cumulative frequency is calculated by adding each frequency from a frequency distribution table to the sum of its predecessors. The last value will always be equal to the total for all observations, since all frequencies will already have been added to the previous total. Therefore, the expected final answer is 24. </Paragraph>
                            <Paragraph>To calculate the cumulative frequency, in Cell F5 enter formula = =E5, this is the same value of the first frequency. Now, in Cell F6, to calculate the cumulative frequency, you will add the new frequency value to the previous one. In Cell F6, enter the formula =F5+E6. In Cell F7, the formula will be =F6+E7. Continue this step up to Cell F9.</Paragraph>
                            <BulletedList>
                                <ListItem>Check your answers in the Excel file: <olink targetdoc="JC Electrics spreadsheet - solved">JC Electrics (solved)</olink>. </ListItem>
                                <ListItem>Optional: As further practice, you can now use the data for the other two products from JC Electrics 1 (Electric motors and Transformers) to identify the Cumulative frequency of products sold.</ListItem>
                            </BulletedList>
                        </Question>
                        <Discussion>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_fig16.png" x_folderhash="9cb7b1d8" x_contenthash="0b40e702" x_imagesrc="b126_3_fig16.png" x_imagewidth="1413" x_imageheight="827"><!-- Asset 420022--></Image>
                                <Caption><b>Figure 16</b> Calculating the cumulative frequency</Caption>
                                <Alternative>Excel spreadsheet for JC Electrics</Alternative>
                                <Description>This figure shows the JC Electrics Quarterly data of units sold, with columns labelled from left to right as: Generators, Class intervals, Lower limit of class interval, Upper limit of class interval, Frequency and Cumulative frequency. From top to bottom, the Generators column contains the values 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. The Class intervals column contains the values 7_8, 9_10, 11_12, 13_14, 15_16. The Lower limit of class interval contains the values 7, 9, 11, 13, 15. The Upper limit of class interval contains the values 8, 10, 12, 14, 16. The Frequency column contains the values 6, 7, 6, 4, 1; and the Cumulative frequency column contains the values 6, 13, 19, 23, 24.</Description>
                            </Figure>
                            <Paragraph>A cumulative frequency table is used to determine the number of observations that lie above or below a certain value in a data set. Any error in the table may not reflect the reality of the data. </Paragraph>
                        </Discussion>
                    </Activity>
                    <Paragraph>In the next section, you will learn how to draw histograms in Excel. </Paragraph>
                </SubSection>
            </Section>
            <Section>
                <Title>2.3 Histograms: a graphical visualisation of frequency tables</Title>
                <Paragraph>A histogram is a very popular visualisation tool to summarise the distribution of continuous or discrete data. In a histogram, the variable is divided into intervals called ‘bins’. You then count the number of observations in each bin. The horizontal <i>x</i>-axis displays the ‘bins’ and vertical <i>y</i>-axis displays the number of observations in each bin. Histograms can help you to see whether the data is clustered around certain values or whether there are many small or many large values. A typical histogram in Excel looks like the following bar chart. In a correct histogram, there are no spaces or gaps between the bars.</Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_1_fig17_typical_histogram.tif" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/b126_1_fig17_typical_histogram.tif" x_printonly="y" x_folderhash="8f25bc15" x_contenthash="0dac5f28" x_imagesrc="b126_1_fig17_typical_histogram.tif.jpg" x_imagewidth="512" x_imageheight="299"><!--Asset 493705--></Image>
                    <Caption><b>Figure 17</b> A typical histogram in Excel </Caption>
                    <Alternative>A histogram in Excel</Alternative>
                    <Description>This figure shows a histogram in Excel. The horizontal <i>x</i>-axis the numbers from minus 3 to 5 in increments of 0.5, with a final bar labelled ‘&gt;5’. The vertical <i>y</i>-axis shows the ‘frequency’ and ranges from 0 to 140 in increments of 20.</Description>
                </Figure>
                <Paragraph>In the following activity, you will learn how to plot a histogram in Excel. </Paragraph>
                <Activity>
                    <Heading>Activity 6: Using Excel to draw a histogram</Heading>
                    <Timing>Allow approximately 60 minutes to complete this activity</Timing>
                    <Question>
                        <Paragraph>In this activity, you will learn how to produce a histogram in Excel by following the instructions that are given below. Once you have produced the histogram in Excel, you can check your answer by clicking ‘Reveal discussion’ below. </Paragraph>
                        <BulletedList>
                            <ListItem>Open the Excel file called <olink targetdoc="JC Electrics spreadsheet - OpenLearn">JC Electrics template</olink>, sheet Histogram which contains the quarterly data of number of generators sold. Column A contains information about the number of generators sold per quarter.</ListItem>
                            <ListItem>Use the same data and table format as calculated in the Group frequency table. </ListItem>
                            <ListItem>There are many ways to calculate the width of the bin in Excel. One of the easiest ways to calculate this is similar as the width of the bin or class intervals. As previously indicated, this would be the range divided by square root of sample size, which was 1.63. </ListItem>
                            <ListItem>Click on ‘Data Analysis’ in the ‘Data’ ribbon. This will bring up a list of some of the statistical analyses that you can perform in Excel. </ListItem>
                            <ListItem>Select ‘Histogram’ and click ‘OK’.</ListItem>
                            <ListItem>Specify the input range as A5:A28 and the bin range as D5:D9. Excel automatically allocates a $ sign for each selection as in Figure 18.</ListItem>
                            <ListItem>Tick the box ‘Chart Output’ and specify the output location as H12, as shown below.</ListItem>
                        </BulletedList>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f18.tif" src_uri="\\dog\printlive\nonCourse\OpenLearn\Courses\B126\b126_1\Thumbnails\resized\b126_u01_s02_f18.tif.jpg" webthumbnail="true" x_printonly="y" x_folderhash="693cb533" x_contenthash="669fb8c1" x_imagesrc="b126_u01_s02_f18.tif.jpg" x_imagewidth="800" x_imageheight="557" x_smallsrc="b126_u01_s02_f18.tif.small.jpg" x_smallfullsrc="\\dog\printlive\nonCourse\OpenLearn\Courses\B126\b126_1\Thumbnails\resized\b126_u01_s02_f18.tif.small.jpg" x_smallwidth="512" x_smallheight="363"><!--Asset 420024--></Image>
                            <Caption><b>Figure 18</b> A frequency distribution table in Excel </Caption>
                            <Alternative>Excel spreadsheet for JC Electrics</Alternative>
                            <Description>This figure shows the JC Electrics Quarterly data of units sold, with columns labelled from left to right as: Generators, Class intervals, Lower limit of class interval, Upper limit of class interval and Frequency. From top to bottom, the Generators column contains the values 7, 7, 7, 7, 8, 8, 9, 9, 9, 9, 9, 10, 10, 11, 11, 11, 12, 12, 12, 13, 13, 14, 14, 15. The Class intervals column contains the values 7_8, 9_10, 11_12, 13_14, 15_16. The Lower limit of class interval contains the values 7, 9, 11, 13, 15. The Upper limit of class interval contains the values 8, 10, 12, 14, 16. The Frequency column contains the values 6, 7, 6, 4, 1.</Description>
                        </Figure>
                        <Paragraph>Click ‘OK’. Excel will put the histogram next to your frequency table.</Paragraph>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f19.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f19.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="c163abda" x_imagesrc="b126_u01_s02_f19.tif.jpg" x_imagewidth="375" x_imageheight="252"><!--Asset 420025--></Image>
                            <Caption><b>Figure 19</b> Histogram dialog box in Excel </Caption>
                            <Alternative>Histogram dialog box in Excel</Alternative>
                            <Description>This figure shows the histogram dialog box in Excel. On the left is a heading that reads Input, under which are Input range and Bin range, with boxes to the right to add the required information. Under the range tags is an empty tick box named Labels. This is followed by the header Output options, with radio buttons underneath called Output range (which is selected), New worksheet ply, and New workbook. Output range and New worksheet ply have boxes to the right to input values. Under the radio buttons are three tick boxes labelled, from top to bottom, Pareto (sorted histogram), Cumulative percentage, and Chart output. Chart output is selected. To the top right of the window are buttons labelled OK, Cancel, and Help.</Description>
                        </Figure>
                        <Paragraph>Click ‘OK’. Excel will put the histogram next to your frequency table.</Paragraph>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_1_fig20_histogram_units_sold.tif" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/b126_1_fig20_histogram_units_sold.tif" webthumbnail="false" x_printonly="y" x_folderhash="8f25bc15" x_contenthash="7b1101e1" x_imagesrc="b126_1_fig20_histogram_units_sold.tif.jpg" x_imagewidth="512" x_imageheight="217"><!--Asset 420026--></Image>
                            <Caption><b>Figure 20</b> Histogram of number of units sold (Generators) </Caption>
                            <Alternative>Excel spreadsheet showing a table and histogram for JC Electrics</Alternative>
                            <Description>This figure shows a table to the left and a histogram to the right. The table has two columns headed Bin and Frequency. The values in Bin are 8, 10, 12, 14, 16, More; the values in Frequency are 6, 7, 6, 4, 1, 0. The histogram shows five blue bars. The x-axis is labelled Bin and runs from 8 to More in increments of 2. The y-axis is labelled Frequency and runs from 0 to 8 in increments of 2. The bars represent the values from the table, showing 8 on the x-axis with 6 on the y-axis, followed by 10 (x) with 7 (y); 12 (x) with 6 (y), 14 (x) with 4 (y) and 16 (x) with 1 (y). There is no bar in the More (x) with 0 (y) space.</Description>
                        </Figure>
                        <BulletedList>
                            <ListItem>To remove the space between the bars, right click a bar, click<b> Format Data Series</b>, and change the Gap Width to 0%. </ListItem>
                            <ListItem>To add borders, right click a bar, click <b>Format Data Series</b>, click the <b>Fill &amp; Line</b> icon, click <b>Border</b>, and select a colour. </ListItem>
                            <ListItem>Now click ‘Reveal discussion’ to compare what you have made against the answer.</ListItem>
                        </BulletedList>
                    </Question>
                    <Discussion>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_1_fig21_histogram_units_sold_generators.tif" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/b126_1_fig21_histogram_units_sold_generators.tif" webthumbnail="false" x_printonly="y" x_folderhash="8f25bc15" x_contenthash="95fd7d7a" x_imagesrc="b126_1_fig21_histogram_units_sold_generators.tif.jpg" x_imagewidth="512" x_imageheight="269"><!--Asset 493723--></Image>
                            <Caption><b>Figure 21</b> Histogram showing units sold of generators </Caption>
                            <Alternative>A histogram in Excel</Alternative>
                            <Description>This figure shows a histogram with five blue bars. The x-axis is labelled Bin and runs from 8 to More in increments of 2. The y-axis is labelled Frequency and runs from 0 to 8 in increments of 2. The bars show 8 on the x-axis with 6 on the y-axis, followed by 10 (x) with 7 (y); 12 (x) with 6 (y), 14 (x) with 4 (y) and 16 (x) with 1 (y). There is no bar in the More (x) with 0 (y) space.</Description>
                        </Figure>
                        <Paragraph>Working through the steps given above should return the completed histogram shown in Figure 20.</Paragraph>
                    </Discussion>
                </Activity>
            </Section>
            <Section>
                <Title>2.4 Frequency density</Title>
                <Paragraph>Frequency density is defined as the frequency per unit of the data in each class. Frequency density is calculated by dividing the frequency by the class width (the class width is the difference between the upper limit of the class interval and the lower limit of the class interval). Frequency density allows for a meaningful comparison of different classes where the class widths may not be equal. </Paragraph>
                <Equation>
                    <MathML>
                        <math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                            <mrow>
                                <mtext>Frequency density</mtext>
                                <mspace depth="0.0ex" height="0.0ex" width="0.3em"/>
                                <mtext>=</mtext>
                                <mspace depth="0.0ex" height="0.0ex" width="0.3em"/>
                                <mfrac>
                                    <mrow>
                                        <mtext>Frequency</mtext>
                                    </mrow>
                                    <mrow>
                                        <mtext>Class width</mtext>
                                    </mrow>
                                </mfrac>
                            </mrow>
                        </math>
                    </MathML>
                    <Alternative>Frequency density equals frequency over class width</Alternative>
                </Equation>
                <Paragraph>The frequency density <b>gives us the ratio of the frequency of a class to its width</b>. Frequency density is used to plot a frequency density histogram; here, we plot frequency density instead of frequency on the <i>y</i>-axis. Frequency density gives us the total area of bars and tells us about the frequency in the histogram (rather than the height). </Paragraph>
                <Paragraph>We calculate frequency density when we have a set of grouped data that consists of unequal widths of class intervals. For example, see the following Excel worksheet, which gives us information about the ages of people playing cricket. </Paragraph>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f22.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f22.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="dbe183c3" x_imagesrc="b126_u01_s02_f22.tif.jpg" x_imagewidth="492" x_imageheight="146"><!--Asset 420027--></Image>
                    <Caption><b>Figure 22</b> Information about the age of people playing football in an Excel file </Caption>
                    <Alternative>A histogram in Excel</Alternative>
                    <Description><Paragraph>This figure shows a spreadsheet which consists of four columns labelled Age, Frequency, Class width and Frequency density. The Age column contains the values <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>0</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>10</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>10</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>15</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>15</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>20</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>20</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>30</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>30</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>50</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>. The Frequency column contains the values 10, 9, 6, 4, 7. The remaining two columns are empty.</Paragraph></Description>
                </Figure>
                <Paragraph>To calculate the frequency densities: </Paragraph>
                <BulletedList>
                    <ListItem>In Column C, find the width of the class intervals by finding the difference of upper and lower bounds/limits. (For example, and so on.) </ListItem>
                </BulletedList>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f23.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f23.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="fa8eddce" x_imagesrc="b126_u01_s02_f23.tif.jpg" x_imagewidth="495" x_imageheight="143"><!--Asset 420028--></Image>
                    <Caption><b>Figure 23</b> Calculating the class width </Caption>
                    <Alternative>Excel spreadsheet</Alternative>
                    <Description><Paragraph>This figure shows a spreadsheet which consists of four columns labelled Age, Frequency, Class width and Frequency density. The Age column contains the values <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>0</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>10</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>10</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>15</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>15</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>20</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>20</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>30</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>30</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>50</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>. The Frequency column contains the values 10, 9, 6, 4, 7. The Class width column contains the values 10, 5, 5, 10, 20. The Frequency density column is empty.</Paragraph></Description>
                </Figure>
                <BulletedList>
                    <ListItem>Then, in Column D, divide the frequency of each class interval by its width following the Frequency density formula given above. Even though the frequency in the first age bracket is higher, because the interval is twice as large as in the second age bracket, it could be misleading. From looking at the frequency density column, we can see that cricket is most popular in the 10-15 age category, with a frequency density of 1.8. </ListItem>
                </BulletedList>
                <Figure>
                    <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f24.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f24.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="a437828c" x_imagesrc="b126_u01_s02_f24.tif.jpg" x_imagewidth="574" x_imageheight="187"><!-- Asset 420029--></Image>
                    <Caption><b>Figure 24</b> Calculating the frequency density</Caption>
                    <Alternative>Excel spreadsheet</Alternative>
                    <Description><Paragraph>This figure shows a spreadsheet which consists of four columns labelled Age, Frequency, Class width and Frequency density. The Age column contains the values <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>0</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>10</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>10</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>15</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>15</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>20</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>20</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>30</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>, <InlineEquation><MathML><math xmlns="http://www.w3.org/1998/Math/MathML" display="block" scriptlevel="0" displaystyle="true">
                                        <mrow>
                                            <mn>30</mn>
                                            <mo>≤</mo>
                                            <mo>×</mo>
                                            <mo>&lt;</mo>
                                            <mn>50</mn>
                                        </mrow>
                                    </math>
</MathML></InlineEquation>. The Frequency column contains the values 10, 9, 6, 4, 7. The Class width column contains the values 10, 5, 5, 10, 20. The Frequency density column contains the values 1, 1.8, 1.2, 0.4, 0.35.</Paragraph></Description>
                </Figure>
                <Paragraph>In the activity below, you will test your knowledge of the difference between a frequency density histogram and a frequency histogram. </Paragraph>
                <Activity>
                    <Heading>Activity 7: The difference between frequency histograms and frequency density histograms</Heading>
                    <Timing>Allow approximately 35 minutes to complete this activity</Timing>
                    <Question>
                        <Paragraph>Watch the video below and note down in the box the difference between frequency histograms and frequency density histograms.</Paragraph>
                        <MediaContent src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_2024d_vsc019_1280x720.mp4" type="video" width="512" x_manifest="b126_2024d_vsc019_1_server_manifest.xml" x_filefolderhash="18107067" x_folderhash="18107067" x_contenthash="88a78d28" x_subtitles="b126_2024d_vsc019_1280x720.srt">
                            <Caption><b>Video 1</b></Caption>
                            <Transcript>
                                <Speaker>NARRATOR</Speaker>
                                <Remark>Difference between histograms. In this video, we will discuss the following topics-- what we mean by frequency density, how we calculate frequency density and construct the frequency density histogram, what is the difference between the frequency histogram and the frequency density histogram. </Remark>
                                <Remark>Let's first have a look at what frequency density is. Frequency density is defined as the frequency per unit of the data in each class. Frequency density gives us the ratio of the frequency of a class to its width. Frequency density is used to plot the histogram. And it allows for a meaningful comparison of different classes where the class width may not be equal. </Remark>
                                <Remark>So how is frequency density calculated? Frequency density is calculated by dividing the frequency with the class width. We calculate frequency density when we have a set of group data that consists of an unequal width of class intervals. </Remark>
                                <Remark>Let's have a look at this worksheet for an example. This Excel sheet consists of information relating to the age of people playing football. The spreadsheet consists of four columns. Column A shows the age range of the people playing football in distinct groups. Column B shows the frequency. Column C is labeled as class width. And column D is labeled as frequency density. </Remark>
                                <Remark>To calculate the frequency density of each class interval or group, first, you should determine the class width of each group. In column C, we can find the class width of the class intervals or groups by determining the difference of upper and lower bounds, also called limits. For example, in column C on the first row, type equals 10 minus 0 equals 10, equals 15 minus 10 equals 5, and so on. </Remark>
                                <Remark>Then in column D, you will calculate the frequency density by using the formula for frequency density, which is the frequency of each class interval or group divided by its width. For example, on the first row of column D, we divide the frequency of the first class interval or group, which is 10 by its class width, which is also 10, so equals 10 divided by 10 equals 1, which is the frequency density of the first class interval or group. </Remark>
                                <Remark>You can calculate the frequency density of the rest of the groups in the same way. And we get nine divided by five equals 1.8. 18 divided by 35 equals 0.5. 7 divided by 10 equals 0.7. And 5 divided by 10 equals 0.5. </Remark>
                                <Remark>Once you calculate the frequency density of each group, you will be able to construct the frequency density histogram. As we said earlier, frequency density is used to plot the histogram and allows for a meaningful comparison of different classes where the class width may not be equal. 
Here you can see that we plot the frequency density on the vertical y-axis and the class interval or group, which in this case is the age range of the people playing football, on the horizontal x-axis. </Remark>
                                <Remark>The height of each bar shows the frequency density of each group. You will also see that the bar of range 15 to 50 is wider than the bar of range 0 to 10. In the frequency density histogram, the frequency of each group is equal to the area of each class. And it can be calculated by multiplying the class width by the frequency density. </Remark>
                                <Remark>So the frequency is equal to the area of the class, which is equal to the class width multiplied by the frequency density. </Remark>
                                <Remark>Let's now have a look at what the difference between the frequency histogram and frequency density histogram is. </Remark>
                                <Remark>Here, we have a frequency histogram and frequency density histogram and their corresponding frequency tables. In each case, we are measuring quantitative data that is continuous. This here is the frequency histogram. And you will notice that the bars or groups consist of a range of quantitative data values that are equally spaced out, whereas in the case of the frequency density histogram, which is this histogram here, the bars or groups of each range of quantitative data values are not equally spaced out. </Remark>
                                <Remark>This is not always the case as you can have a frequency density histogram which has equally spaced out bars or columns. In both the cases of the frequency histogram and the frequency density histogram, the bars are connected with each other. And there are no spaces between the bars. </Remark>
                                <Remark>Another distinction between the frequency histogram and the frequency density histogram is that in the case of the frequency histogram, the height of the bars corresponds to the frequencies, whereas in the case of the frequency density histogram, the height of the bars corresponds to the frequency density. The order or sequence of the bars cannot be changed again. For example, once you determine the class intervals or groups, then you cannot change their orders.</Remark>
                            </Transcript>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_2024d_vsc019_1280x720_still.jpg" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/AV/b126_2024d_vsc019_1280x720_still.jpg" x_folderhash="18107067" x_contenthash="7a16224c" x_imagesrc="b126_2024d_vsc019_1280x720_still.jpg" x_imagewidth="512" x_imageheight="288"/>
                                <Alternative/>
                            </Figure>
                        </MediaContent>
                    </Question>
                    <Interaction>
                        <FreeResponse size="paragraph" id="b126_act1-16_fr001"/>
                    </Interaction>
                </Activity>
            </Section>
        </Session>
        <Session>
            <Title>3 Bivariate data </Title>
            <Paragraph>Bivariate data refers to an instance in which two separate variables are examined and compared. For example, a performance manager may be interested to know how long (time) it takes employees to complete their assigned task (output). In other words, the interest is to measure the efficiency of the employees. In this example, the performance manager is examining two variables: time and output. </Paragraph>
            <Paragraph>Bivariate data is collected to explore the relationship between two variables and then uses this relationship to inform future decisions. One of the main aims of the researcher is to find out whether changes in one variable might be caused by changes in the other variable. This type of research involves two basic types of variables: independent variables and dependent variables. </Paragraph>
            <Paragraph><b>Independent variables</b>: An independent variable is <b>one that stands alone and is not changed by the other variable you are trying to measure</b>. The researcher changes the independent variable to see the effect it will have on the dependent variable. </Paragraph>
            <Paragraph><b>Dependent variables</b>: A dependent variable is <b>the one that changes because of independent variable manipulation</b>. It is the outcome you are interested in measuring, and it ‘depends’ on your independent variable. In statistics, dependent variables are also called response variables (as they respond to a change in another variable). </Paragraph>
            <Paragraph>For example, say a researcher is interested to know if mature students’ performance in a maths class changes based on the time of the class. To answer this question, the researcher measures mature students’ performance in a morning class and an evening class. The study finds that mature students perform better in the evening class than in the morning class. </Paragraph>
            <Paragraph>What are the independent and dependent variables in the example above? The independent variable is the time of the class, and the dependent variable is mature students’ performance in maths, as it might change in relation to the independent variable. </Paragraph>
            <Paragraph>In the next activity, you will expand your knowledge of bivariate data. </Paragraph>
            <Activity>
                <Heading>Activity 8: Bivariate data</Heading>
                <Timing>Allow approximately 30 minutes to complete this activity</Timing>
                <Question>
                    <Paragraph>Watch the following video and make notes on bivariate data in the free response box below.</Paragraph>
                    <MediaContent src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_2024d_vsc022_1280x720.mp4" type="video" width="512" x_manifest="b126_2024d_vsc022_1_server_manifest.xml" x_filefolderhash="18107067" x_folderhash="18107067" x_contenthash="ebe9a176" x_subtitles="b126_2024d_vsc022_1280x720.srt">
                        <Caption><b>Video 2</b></Caption>
                        <Transcript>
                            <Speaker>NARRATOR</Speaker>
                            <Remark>Hello. In this video, we will discuss what we mean by bivariate data. Well, just by the word bivariate data, we know that we are dealing with some kind of data. But what kind of data is the question. To better understand the word bivariate, we need to break it down in two. The prefix bi means two or twice, and variant means a quantity that can take the value of any member of a particular set, a variable. </Remark>
                            <Remark>So bivariate data are data that have two variables with different values. With bivariate data, we are comparing two different sets of variables. The one variable is the dependent variable, and the other variable is the independent variable. Let's look at an example. </Remark>
                            <Remark>As you can see, this table has three columns. The columns are labeled as days, temperature, and the number of Pepsi cans sold. So this table is showing the data relating to the number of Pepsi cans sold in the different days of a week. The two sets of variant data we have here are the temperature, and the number of Pepsi cans sold. </Remark>
                            <Remark>The temperature on this table is the independent variable, and the number of Pepsi cans is the dependent variable, because the number of Pepsi cans sold depends on the increase or decrease in temperature. As you can see in this table, on the day the temperature was 12 degrees celcius, the number of Pepsi cans sold was 12. When the temperature increased from 12 to 18 degrees celcius, the sale of Pepsi cans also increased. </Remark>
                            <Remark>Now, let's have a look at how these two different sets of data can be compared. We can plot this data into a graph, and the scatter plot is one of the best graphs for representing bivariate data. On this scatter plot here, we have the x-axis, which is the horizontal axis, representing the temperature, which is our independent variable. And on the y-axis, which is the vertical axis, we have the dependent variable, which is the number of Pepsi cans sold. </Remark>
                            <Remark>So from the graph, you can see that with the increase in temperature, we also have an increase in the number of Pepsi cans sold. In other words, this graph shows a positive correlation between the two variables because with the increase in temperature, the sale of Pepsi cans has also increased. </Remark>
                            <Remark>There can be cases where there is no positive correlation, as with the example here, where the increase in temperature has not seen an increase in the number of packets of biscuits sold. This is what we call a negative correlation. </Remark>
                            <Remark>In this video, you learned what bivariate data are, and that they are best graphed in a scatter plot to show either a positive or negative correlation.</Remark>
                        </Transcript>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_2024d_vsc022_1280x720_still.jpg" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/AV/b126_2024d_vsc022_1280x720_still.jpg" x_folderhash="18107067" x_contenthash="6080c016" x_imagesrc="b126_2024d_vsc022_1280x720_still.jpg" x_imagewidth="512" x_imageheight="288"/>
                            <Alternative/>
                        </Figure>
                    </MediaContent>
                </Question>
                <Interaction>
                    <FreeResponse size="paragraph" id="b126_act1-17_fr001"/>
                </Interaction>
            </Activity>
            <Paragraph>Bivariate data can be visualised using contingency tables and scatter diagrams. In the next section you will learn about contingency tables</Paragraph>
            <Section>
                <Title>3.1 Contingency tables</Title>
                <Paragraph>A contingency table is called a cross table or two-way-table and counts observations for each unique combination of values in two variables. It is a table of data in which the row entries tabulate the data according to one variable and the column entries tabulate the data according to another variable. </Paragraph>
                <Paragraph>The tool best suited for a data set will depend on the variable’s scale of measurement. The most important distinction here is whether a variable is discrete or continuous. If it is discrete, a convenient method to summarise bivariate data is a contingency table. If it is continuous, a scatter diagram can be more useful in visualising the data set. </Paragraph>
                <Paragraph>Contingency tables are used in statistics to understand the relationship between categorical variables. For example, say we want to summarise the following sample of firms regarding their industry sector and size: </Paragraph>
                <Table class="normal" style="topbottomrules">
                    <TableHead>Table 7 Sample of firms</TableHead>
                    <tbody>
                        <tr>
                            <th>Firm</th>
                            <th>Sector</th>
                            <th>Employees</th>
                        </tr>
                        <tr>
                            <td>1</td>
                            <td>Technology</td>
                            <td>&lt;50</td>
                        </tr>
                        <tr>
                            <td>2</td>
                            <td>Food</td>
                            <td>50+</td>
                        </tr>
                        <tr>
                            <td>3</td>
                            <td>Technology</td>
                            <td>&lt;50</td>
                        </tr>
                        <tr>
                            <td>4</td>
                            <td>Food</td>
                            <td>&lt;50</td>
                        </tr>
                        <tr>
                            <td>5</td>
                            <td>Food</td>
                            <td>&lt;50</td>
                        </tr>
                        <tr>
                            <td>6</td>
                            <td>Food</td>
                            <td>50+</td>
                        </tr>
                        <tr>
                            <td>7</td>
                            <td>Technology</td>
                            <td>&lt;50</td>
                        </tr>
                        <tr>
                            <td>8</td>
                            <td>Technology</td>
                            <td>&lt;50</td>
                        </tr>
                        <tr>
                            <td>9</td>
                            <td>Technology</td>
                            <td>50+</td>
                        </tr>
                        <tr>
                            <td>10</td>
                            <td>Food</td>
                            <td>50+</td>
                        </tr>
                    </tbody>
                </Table>
                <Paragraph>See the following cross table, which elegantly summarises the information of our sample data. </Paragraph>
                <Table class="normal" style="topbottomrules">
                    <TableHead>Table 8 Cross table for sample firms</TableHead>
                    <tbody>
                        <tr>
                            <th>Sector</th>
                            <th>&lt;50 Employees
</th>
                            <th>
50+ Employees</th>
                            <th>Total firms</th>
                        </tr>
                        <tr>
                            <td>Technology</td>
                            <td>4</td>
                            <td>1</td>
                            <td>5</td>
                        </tr>
                        <tr>
                            <td>Food</td>
                            <td>2</td>
                            <td>3</td>
                            <td>5</td>
                        </tr>
                        <tr>
                            <td>Total</td>
                            <td>6</td>
                            <td>4</td>
                            <td>10</td>
                        </tr>
                    </tbody>
                </Table>
                <Paragraph>The cross table shows that there are 2 firms in the food manufacturing sector that have less than 50 employees. However, the cell 50+ shows that there are 3 firms in the food manufacturing sector that have more than 50 employees. The sum of the total food manufacturing firm is 5 which is the 50% of the grand total. </Paragraph>
                <Paragraph>Contingency tables vary in size and type because the size of the contingency table depends on the number of different ways in which data is categorised. </Paragraph>
                <Paragraph>There is no formula to draw a contingency table in Excel. However, analysts use a PivotTable to build contingency tables. A PivotTable is considered a powerful statistical tool to summarise bivariate and multivariate data sets in an Excel spreadsheet or database table and obtain the desired report. This tool does not actually change the spreadsheet or database itself; it simply pivots or turns the data to view it from different perspectives. Researchers and analysts use PivotTables especially when they have large amounts of data that would be time consuming to calculate by hand. A PivotTable can perform a few data processing functions such as identifying sums, averages, ranges or outliers. It then arranges this information in a simple and meaningful way that draws attention to key values. </Paragraph>
            </Section>
            <Section>
                <Title>3.2 Scatter diagrams</Title>
                <Paragraph>A scatter diagram is another way to visualise a quantitative bivariate data set. This is a two-dimensional diagram or graph with one variable on the <i>x</i>-axis (the independent variable) and the other variable on the <i>y</i>-axis (the dependent variable). We then plot the corresponding point on the diagram. </Paragraph>
                <Paragraph>In the next activity, you will learn how to produce a scatter diagram in Excel.</Paragraph>
                <Activity>
                    <Heading>Activity 9: Drawing scatter diagrams</Heading>
                    <Timing>Allow approximately 60 minutes to complete this activity</Timing>
                    <Question>
                        <Paragraph>In this activity you will learn how to make an ungrouped frequency table in Excel, either by watching the screencast or by following the instructions that are given below. </Paragraph>
                        <MediaContent type="video" src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_2024d_vsc126_720x1280.mp4" width="512" x_manifest="b126_2024d_vsc126_1_server_manifest.xml" x_filefolderhash="18107067" x_folderhash="18107067" x_contenthash="a41b6682" x_subtitles="b126_2024d_vsc126_720x1280.srt">
                            <Caption><b>Video 3</b></Caption>
                            <Transcript>
                                <Speaker>NARRATOR</Speaker>
                                <Remark>This video shows how to draw scatter plots in Microsoft Excel. We'll use the data file W4_Apple_Amzn_returns.xlxs to draw a scatter plot of monthly stock returns for Apple and Amazon. The data file has three columns. The first column, Date, represents calendar months. The second column, Amazon, shows monthly stock returns for Amazon. And the third column, Apple, shows monthly stock returns for Apple. </Remark>
                                <Remark>We see in the month that ended on the 1st of January, 2018, Amazon had positive stock returns of about 24%, while Apple stock went down by about 1%. The data has 25 rows, including one row for the labels, which means that our data has 24 observations of monthly stock returns for the two firms. </Remark>
                                <Remark>Let's highlight our data columns Amazon and Apple by clicking on the columns B and C. Go to Insert. Look under Charts and choose Scatter, which is under the little graph that looks like a few unconnected dots scattered around. Excel will now make a scatter plot from the highlighted data. </Remark>
                                <Remark>A scatter plot helps you visualise how two sets of numerical values are related. Each dot represents two separate measurements for a single instance. So each dot represents one month. And Amazon stock returns for that month are plotted on the x-axis, while Apple returns are plotted on the y-axis. </Remark>
                                <Remark>Once we have a scatter plot, we can add a title. We want to describe both of our measurements in the title, so we'll call this Monthly Returns, Amazon versus Apple. Next, we need to add labels for the x and y-axis. Click on the Chart area. </Remark>
                                <Remark>Go to Add Chart element. Choose Axis Titles. And click on Primary Horizontal to add the x-axis label. And choose Primary Vertical for the y-axis label. It's important that the axis labels show the units of measurement. Both the x-axis and y-axis show percentage returns, so we label them accordingly. </Remark>
                                <Remark>It can be useful to change the position of the horizontal and vertical axes. To change the position of the vertical axis, right-click on the horizontal axis and choose Format Axis. We can move the vertical axis by changing the value of the horizontal axis at the point which it crosses the vertical axis. </Remark>
                                <Remark>If we want to move the vertical axis all the way to the left, we tick Axis value and type in the lowest value for the horizontal axis. The lowest value can be seen here under Bounds. It's minus 30. So we type minus 30 here and click Enter. The y-axis has now moved all the way to the left of the graph and crosses the x-axis at minus 30. </Remark>
                                <Remark>We can move the position of the horizontal axis in the same way. Right-click on the vertical axis and choose Format Axis. To move the horizontal axis down, tick Axis value and type in the lowest value for the vertical axis. The lower vertical bound for our plot is minus 25, so type in minus 25 and click Enter. The x-axis is now moved all the way to the bottom. And the graph or scatter plot is now less cluttered. </Remark>
                            </Transcript>
                            <Figure>
                                <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_2024d_vsc126_720x1280_still.jpg" src_uri="file:////dog/printlive/nonCourse/OpenLearn/Courses/B126/b126_1/AV/b126_2024d_vsc126_720x1280_still.jpg" x_folderhash="18107067" x_contenthash="c0680425" x_imagesrc="b126_2024d_vsc126_720x1280_still.jpg" x_imagewidth="512" x_imageheight="288"/>
                                <Alternative/>
                            </Figure>
                        </MediaContent>
                        <Paragraph>Look at the following example, which shows a data set relating to the temperature on days in June, and the number of Pepsi drinks sold in a small shop. </Paragraph>
                        <Table>
                            <TableHead>Table 9 Temperature and number of drinks sold</TableHead>
                            <tbody>
                                <tr>
                                    <td>Temperature (X) </td>
                                    <td>12</td>
                                    <td>14</td>
                                    <td>15</td>
                                    <td>17</td>
                                    <td>22</td>
                                    <td>13</td>
                                    <td>20</td>
                                    <td>23</td>
                                </tr>
                                <tr>
                                    <td>Pepsi (Y)</td>
                                    <td>12</td>
                                    <td>16</td>
                                    <td>16</td>
                                    <td>19</td>
                                    <td>32</td>
                                    <td>10</td>
                                    <td>24</td>
                                    <td>40</td>
                                </tr>
                            </tbody>
                        </Table>
                        <Paragraph>Now we draw the scatter diagram (see file: <olink targetdoc="Pepsi data">Pepsi data.xlsx</olink>). In the example above, temperature is the independent variable and number of Pepsis is the dependent variable, as the sale of Pepsi is affected by changes in the temperature. Hence, we plot temperature on the <i>x</i>-axis and Pepsi on the <i>y</i>-axis. </Paragraph>
                        <Paragraph>To find out if there is any relationship between variable X (Temperature) and variable Y (Pepsi), execute the following steps in Excel. </Paragraph>
                        <BulletedList>
                            <ListItem>Select the range A1:B9</ListItem>
                        </BulletedList>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f25.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f25.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="0e97d2cf" x_imagesrc="b126_u01_s02_f25.tif.jpg" x_imagewidth="225" x_imageheight="161"><!--Asset 493715--></Image>
                            <Caption><b>Figure 25</b> Spreadsheet of Pepsi sold and Temperature </Caption>
                            <Alternative>Spreadsheet of Pepsi sold and Temperature </Alternative>
                            <Description>A spreadsheet which consists of two columns. Column A is labelled as Temperature and Column B is labelled as Pepsi. From top to bottom, the Temperature column contains the values 12, 14, 15, 17, 22, 13, 20, 23. The Pepsi column contains the values 12, 16, 16, 19, 32, 10, 24, 40.</Description>
                        </Figure>
                        <BulletedList>
                            <ListItem>On the <b>Insert</b> tab, in the <b>Charts</b> group, click the <b>Scatter</b> symbol. </ListItem>
                        </BulletedList>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f26.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f26.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="7442ad8e" x_imagesrc="b126_u01_s02_f26.tif.jpg" x_imagewidth="306" x_imageheight="101"><!--Asset 493717--></Image>
                            <Caption><b>Figure 26</b> How to select the Scatter symbol</Caption>
                            <Alternative>The Recommended charts toolbar in Excel</Alternative>
                            <Description>This figure shows a section of the Recommended charts toolbar in Excel. Over the label ‘Charts’ are different chart icons for histograms, boxplots, bar charts, pie charts and so on. the cursor hovers over the icon representing scatter plots. Next to the collection of icons is a globe icon labelled ‘Maps’ with a dropdown arrow, and a pivot chart icon labelled ‘PivotChart’, also with a dropdown arrow.</Description>
                        </Figure>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f27.tif" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="60518712" x_imagesrc="b126_u01_s02_f27.tif.jpg" x_imagewidth="264" x_imageheight="244"><!--Asset 493718--></Image>
                            <Caption><b>Figure 27</b> How to select the Scatter chart type</Caption>
                            <Alternative>Section of the Scatter type selection menu in Excel</Alternative>
                            <Description>This figure shows a section of the Scatter type selection menu in Excel with icons representing different styles of scatter charts arranged around the top and left edge. In the centre is text which reads: ‘Scatter. Use this chart type to: Compare at least two sets of values or pairs of data; show relationships between sets of values. Use it when: The data represents separate measurements.’</Description>
                        </Figure>
                        <BulletedList>
                            <ListItem>Optional – returning to our example JC Electrics, draw a scatter diagram for Generators</ListItem>
                        </BulletedList>
                    </Question>
                    <Discussion>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_u01_s02_f28.tif" src_uri="https://openuniv.sharepoint.com/sites/bmodules/b126/lmimages/b126_u01_s02_f28.tif.jpg" x_printonly="y" x_folderhash="9cb7b1d8" x_contenthash="0676857e" x_imagesrc="b126_u01_s02_f28.tif.jpg" x_imagewidth="529" x_imageheight="281"><!--Asset 420031--></Image>
                            <Caption><b>Figure 28</b> Temperature versus Pepsi sales </Caption>
                            <Alternative>Scatter diagram showing the relationship between temperature and Pepsi sales</Alternative>
                            <Description>Scatter diagram showing the relationship between Temperature (<i>x</i>-axis) and Pepsi sales (<i>y</i>-axis), with the heading ‘Pepsi versus Temperature’. The Temperature axis ranges from 0 to 25, and the Pepsi sales axis ranges from 0 to 45. The points plotted show that as temperature increases, so do the sales of Pepsi.</Description>
                        </Figure>
                        <Paragraph>The final image shows the positive relationship between the number of Pepsis sold and the temperature. This shows that as the temperature increases, sales of Pepsi also increase. </Paragraph>
                        <Paragraph>For JC Electrics Generators, a scatter diagram for Generators will show as in Figure 28. This presents a slight overall increase in sales, with peaks in sales in quarter 6 with a value of 14, and quarter 10 and 15 with a value of sales of 13 generators, as well as in quarter 21 with a value of 14 and quarter 22 with a value of 15. </Paragraph>
                        <Figure>
                            <Image src="https://www.open.edu/openlearn/pluginfile.php/4939076/mod_oucontent/oucontent/158793/b126_3_533492.png" src_uri="" x_folderhash="8f25bc15" x_contenthash="181d5fbb" x_imagesrc="b126_3_533492.png" x_imagewidth="625" x_imageheight="374"/>
                            <Caption><b>Figure 29</b> Generator sales scatter diagram</Caption>
                            <!--533492-->
                            <Description>Scatter diagram showing the relationship between quarters (x-axis) sales (y-axis), with the heading ‘Generators’. The Temperature axis ranges from 0 to 30, and the sales axis ranges from 0 to 16.</Description>
                        </Figure>
                    </Discussion>
                </Activity>
            </Section>
        </Session>
        <Session>
            <Title>4 Conclusion</Title>
            <Paragraph>You have started to familiarise yourself with the spreadsheet software Excel, which is widely used in workplaces, and useful in many different fields and contexts, such as business, medicine, marketing, tax and auditing, accounting and finance. </Paragraph>
            <Paragraph>You have also studied the basics of data analysis. The focus here was on the several ways to visualise and summarise data using tools available in Microsoft Excel, such as frequency tables, histograms, and scatter diagrams or plots. The main objective of data analysis and statistical modelling is to help make more evidence-based decisions. The various data visualisation tools studied in this session are only the first step toward starting the decision-making process using data. </Paragraph>
            <Paragraph>A second OpenLearn course on data analysis, <a href="https://www.open.edu/openlearn/science-maths-technology/data-analysis-hypothesis-testing/content-section-0">Data analysis: hypothesis testing</a>, is now also available should you wish to take your studies further.</Paragraph>
            <Paragraph>This OpenLearn course is an adapted extract from the Open University course <a href="https://www.open.ac.uk/courses/modules/b126">B126 <i>Business data analytics and decision making</i>.</a></Paragraph>
        </Session>
        <Session>
            <Title>Acknowledgements</Title>
            <Paragraph>This free course was written by the B126 Open University course team. <!--Author name, to be included if required--></Paragraph>
            <!--If archive course include following line: 
This free course includes adapted extracts from the course [Module title IN ITALICS]. If you are interested in this subject and want to study formally with us, you may wish to explore other courses we offer in [SUBJET AREA AND EMBEDDED LINK TO STUDY @OU].-->
            <Paragraph>Except for third party materials and otherwise stated (see <a href="http://www.open.ac.uk/conditions">terms and conditions</a>), this content is made available under a <a href="http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 Licence</a>.</Paragraph>
            <Paragraph>The material acknowledged below is Proprietary and used under licence (not subject to Creative Commons Licence). Grateful acknowledgement is made to the following sources for permission to reproduce material in this free course: </Paragraph>
            <Paragraph>Course image: ismagilov / iStock / Getty Images Plus</Paragraph>
            <Paragraph>Figure 1: jadamprostore / iStock / Getty Images Plus</Paragraph>
            <Paragraph>Figures 2–28: © Excel</Paragraph>
            <!--The full URLs if required should the hyperlinks above break are as follows: Terms and conditions link  http://www.open.ac.uk/ conditions; Creative Commons link: http://creativecommons.org/ licenses/ by-nc-sa/ 4.0/ deed.en_GB]-->
            <!--<Paragraph>Course image <EditorComment>Acknowledgements provided in production specification or by LTS-Rights</EditorComment></Paragraph>-->
            <!--<Paragraph>
        <EditorComment>Please include  further acknowledgements as provided in production specification or by LTS-Rights in following order:
Text



Images



Figures



Illustrations



Tables



AV



Interactive assets</EditorComment>
      </Paragraph>-->
            <Paragraph>Every effort has been made to contact copyright owners. If any have been inadvertently overlooked, the publishers will be pleased to make the necessary arrangements at the first opportunity.</Paragraph>
            <Paragraph/>
            <Paragraph><b>Don't miss out</b></Paragraph>
            <Paragraph>If reading this text has inspired you to learn more, you may be interested in joining the millions of people who discover our free learning resources and qualifications by visiting The Open University – <a href="http://www.open.edu/openlearn/free-courses?LKCAMPAIGN=ebook_&amp;MEDIA=ol">www.open.edu/openlearn/free-courses</a>.</Paragraph>
        </Session>
    </Unit>
    <BackMatter><!--NOW ONLY FOR GLOSSARY: To be completed where appropriate--></BackMatter>
<settings>
    <numbering>
        <Session autonumber="false"/>
        <Section autonumber="false"/>
        <SubSection autonumber="false"/>
        <SubSubSection autonumber="false"/>
        <Activity autonumber="false"/>
        <Exercise autonumber="false"/>
        <Box autonumber="false"/>
        <CaseStudy autonumber="false"/>
        <Quote autonumber="false"/>
        <Extract autonumber="false"/>
        <Dialogue autonumber="false"/>
        <ITQ autonumber="false"/>
        <Reading autonumber="false"/>
        <StudyNote autonumber="false"/>
        <Example autonumber="false"/>
        <Verse autonumber="false"/>
        <SAQ autonumber="false"/>
        <KeyPoints autonumber="false"/>
        <ComputerDisplay autonumber="false"/>
        <ProgramListing autonumber="false"/>
        <Summary autonumber="false"/>
        <Tables autonumber="false"/>
        <Figures autonumber="false"/>
        <MediaContent autonumber="false"/>
        <Chemistry autonumber="false"/>
    </numbering>
    <discussion_alias>Discussion</discussion_alias>
    <session_prefix/>
<version>2025101000</version></settings></Item>
