
Tips.Net > ExcelTips Home > Formulas > Calculating Statistical Values on Different-Sized Subsets of Data
Summary: Analyzing huge amounts of data presents challenges that are different from those faced with smaller amounts of data. This tip examines how you can analyze subsets of a huge data table to derive statistical values from that data. It examines a simple grouping technique, and then applies Excel’s subtotaling and PivotTable features to derive the desired information. (This tip works with Microsoft Excel 97, Excel 2000, Excel 2002, and Excel 2003.)
Chris has a huge amount of data in a worksheet, and he wants to analyze the data based on different groupings within it. For instance, he has data in cells A2:B36001, where row 1 contains the column headings Time and Signal. He wants to divide the data into groups consisting of some arbitrary number of sequential values, and then extract, for each group, a mean value for the Time, a mean value for the Signal, and a standard deviation for the Signal.
The easiest way to handle this type of requirement is to add a column that is used to indicate a group number for each row. Follow these steps:
With the group numbers set up, you are ready to do the analysis. There are a couple of ways you can do this. One way is to use the subtotaling capabilities of Excel. Select one of the cells in the data area and follow these steps:
Excel groups and subtotals the data, as directed. You can hide the detail (and only show the subtotals) by clicking on the small 2 (with the box around it) in the outline area at the left of the worksheet. If you later want to change what is calculated, or if you need to change the number of items in each group, just remove the subtotals (Data | Subtotals | Remove All) and repeat the above steps.
Another way to derive the statistics from your data is to use a PivotTable. Make sure that there are no subtotals in the data and select a cell within the data. Then follow these steps:
You now have the data desired. You may prefer a setup that shows only one row per category for your data. If so, simply click on the Data header and hold down the mouse button as you slowly move the mouse to the right. As you move the mouse pointer into the Total header area, a small icon near the mouse pointer shows a "change" in the layout. Release the mouse button and you will have only one row for each group in your data. (Click here to see a related figure.)
If you need to format the columns (such as changing the format for the Average of Time column) you can do so. If you need to change the number of data items in each group, just go back to the data worksheet and change cell E1 to a different value. You can then return to the PivotTable, right click it, and choose Refresh Data.
Tip #2771 applies to Microsoft Excel versions: 97 2000 2002 2003
Got the Time? If you work with either times or dates in Excel, you really need ExcelTips: Times and Dates. Everything you need to know about slicing, dicing, and generally working with times and dates.
No, not that type of date. If you need to do any types of work with calendar dates, Excel has the tools you need. Learn how to use those tools the easy way. (more information...)
Ask an Excel Question
Make a Comment
ExcelTips FAQ
ExcelTips Premium
Beauty Tips
Car Tips
Cleaning Tips
College Tips
Cooking Tips
Excel2007 Tips
ExcelTips
Family Tips
Gardening Tips
Health Tips
Home Tips
Money Tips
Pet Tips
Word2007 Tips
WordTips
Advertise on the
ExcelTips Site