How to compute descriptive statistics in Linux using the Gnumeric spreadsheet statistics tool




What is Gnumeric Spreadsheet Descriptive statistics

Descriptive statistics summarize a data set and describe its characteristics using statistical parameters such as mean, standard error, median, mode, standard deviation, sample variance, kurtosis, skewness, range, minimum, maximum, sum, and count.

To compute descriptive statistics, we need to find the following parameters from the input dataset:

  • Mean: Sum up all the scores and divide by the number of scores.
  • Median: Arrange the scores in ascending order and find the middle value (for an even number of scores, take the average of the two middle values).
  • Mode: Identify the score that appears most frequently.
  • Range: Calculate the difference between the highest and lowest scores.
  • Variance: Calculate the average of the squared differences from the mean. The sample variance divides the sum of squared differences by n - 1 rather than n.
  • Standard Deviation: Take the square root of the variance.
  • Standard error: Divide the standard deviation by the square root of the number of values. It estimates how far the sample mean is likely to be from the population mean.
  • Kurtosis: Describes the degree to which the scores concentrate in the tails or the peak of a frequency distribution.
  • Skewness: Measures the asymmetry of the dataset or distribution.
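To make the definitions above concrete, here is a minimal Python sketch that computes the same parameters with the standard library. The function name `describe` and the sample scores are hypothetical, chosen only for illustration. The skewness and kurtosis formulas are the common sample-adjusted versions used by spreadsheet functions such as SKEW and KURT; it is an assumption that Gnumeric's tool reports exactly these variants.

```python
import math
import statistics


def describe(values):
    """Return common descriptive statistics for a list of numbers.

    Assumes at least 4 values, because the sample kurtosis formula
    used here is undefined for fewer.
    """
    n = len(values)
    if n < 4:
        raise ValueError("need at least 4 values")

    mean = statistics.fmean(values)
    s = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    if s == 0:
        raise ValueError("all values are identical; skewness and kurtosis are undefined")

    # Standardized deviations, used by the skewness and kurtosis formulas.
    z = [(x - mean) / s for x in values]

    # Sample-adjusted skewness (assumed to match spreadsheet SKEW).
    skewness = n / ((n - 1) * (n - 2)) * sum(v ** 3 for v in z)

    # Sample-adjusted excess kurtosis (assumed to match spreadsheet KURT).
    kurtosis = (
        n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum(v ** 4 for v in z)
        - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3))
    )

    return {
        "mean": mean,
        "standard_error": s / math.sqrt(n),
        "median": statistics.median(values),
        "mode": statistics.mode(values),  # first most frequent value
        "standard_deviation": s,
        "sample_variance": statistics.variance(values),
        "kurtosis": kurtosis,
        "skewness": skewness,
        "range": max(values) - min(values),
        "minimum": min(values),
        "maximum": max(values),
        "sum": sum(values),
        "count": n,
    }


if __name__ == "__main__":
    # Hypothetical scores used only to exercise the function.
    for name, value in describe([2, 4, 4, 4, 5, 5, 7, 9]).items():
        print(f"{name}: {value}")
```

Comparing the printed values with the Gnumeric output for the same column is a quick way to check which variance and kurtosis conventions the tool uses.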

About Gnumeric:

Gnumeric is a free spreadsheet application that is part of the GNOME project. It is open-source software released under the GNU GPL license, and it fills a role similar to Microsoft Excel.
In Gnumeric, we can apply various mathematical operations to spreadsheets. While working with the Gnumeric application, I noticed that we can also compute descriptive statistics on spreadsheet data in a few simple steps, without writing any complex formulas. We only need to create the dataset to work with in the Gnumeric spreadsheet.

How to install Gnumeric on Linux:

Open a terminal and run the following commands (these apply to Debian- or Ubuntu-based distributions that use apt):
sudo apt-get update
sudo apt-get install gnumeric

The Gnumeric descriptive statistics tool provides four kinds of statistical output:

  • Summary Statistics: Calculates the mean, standard error, median, mode, standard deviation, sample variance, kurtosis, skewness, range, minimum, maximum, sum, and count for each variable.
  • Confidence Interval for the Mean: Calculates a confidence interval for the population mean of each variable.
  • Kth Largest: Finds the kth largest value of each variable.
  • Kth Smallest: Finds the kth smallest value of each variable.
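As a rough illustration of the last three outputs, the following Python sketch computes the kth largest and kth smallest values and the half-width of a confidence interval for the mean. All function names are hypothetical. The confidence interval uses a normal approximation so that the example needs only the standard library; it is an assumption that this is close to what Gnumeric reports, and a spreadsheet tool may well use the Student t distribution instead, which gives a wider interval for small samples.

```python
import math
import statistics


def kth_largest(values, k):
    """Return the k-th largest value, where k = 1 is the maximum."""
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and the number of values")
    return sorted(values, reverse=True)[k - 1]


def kth_smallest(values, k):
    """Return the k-th smallest value, where k = 1 is the minimum."""
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and the number of values")
    return sorted(values)[k - 1]


def mean_confidence_halfwidth(values, level=0.95):
    """Half-width of a confidence interval for the mean.

    Uses a normal approximation (an assumption made for this sketch):
    half-width = z * s / sqrt(n), with s the sample standard deviation.
    """
    z = statistics.NormalDist().inv_cdf(0.5 + level / 2)
    return z * statistics.stdev(values) / math.sqrt(len(values))


if __name__ == "__main__":
    data = [4, 8, 15, 16, 23, 42]  # hypothetical values
    mean = statistics.fmean(data)
    half = mean_confidence_halfwidth(data)
    print("2nd largest:", kth_largest(data, 2))
    print("2nd smallest:", kth_smallest(data, 2))
    print("95% interval for the mean:", (mean - half, mean + half))
```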

Steps to calculate descriptive statistics on the input data in Gnumeric spreadsheet:

Open Gnumeric from the Office category of the applications menu


Prepare the dataset in the Gnumeric spreadsheet

Select the Descriptive Statistics option from the menu

From the menu bar, navigate to "Statistics -> Descriptive Statistics -> Descriptive Statistics."

Descriptive statistics tool operation window

Select your input data range (select the columns whose values you want to use as input for the descriptive statistics).

Note: You can use the input range selector to pick the input range. The first row contains the labels for the dataset, and the remaining rows are the input values.

Select the statistics options you need: summary statistics, confidence interval, kth smallest/largest
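The data layout described in the note, with labels in the first row and one variable per column, can be sketched in Python as follows. The sample data and the function name `columns_from_csv` are hypothetical; the sketch only shows how such a range splits into labeled variables, and does not reflect how Gnumeric reads the range internally.

```python
import csv
import io
import statistics

# Hypothetical dataset: the first row holds the labels,
# the remaining rows hold the input values.
SAMPLE = """height,weight
150,52
160,61
170,70
180,79
"""


def columns_from_csv(text):
    """Split CSV text into {label: [values]} using the first row as labels."""
    rows = list(csv.reader(io.StringIO(text)))
    labels, data = rows[0], rows[1:]
    return {
        label: [float(row[i]) for row in data]
        for i, label in enumerate(labels)
    }


if __name__ == "__main__":
    # Each column is treated as a separate variable, as in the tool's output.
    for label, values in columns_from_csv(SAMPLE).items():
        print(label, statistics.fmean(values), statistics.median(values))
```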

Select where the output should be displayed in the Descriptive Statistics dialog

New Sheet: Creates the result on a new sheet
New Workbook: Displays the result in a new Gnumeric workbook
Output Range: Displays the result on the existing data sheet in the cell range you provide
Click OK to generate the descriptive statistics result for the selected input data; with the New Sheet option, the result appears on a new sheet.

Output result sheet

The intended audience for descriptive statistics

The intended audience for descriptive statistics is broad and spans multiple disciplines.
  1. Business and Finance Professionals
  2. Scientists and Researchers
  3. Engineers and Technologists
  4. Educators and Students
  5. Health and Life Sciences Professionals
  6. Economists and Policymakers
  7. Artificial Intelligence and Robotics Experts
  8. Marketing and Advertising Specialists
  9. Legal and Forensic Experts
The Gnumeric descriptive statistics tool offers a simple way to calculate descriptive statistics for large datasets stored in spreadsheet format, without writing formulas by hand.
