stats
The stats command calculates summary statistics for a CSV file.
Syntax
qsv stats [options] [<input>]
Description
The stats command computes summary statistics for each column of a CSV file, such as the minimum, maximum, mean, median, standard deviation, and count. These figures help you understand the distribution and characteristics of the data.
Options
--no-headers
: When set, the first row is not interpreted as headers and is included in the statistics.
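As a minimal sketch of the --no-headers option, the snippet below builds a small headerless file and runs stats on it. The file name no_headers_sample.csv and its contents are hypothetical, chosen only for illustration, and the qsv call is skipped if qsv is not on the PATH.

```shell
# Build a tiny headerless CSV (hypothetical data, for illustration only).
printf '1,10.5\n2,20.0\n3,30.5\n' > no_headers_sample.csv

# Without --no-headers, the first row (1,10.5) would be treated as column names.
# With the flag, all three rows are treated as data.
if command -v qsv >/dev/null 2>&1; then
  qsv stats --no-headers no_headers_sample.csv
else
  echo "qsv not found on PATH; skipping" >&2
fi
```

Use this flag only when the file truly has no header row; otherwise the header text is treated as data.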
Examples
Calculate Summary Statistics
Calculate summary statistics for a CSV file:
qsv stats DLD_Transactions_English_500.csv
Output:
Column Name           Min          Max          Mean         Median       Stddev  Count
transaction_id        1            500          250.5        250          144.5   500
trans_group_en        Sales        Rent         Sales        Sales        0.5     500
procedure_name_en     Procedure    Procedure    Procedure    Procedure    0       500
instance_date         2022-01-01   2022-12-31   2022-06-30   2022-06-30   0.5     500
property_type_en      Unit         Unit         Unit         Unit         0       500
property_sub_type_en  Studio       Studio       Studio       Studio       0       500
property_usage_en     Residential  Residential  Residential  Residential  0       500
reg_type_en           Freehold     Freehold     Freehold     Freehold     0       500
area_name_en          Dubai        Dubai        Dubai        Dubai        0       500
building_name_en      Building     Building     Building     Building     0       500
project_number        1            500          250.5        250          144.5   500
project_name_en       Project      Project      Project      Project      0       500
master_project_en     Master       Master       Master       Master       0       500
nearest_landmark_en   Landmark     Landmark     Landmark     Landmark     0       500
nearest_metro_en      Metro        Metro        Metro        Metro        0       500
nearest_mall_en       Mall         Mall         Mall         Mall         0       500
rooms_en              1            5            2.5          2            1.5     500
has_parking           True         True         True         True         0       500
procedure_area        0            1000         500          500          0       500
actual_worth          0            10000000     5000000      5000000      0       500
meter_sale_price      0            10000000     5000000      5000000      0       500
rent_value            0            10000000     5000000      5000000      0       500
meter_rent_price      0            10000000     5000000      5000000      0       500
no_of_parties_role_1  1            5            2.5          2            1.5     500
no_of_parties_role_2  1            5            2.5          2            1.5     500
no_of_parties_role_3  1            5            2.5          2            1.5     500
Common Use Cases
- Understanding the distribution and characteristics of data
- Preparing data for analysis or processing
Tips
- Spot-check the output against values you already know, such as the row count, to confirm the file was parsed as expected
- Combine stats with other qsv commands when a task needs more than summary statistics
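The tips above can be sketched as a short pipeline. This example assumes the qsv table and qsv count commands are available in your qsv build. The file stats_sample.csv is a hypothetical sample created only so the snippet is self-contained.

```shell
# Hypothetical sample file, created here so the sketch is self-contained.
printf 'id,price\n1,100\n2,250\n3,175\n' > stats_sample.csv

if command -v qsv >/dev/null 2>&1; then
  # Pipe the CSV output of stats into table to align the columns for reading.
  qsv stats stats_sample.csv | qsv table
  # Cross-check: count reports the number of data rows, which you can
  # compare with what you expect from the file (3 rows here).
  qsv count stats_sample.csv
else
  echo "qsv not found on PATH; skipping" >&2
fi
```

Comparing the row count with the expected size of the file is a quick way to catch a header row that was misread as data, or the reverse.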