Overview

Dataset statistics

Number of variables: 4
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 38.4 B

Variable types

Categorical: 3
Numeric: 1

Dataset

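Description: Industrial structure status. The fields are base year (기준연도), province name (시도명), Korean Standard Industrial Classification name (한국표준산업분류명), and ratio (비율).
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=151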

Alerts

비율 is highly overall correlated with 한국표준산업분류명 (High correlation)
한국표준산업분류명 is highly overall correlated with 비율 (High correlation)

Reproduction

Analysis started: 2024-01-09 22:18:05.196513
Analysis finished: 2024-01-09 22:18:05.558332
Duration: 0.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기준연도 (base year)
Categorical

Distinct: 3
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value | Count | Frequency (%)
2020 | 10 | 33.3%
2019 | 10 | 33.3%
2018 | 10 | 33.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시도명 (province name)
Categorical

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충남
2nd row: 충남
3rd row: 충남
4th row: 충남
5th row: 충남

Common Values

Value | Count | Frequency (%)
충남 (Chungnam, South Chungcheong Province) | 15 | 50.0%
전국 (nationwide) | 15 | 50.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

한국표준산업분류명 (Korean Standard Industrial Classification name)
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 7
Median length: 6
Mean length: 4.8
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 농림어업
2nd row: 광업및제조업
3rd row: 전기가스증기업
4th row: 건설업
5th row: 서비스업

Common Values

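Value | Count | Frequency (%)
농림어업 (agriculture, forestry and fishing) | 6 | 20.0%
광업및제조업 (mining and manufacturing) | 6 | 20.0%
전기가스증기업 (electricity, gas and steam supply) | 6 | 20.0%
건설업 (construction) | 6 | 20.0%
서비스업 (services) | 6 | 20.0%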

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

비율 (ratio)
Real number (ℝ)

HIGH CORRELATION

Distinct: 25
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.993333
Minimum: 1.4
Maximum: 63.2
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1.4
5-th percentile: 1.535
Q1: 3.75
median: 5.95
Q3: 34.225
95-th percentile: 62.37
Maximum: 63.2
Range: 61.8
Interquartile range (IQR): 30.475
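
The Range and IQR entries follow directly from the other quantiles. A minimal sketch that recomputes them from the values reported in this block (the variable names are illustrative, not part of the report):

```python
# Values copied from the "Quantile statistics" block of the 비율 variable.
minimum, q1, q3, maximum = 1.4, 3.75, 34.225, 63.2

# Range is the spread between the extremes.
value_range = maximum - minimum

# Interquartile range is the spread of the middle 50% of the values.
iqr = q3 - q1

print(round(value_range, 3), round(iqr, 3))
```

Both results agree with the reported Range (61.8) and IQR (30.475) up to floating-point rounding.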

Descriptive statistics

Standard deviation: 22.07962
Coefficient of variation (CV): 1.1043491
Kurtosis: -0.72155877
Mean: 19.993333
Median Absolute Deviation (MAD): 4.2
Skewness: 0.91021295
Sum: 599.8
Variance: 487.50961
Monotonicity: Not monotonic
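
Several of these figures are derived from one another. The sketch below rechecks the mean, variance, and coefficient of variation from the reported sum, standard deviation, and the 30 observations stated in the overview. It only uses the rounded values printed in the report, so agreement is expected to about four decimal places, not exactly.

```python
# Values copied from the report; n comes from "Number of observations".
n = 30
total = 599.8            # Sum
std = 22.07962           # Standard deviation

mean = total / n         # should match Mean (19.993333)
variance = std ** 2      # should match Variance (487.50961)
cv = std / mean          # should match Coefficient of variation (1.1043491)

print(f"mean={mean:.6f} variance={variance:.4f} cv={cv:.6f}")
```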
[Figure: Histogram with fixed size bins (bins=25)]

Common Values

Value | Count | Frequency (%)
1.9 | 2 | 6.7%
4.0 | 2 | 6.7%
1.4 | 2 | 6.7%
5.7 | 2 | 6.7%
6.0 | 2 | 6.7%
27.2 | 1 | 3.3%
61.6 | 1 | 3.3%
5.9 | 1 | 3.3%
29.2 | 1 | 3.3%
63.0 | 1 | 3.3%
Other values (15) | 15 | 50.0%

Minimum 10 values

Value | Count | Frequency (%)
1.4 | 2 | 6.7%
1.7 | 1 | 3.3%
1.8 | 1 | 3.3%
1.9 | 2 | 6.7%
3.5 | 1 | 3.3%
3.7 | 1 | 3.3%
3.9 | 1 | 3.3%
4.0 | 2 | 6.7%
4.1 | 1 | 3.3%
5.7 | 2 | 6.7%

Maximum 10 values

Value | Count | Frequency (%)
63.2 | 1 | 3.3%
63.0 | 1 | 3.3%
61.6 | 1 | 3.3%
54.3 | 1 | 3.3%
51.9 | 1 | 3.3%
50.6 | 1 | 3.3%
35.0 | 1 | 3.3%
34.8 | 1 | 3.3%
32.5 | 1 | 3.3%
29.2 | 1 | 3.3%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]
Variable | 기준연도 | 시도명 | 한국표준산업분류명 | 비율
기준연도 | 1.000 | 0.000 | 0.000 | 0.000
시도명 | 0.000 | 1.000 | 0.000 | 0.717
한국표준산업분류명 | 0.000 | 0.000 | 1.000 | 0.757
비율 | 0.000 | 0.717 | 0.757 | 1.000

[Figure: correlation heatmap]
Variable | 한국표준산업분류명 | 시도명 | 기준연도
한국표준산업분류명 | 1.000 | 0.000 | 0.000
시도명 | 0.000 | 1.000 | 0.000
기준연도 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]
Variable | 비율 | 기준연도 | 시도명 | 한국표준산업분류명
비율 | 1.000 | 0.000 | 0.486 | 0.616
기준연도 | 0.000 | 1.000 | 0.000 | 0.000
시도명 | 0.486 | 0.000 | 1.000 | 0.000
한국표준산업분류명 | 0.616 | 0.000 | 0.000 | 1.000
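
The 0.000 entries between the three categorical variables are what a chi-square-based association measure gives for a fully balanced design. The report text does not name the coefficient behind each matrix, so the sketch below assumes Cramér's V without bias correction, with `cramers_v` as a hypothetical helper. It also assumes that the ten rows not shown in the Sample section complete the 3 × 2 × 5 grid, so that each year and region pair occurs 5 times.

```python
import math


def cramers_v(table):
    """Cramér's V (no bias correction) for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-square statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(table), len(table[0])) - 1
    return math.sqrt(chi2 / (n * k))


# Assumed cross-tab of 기준연도 (3 years) by 시도명 (2 regions): 5 rows per cell.
year_by_region = [[5, 5], [5, 5], [5, 5]]
balanced = cramers_v(year_by_region)

# For contrast, a table where one variable fully determines the other.
dependent = cramers_v([[5, 0], [0, 5]])

print(balanced, dependent)
```

When every cell count equals its expected count, the chi-square statistic is zero, so the coefficient is zero regardless of how the remaining columns behave.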

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

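First rows

Row | 기준연도 | 시도명 | 한국표준산업분류명 | 비율
0 | 2020 | 충남 | 농림어업 | 4.1
1 | 2020 | 충남 | 광업및제조업 | 50.6
2 | 2020 | 충남 | 전기가스증기업 | 4.0
3 | 2020 | 충남 | 건설업 | 6.2
4 | 2020 | 충남 | 서비스업 | 35.0
5 | 2019 | 충남 | 농림어업 | 3.9
6 | 2019 | 충남 | 광업및제조업 | 51.9
7 | 2019 | 충남 | 전기가스증기업 | 3.7
8 | 2019 | 충남 | 건설업 | 5.7
9 | 2019 | 충남 | 서비스업 | 34.8

Last rows

Row | 기준연도 | 시도명 | 한국표준산업분류명 | 비율
20 | 2019 | 전국 | 농림어업 | 1.8
21 | 2019 | 전국 | 광업및제조업 | 27.7
22 | 2019 | 전국 | 전기가스증기업 | 1.4
23 | 2019 | 전국 | 건설업 | 6.0
24 | 2019 | 전국 | 서비스업 | 63.0
25 | 2018 | 전국 | 농림어업 | 1.9
26 | 2018 | 전국 | 광업및제조업 | 29.2
27 | 2018 | 전국 | 전기가스증기업 | 1.4
28 | 2018 | 전국 | 건설업 | 5.9
29 | 2018 | 전국 | 서비스업 | 61.6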
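
A quick way to sanity-check these rows is to sum 비율 within each year and region. The sketch below assumes 비율 is a percentage share of the five industry groups, which the report does not state explicitly. It uses only the 20 rows shown here, since rows 10 to 19 are not part of the sample.

```python
from collections import defaultdict

# The 20 rows shown in the Sample section: (기준연도, 시도명, 한국표준산업분류명, 비율).
rows = [
    ("2020", "충남", "농림어업", 4.1), ("2020", "충남", "광업및제조업", 50.6),
    ("2020", "충남", "전기가스증기업", 4.0), ("2020", "충남", "건설업", 6.2),
    ("2020", "충남", "서비스업", 35.0),
    ("2019", "충남", "농림어업", 3.9), ("2019", "충남", "광업및제조업", 51.9),
    ("2019", "충남", "전기가스증기업", 3.7), ("2019", "충남", "건설업", 5.7),
    ("2019", "충남", "서비스업", 34.8),
    ("2019", "전국", "농림어업", 1.8), ("2019", "전국", "광업및제조업", 27.7),
    ("2019", "전국", "전기가스증기업", 1.4), ("2019", "전국", "건설업", 6.0),
    ("2019", "전국", "서비스업", 63.0),
    ("2018", "전국", "농림어업", 1.9), ("2018", "전국", "광업및제조업", 29.2),
    ("2018", "전국", "전기가스증기업", 1.4), ("2018", "전국", "건설업", 5.9),
    ("2018", "전국", "서비스업", 61.6),
]

# Accumulate the shares per (year, region) group.
sums = defaultdict(float)
for year, region, _industry, share in rows:
    sums[(year, region)] += share

# Round to one decimal, the precision of the source values.
totals = {key: round(value, 1) for key, value in sums.items()}

for (year, region), value in totals.items():
    print(year, region, value)
```

Each group totals 99.9 or 100.0, which is consistent with shares that were rounded to one decimal place.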