Overview

Dataset statistics

Number of variables: 3
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 912.0 B
Average record size in memory: 30.4 B
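The average record size and the missing-cell percentage are simple ratios of the figures above. A minimal Python check, using only numbers copied from this report (the variable names are illustrative, not part of ydata-profiling):

```python
# Figures copied from the dataset statistics above.
total_size_bytes = 912.0
n_observations = 30
n_variables = 3
missing_cells = 0

# Average record size = total size in memory / number of rows.
avg_record_size = total_size_bytes / n_observations
print(f"{avg_record_size:.1f} B")  # 30.4 B

# Missing cells (%) is taken over all cells (rows x columns).
n_cells = n_observations * n_variables
missing_pct = missing_cells / n_cells
print(f"{missing_pct:.1%}")  # 0.0%
```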

Variable types

Categorical: 1
Numeric: 2

Dataset

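Description: Licensed real estate agent qualification status by exam round, Jeollanam-do (전라남도 시험 회차별 공인중개사 자격현황)
Author: Ministry of Land, Infrastructure and Transport (국토교통부)
URL: https://www.data.go.kr/data/15063467/fileData.do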

Alerts

시도명 has constant value "전라남도" (Constant)
시험회차 is highly overall correlated with 관리대상자수 (High correlation)
관리대상자수 is highly overall correlated with 시험회차 (High correlation)
시험회차 has unique values (Unique)
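The Constant and Unique alerts come down to counting distinct values per column. The sketch below applies that idea to rows 0-9 from the Sample section. The function `column_flags` and its rules are a simplified illustration, not ydata-profiling's implementation. Note that in this 10-row excerpt 관리대상자수 also comes out as unique, whereas the full 30-row data has 29 distinct values (171 appears twice).

```python
# Rows 0-9 from the Sample section: (시도명, 시험회차, 관리대상자수).
rows = [
    ("전라남도", 1, 1003), ("전라남도", 2, 101), ("전라남도", 3, 21),
    ("전라남도", 4, 110), ("전라남도", 5, 76), ("전라남도", 6, 14),
    ("전라남도", 7, 22), ("전라남도", 8, 12), ("전라남도", 9, 7),
    ("전라남도", 10, 49),
]
columns = ["시도명", "시험회차", "관리대상자수"]


def column_flags(rows, columns):
    """Label each column 'constant', 'unique', or None from its distinct count."""
    flags = {}
    for i, name in enumerate(columns):
        values = [row[i] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            flags[name] = "constant"   # a single value in every row
        elif distinct == len(values):
            flags[name] = "unique"     # no value repeats
        else:
            flags[name] = None
    return flags


flags = column_flags(rows, columns)
for name, flag in flags.items():
    print(name, flag)
```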

Reproduction

Analysis started: 2023-12-12 14:12:22.522387
Analysis finished: 2023-12-12 14:12:23.185437
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명 (province name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
전라남도: 30

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전라남도
2nd row: 전라남도
3rd row: 전라남도
4th row: 전라남도
5th row: 전라남도

Common Values

Value     Count  Frequency (%)
전라남도  30     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value     Count  Frequency (%)
전라남도  30     100.0%

시험회차 (exam round)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.5
Minimum: 1
Maximum: 30
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.45
Q1: 8.25
Median: 15.5
Q3: 22.75
95-th percentile: 28.55
Maximum: 30
Range: 29
Interquartile range (IQR): 14.5
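The fractional percentiles above are consistent with linear interpolation between neighbouring order statistics. A minimal sketch, assuming 시험회차 takes each integer from 1 to 30 exactly once (inferred from the reported minimum, maximum, distinct count, and sum, not stated outright in the report). The helper `quantile` is illustrative and is not the profiler's own code.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


rounds = list(range(1, 31))  # assumption: exam rounds 1..30, one row each

q05 = quantile(rounds, 0.05)
q1 = quantile(rounds, 0.25)
median = quantile(rounds, 0.50)
q3 = quantile(rounds, 0.75)
q95 = quantile(rounds, 0.95)

print(round(q05, 2), q1, median, q3, round(q95, 2))  # 2.45 8.25 15.5 22.75 28.55
print("IQR:", q3 - q1)                               # IQR: 14.5
```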

Descriptive statistics

Standard deviation: 8.8034084
Coefficient of variation (CV): 0.56796183
Kurtosis: -1.2
Mean: 15.5
Median Absolute Deviation (MAD): 7.5
Skewness: 0
Sum: 465
Variance: 77.5
Monotonicity: Strictly increasing
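Most of these figures can be reproduced with Python's standard `statistics` module. The sketch again assumes that 시험회차 is the integer sequence 1..30 (an inference from the report, not something it states). Variance and standard deviation use the sample (n - 1) form, which is what matches the reported 77.5.

```python
import statistics

rounds = list(range(1, 31))  # assumption: exam rounds 1..30, one row each

total = sum(rounds)
mean = statistics.mean(rounds)
variance = statistics.variance(rounds)   # sample variance, divides by n - 1
std = statistics.stdev(rounds)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(rounds)
mad = statistics.median(abs(x - median) for x in rounds)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(rounds, rounds[1:]))

print(total, mean, variance)        # 465 15.5 77.5
print(round(std, 7), round(cv, 8))
print(mad, strictly_increasing)     # 7.5 True
```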
Histogram with fixed size bins (bins=30)

Common Values

Value              Count  Frequency (%)
1                  1      3.3%
17                 1      3.3%
30                 1      3.3%
29                 1      3.3%
28                 1      3.3%
27                 1      3.3%
26                 1      3.3%
25                 1      3.3%
24                 1      3.3%
23                 1      3.3%
Other values (20)  20     66.7%

Minimum 10 values

Value  Count  Frequency (%)
1      1      3.3%
2      1      3.3%
3      1      3.3%
4      1      3.3%
5      1      3.3%
6      1      3.3%
7      1      3.3%
8      1      3.3%
9      1      3.3%
10     1      3.3%

Maximum 10 values

Value  Count  Frequency (%)
30     1      3.3%
29     1      3.3%
28     1      3.3%
27     1      3.3%
26     1      3.3%
25     1      3.3%
24     1      3.3%
23     1      3.3%
22     1      3.3%
21     1      3.3%

관리대상자수 (number of persons subject to management)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 29
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 189.56667
Minimum: 7
Maximum: 1003
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 7
5-th percentile: 12.9
Q1: 61.75
Median: 171
Q3: 208
95-th percentile: 496.1
Maximum: 1003
Range: 996
Interquartile range (IQR): 146.25

Descriptive statistics

Standard deviation: 201.06607
Coefficient of variation (CV): 1.0606615
Kurtosis: 8.7700368
Mean: 189.56667
Median Absolute Deviation (MAD): 92.5
Skewness: 2.5925067
Sum: 5687
Variance: 40427.564
Monotonicity: Not monotonic
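The full 관리대상자수 column is not reproduced in this report, so the statistics cannot be recomputed from scratch here. They can still be cross-checked against each other, since several are defined in terms of the others. A small sketch using only the reported figures (the dictionary keys are descriptive labels, not profiler output):

```python
import math

# Figures copied from the 관리대상자수 statistics above.
n = 30
total = 5687
mean = 189.56667
std = 201.06607
variance = 40427.564
cv = 1.0606615
minimum, maximum, value_range = 7, 1003, 996
q1, q3, iqr = 61.75, 208, 146.25

# Each entry is True when the reported figures agree with the definition.
checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range = max - min": maximum - minimum == value_range,
    "iqr = q3 - q1": q3 - q1 == iqr,
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

The mean (189.57) sits above the median (171) and the skewness is positive, which fits a column dominated by one large value (1003 in round 1).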
Histogram with fixed size bins (bins=29)

Common Values

Value              Count  Frequency (%)
171                2      6.7%
1003               1      3.3%
199                1      3.3%
551                1      3.3%
284                1      3.3%
429                1      3.3%
400                1      3.3%
261                1      3.3%
167                1      3.3%
197                1      3.3%
Other values (19)  19     63.3%

Minimum 10 values

Value  Count  Frequency (%)
7      1      3.3%
12     1      3.3%
14     1      3.3%
21     1      3.3%
22     1      3.3%
49     1      3.3%
55     1      3.3%
61     1      3.3%
64     1      3.3%
76     1      3.3%

Maximum 10 values

Value  Count  Frequency (%)
1003   1      3.3%
551    1      3.3%
429    1      3.3%
400    1      3.3%
284    1      3.3%
261    1      3.3%
247    1      3.3%
211    1      3.3%
199    1      3.3%
197    1      3.3%

Interactions

[Figures: four scatter plots, one for each pairing of 시험회차 and 관리대상자수]

Correlations

[Figure: correlation heatmap]

              시험회차  관리대상자수
시험회차      1.000     0.487
관리대상자수  0.487     1.000

[Figure: correlation heatmap]

              시험회차  관리대상자수
시험회차      1.000     0.660
관리대상자수  0.660     1.000
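The extracted text does not say which correlation coefficient each matrix holds. As a generic illustration of how a rank correlation between the two numeric columns could be computed, the sketch below implements Spearman's coefficient (Pearson correlation of average ranks) and applies it to the 20 rows visible in the Sample section. Because rows 10-19 are missing from that excerpt, the result is not expected to match either matrix. All function names are illustrative.

```python
import math


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Rows 0-9 and 20-29 from the Sample section (rows 10-19 are not shown there).
rounds = list(range(1, 11)) + list(range(21, 31))
counts = [1003, 101, 21, 110, 76, 14, 22, 12, 7, 49,
          172, 179, 211, 197, 167, 261, 400, 429, 284, 551]

rho = spearman(rounds, counts)
print(f"Spearman rho on the 20 visible rows: {rho:.3f}")
```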

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    시도명    시험회차  관리대상자수
0   전라남도  1         1003
1   전라남도  2         101
2   전라남도  3         21
3   전라남도  4         110
4   전라남도  5         76
5   전라남도  6         14
6   전라남도  7         22
7   전라남도  8         12
8   전라남도  9         7
9   전라남도  10        49

Last rows

    시도명    시험회차  관리대상자수
20  전라남도  21        172
21  전라남도  22        179
22  전라남도  23        211
23  전라남도  24        197
24  전라남도  25        167
25  전라남도  26        261
26  전라남도  27        400
27  전라남도  28        429
28  전라남도  29        284
29  전라남도  30        551