Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory52.3 B

Variable types

Categorical4
Numeric2

Alerts

ctprvn_cd is highly overall correlated with signgu_cd and 2 other fieldsHigh correlation
ctprvn_nm is highly overall correlated with signgu_cd and 2 other fieldsHigh correlation
signgu_cd is highly overall correlated with ctprvn_cd and 2 other fieldsHigh correlation
signgu_nm is highly overall correlated with signgu_cd and 2 other fieldsHigh correlation
ctprvn_cd is highly imbalanced (69.8%)Imbalance
ctprvn_nm is highly imbalanced (69.8%)Imbalance

Reproduction

Analysis started2023-12-10 10:02:26.919456
Analysis finished2023-12-10 10:02:28.629017
Duration1.71 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

cl
Categorical

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
조각
28 
회화
25 
기타
14 
미디어
11 
공예
Other values (4)
14 

Length

Max length3
Median length2
Mean length2.15
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row미디어
2nd row분수대
3rd row분수대
4th row조각
5th row회화

Common Values

ValueCountFrequency (%)
조각 28
28.0%
회화 25
25.0%
기타 14
14.0%
미디어 11
 
11.0%
공예 8
 
8.0%
벽화 6
 
6.0%
분수대 4
 
4.0%
사진 3
 
3.0%
서예 1
 
1.0%

Length

2023-12-10T19:02:28.749054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:28.938823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조각 28
28.0%
회화 25
25.0%
기타 14
14.0%
미디어 11
 
11.0%
공예 8
 
8.0%
벽화 6
 
6.0%
분수대 4
 
4.0%
사진 3
 
3.0%
서예 1
 
1.0%

ctprvn_cd
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
11
92 
21
 
5
39
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row11
2nd row39
3rd row11
4th row11
5th row11

Common Values

ValueCountFrequency (%)
11 92
92.0%
21 5
 
5.0%
39 3
 
3.0%

Length

2023-12-10T19:02:29.159830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:29.316295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
11 92
92.0%
21 5
 
5.0%
39 3
 
3.0%

ctprvn_nm
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
92 
부산광역시
 
5
제주특별자치도
 
3

Length

Max length7
Median length5
Mean length5.06
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row제주특별자치도
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 92
92.0%
부산광역시 5
 
5.0%
제주특별자치도 3
 
3.0%

Length

2023-12-10T19:02:29.516487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:29.733761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 92
92.0%
부산광역시 5
 
5.0%
제주특별자치도 3
 
3.0%

signgu_cd
Real number (ℝ)

HIGH CORRELATION 

Distinct29
Distinct (%)29.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12468.5
Minimum11010
Maximum39020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:02:29.904405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11010
5-th percentile11010
Q111070
median11160
Q311222.5
95-th percentile21020.5
Maximum39020
Range28010
Interquartile range (IQR)152.5

Descriptive statistics

Standard deviation5168.1238
Coefficient of variation (CV)0.41449443
Kurtosis19.744218
Mean12468.5
Median Absolute Deviation (MAD)70
Skewness4.4103128
Sum1246850
Variance26709504
MonotonicityNot monotonic
2023-12-10T19:02:30.107711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
11240 7
 
7.0%
11230 7
 
7.0%
11220 7
 
7.0%
11010 6
 
6.0%
11020 5
 
5.0%
11190 5
 
5.0%
11070 4
 
4.0%
11080 4
 
4.0%
11090 4
 
4.0%
11200 4
 
4.0%
Other values (19) 47
47.0%
ValueCountFrequency (%)
11010 6
6.0%
11020 5
5.0%
11030 3
3.0%
11040 2
 
2.0%
11050 3
3.0%
11060 3
3.0%
11070 4
4.0%
11080 4
4.0%
11090 4
4.0%
11100 2
 
2.0%
ValueCountFrequency (%)
39020 3
3.0%
21030 2
 
2.0%
21020 2
 
2.0%
21010 1
 
1.0%
11250 3
3.0%
11240 7
7.0%
11230 7
7.0%
11220 7
7.0%
11210 3
3.0%
11200 4
4.0%

signgu_nm
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
송파구
서초구
강남구
종로구
 
6
중구
 
6
Other values (23)
67 

Length

Max length4
Median length3
Mean length3.03
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row종로구
2nd row서귀포시
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
송파구 7
 
7.0%
서초구 7
 
7.0%
강남구 7
 
7.0%
종로구 6
 
6.0%
중구 6
 
6.0%
영등포구 5
 
5.0%
중랑구 4
 
4.0%
성북구 4
 
4.0%
강북구 4
 
4.0%
동작구 4
 
4.0%
Other values (18) 46
46.0%

Length

2023-12-10T19:02:30.349722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
송파구 7
 
7.0%
강남구 7
 
7.0%
서초구 7
 
7.0%
종로구 6
 
6.0%
중구 6
 
6.0%
영등포구 5
 
5.0%
중랑구 4
 
4.0%
성북구 4
 
4.0%
강북구 4
 
4.0%
동작구 4
 
4.0%
Other values (18) 46
46.0%

co
Real number (ℝ)

Distinct44
Distinct (%)44.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.16
Minimum1
Maximum191
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:02:30.570829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median6
Q352
95-th percentile122
Maximum191
Range190
Interquartile range (IQR)50

Descriptive statistics

Standard deviation42.933051
Coefficient of variation (CV)1.4723269
Kurtosis3.9383555
Mean29.16
Median Absolute Deviation (MAD)5
Skewness1.9846159
Sum2916
Variance1843.2469
MonotonicityNot monotonic
2023-12-10T19:02:30.802208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
1 21
21.0%
2 16
16.0%
8 6
 
6.0%
3 5
 
5.0%
4 5
 
5.0%
5 3
 
3.0%
122 2
 
2.0%
65 2
 
2.0%
11 2
 
2.0%
63 2
 
2.0%
Other values (34) 36
36.0%
ValueCountFrequency (%)
1 21
21.0%
2 16
16.0%
3 5
 
5.0%
4 5
 
5.0%
5 3
 
3.0%
7 1
 
1.0%
8 6
 
6.0%
10 1
 
1.0%
11 2
 
2.0%
15 1
 
1.0%
ValueCountFrequency (%)
191 1
1.0%
186 1
1.0%
176 1
1.0%
148 1
1.0%
122 2
2.0%
93 1
1.0%
92 1
1.0%
88 1
1.0%
86 1
1.0%
82 1
1.0%

Interactions

2023-12-10T19:02:27.906098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:27.584685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.054731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:27.741216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:02:30.979468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
clctprvn_cdctprvn_nmsigngu_cdsigngu_nmco
cl1.0000.0000.0000.0350.0000.415
ctprvn_cd0.0001.0001.0001.0000.9810.000
ctprvn_nm0.0001.0001.0001.0000.9810.000
signgu_cd0.0351.0001.0001.0000.9970.000
signgu_nm0.0000.9810.9810.9971.0000.000
co0.4150.0000.0000.0000.0001.000
2023-12-10T19:02:31.187732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
clctprvn_cdsigngu_nmctprvn_nm
cl1.0000.0000.0000.000
ctprvn_cd0.0001.0000.8081.000
signgu_nm0.0000.8081.0000.808
ctprvn_nm0.0001.0000.8081.000
2023-12-10T19:02:31.358895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
signgu_cdcoclctprvn_cdctprvn_nmsigngu_nm
signgu_cd1.0000.0330.0001.0001.0000.808
co0.0331.0000.1970.0000.0000.000
cl0.0000.1971.0000.0000.0000.000
ctprvn_cd1.0000.0000.0001.0001.0000.808
ctprvn_nm1.0000.0000.0001.0001.0000.808
signgu_nm0.8080.0000.0000.8080.8081.000

Missing values

2023-12-10T19:02:28.282378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:02:28.540310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

clctprvn_cdctprvn_nmsigngu_cdsigngu_nmco
0미디어11서울특별시11010종로구2
1분수대39제주특별자치도39020서귀포시1
2분수대11서울특별시11010종로구1
3조각11서울특별시11010종로구64
4회화11서울특별시11010종로구21
5공예11서울특별시11010종로구1
6기타11서울특별시11010종로구4
7회화39제주특별자치도39020서귀포시81
8회화11서울특별시11020중구25
9미디어11서울특별시11020중구5
clctprvn_cdctprvn_nmsigngu_cdsigngu_nmco
90미디어11서울특별시11240송파구1
91사진11서울특별시11240송파구2
92회화11서울특별시11250강동구11
93조각11서울특별시11250강동구65
94기타11서울특별시11250강동구1
95조각21부산광역시21010중구8
96조각21부산광역시21020서구31
97회화21부산광역시21020서구18
98조각21부산광역시21030동구55
99회화21부산광역시21030동구19