Overview

Dataset statistics

Number of variables7
Number of observations108
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory61.2 B

Variable types

Numeric2
Categorical4
Text1

Dataset

Description제설함, 염화칼슘 위치 및 현황입니다.
Author서울특별시 서대문구
URLhttps://www.data.go.kr/data/15066321/fileData.do

Alerts

모래주머니(대) 보유수량 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
제설함규격(FRP) is highly overall correlated with 연번 and 2 other fieldsHigh correlation
염화칼숨(대) 보유수량 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 제설함규격(FRP) and 2 other fieldsHigh correlation
모래주머니(대) 보유수량 is highly imbalanced (58.6%)Imbalance
연번 has unique valuesUnique
제설함위치(주소) has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:17:17.003681
Analysis finished2023-12-12 00:17:17.881250
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.5
Minimum1
Maximum108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T09:17:17.955866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.35
Q127.75
median54.5
Q381.25
95-th percentile102.65
Maximum108
Range107
Interquartile range (IQR)53.5

Descriptive statistics

Standard deviation31.32092
Coefficient of variation (CV)0.57469577
Kurtosis-1.2
Mean54.5
Median Absolute Deviation (MAD)27
Skewness0
Sum5886
Variance981
MonotonicityStrictly increasing
2023-12-12T09:17:18.100750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
70 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
Other values (98) 98
90.7%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 1
0.9%

제설함규격(FRP)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size996.0 B
63 
35 
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
63
58.3%
35
32.4%
9
 
8.3%
1
 
0.9%

Length

2023-12-12T09:17:18.210239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:17:18.295599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
63
58.3%
35
32.4%
9
 
8.3%
1
 
0.9%

제설함크기별 번호
Real number (ℝ)

Distinct63
Distinct (%)58.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.25
Minimum1
Maximum63
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T09:17:18.433819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.35
Q19.75
median23
Q336.25
95-th percentile57.65
Maximum63
Range62
Interquartile range (IQR)26.5

Descriptive statistics

Standard deviation17.555852
Coefficient of variation (CV)0.69528127
Kurtosis-0.82027132
Mean25.25
Median Absolute Deviation (MAD)13.5
Skewness0.48060984
Sum2727
Variance308.20794
MonotonicityNot monotonic
2023-12-12T09:17:18.574517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 3
 
2.8%
3 3
 
2.8%
4 3
 
2.8%
5 3
 
2.8%
6 3
 
2.8%
7 3
 
2.8%
8 3
 
2.8%
9 3
 
2.8%
2 3
 
2.8%
30 2
 
1.9%
Other values (53) 79
73.1%
ValueCountFrequency (%)
1 3
2.8%
2 3
2.8%
3 3
2.8%
4 3
2.8%
5 3
2.8%
6 3
2.8%
7 3
2.8%
8 3
2.8%
9 3
2.8%
10 2
1.9%
ValueCountFrequency (%)
63 1
0.9%
62 1
0.9%
61 1
0.9%
60 1
0.9%
59 1
0.9%
58 1
0.9%
57 1
0.9%
56 1
0.9%
55 1
0.9%
54 1
0.9%
Distinct23
Distinct (%)21.3%
Missing0
Missing (%)0.0%
Memory size996.0 B
연희동
17 
홍은2동
16 
홍은1동
북가좌1동
남가좌2동
Other values (18)
55 

Length

Max length5
Median length4
Mean length3.712963
Min length2

Unique

Unique6 ?
Unique (%)5.6%

Sample

1st row홍은1동
2nd row홍제동
3rd row홍은2동
4th row연희동
5th row연희동

Common Values

ValueCountFrequency (%)
연희동 17
15.7%
홍은2동 16
14.8%
홍은1동 8
 
7.4%
북가좌1동 6
 
5.6%
남가좌2동 6
 
5.6%
홍제1동 6
 
5.6%
현저동 5
 
4.6%
북아현동 5
 
4.6%
홍제2동 5
 
4.6%
충현동 5
 
4.6%
Other values (13) 29
26.9%

Length

2023-12-12T09:17:18.725035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
연희동 17
15.7%
홍은2동 16
14.8%
홍은1동 8
 
7.4%
북가좌1동 6
 
5.6%
남가좌2동 6
 
5.6%
홍제1동 6
 
5.6%
현저동 5
 
4.6%
북아현동 5
 
4.6%
홍제2동 5
 
4.6%
충현동 5
 
4.6%
Other values (13) 29
26.9%
Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size996.0 B
2023-12-12T09:17:19.118176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length8.3518519
Min length6

Characters and Unicode

Total characters902
Distinct characters85
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)100.0%

Sample

1st row홍은동 440-5
2nd row세검정1길 95
3rd row백련사길 179
4th row연희로32길 19
5th row연희로11마길 85
ValueCountFrequency (%)
통일로 9
 
4.3%
신촌로 8
 
3.8%
홍은동 5
 
2.4%
연희로 5
 
2.4%
홍제동 5
 
2.4%
북가좌동 4
 
1.9%
증가로 4
 
1.9%
성산로 3
 
1.4%
연희동 3
 
1.4%
응암로 3
 
1.4%
Other values (136) 160
76.6%
2023-12-12T09:17:19.666162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
11.2%
1 69
 
7.6%
68
 
7.5%
2 53
 
5.9%
3 48
 
5.3%
4 33
 
3.7%
31
 
3.4%
- 31
 
3.4%
29
 
3.2%
9 29
 
3.2%
Other values (75) 410
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 404
44.8%
Decimal Number 351
38.9%
Space Separator 101
 
11.2%
Dash Punctuation 31
 
3.4%
Uppercase Letter 15
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
 
16.8%
31
 
7.7%
29
 
7.2%
18
 
4.5%
18
 
4.5%
14
 
3.5%
13
 
3.2%
11
 
2.7%
11
 
2.7%
9
 
2.2%
Other values (60) 182
45.0%
Decimal Number
ValueCountFrequency (%)
1 69
19.7%
2 53
15.1%
3 48
13.7%
4 33
9.4%
9 29
8.3%
0 28
8.0%
7 28
8.0%
6 23
 
6.6%
5 21
 
6.0%
8 19
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
D 5
33.3%
M 5
33.3%
C 5
33.3%
Space Separator
ValueCountFrequency (%)
101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 483
53.5%
Hangul 404
44.8%
Latin 15
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
 
16.8%
31
 
7.7%
29
 
7.2%
18
 
4.5%
18
 
4.5%
14
 
3.5%
13
 
3.2%
11
 
2.7%
11
 
2.7%
9
 
2.2%
Other values (60) 182
45.0%
Common
ValueCountFrequency (%)
101
20.9%
1 69
14.3%
2 53
11.0%
3 48
9.9%
4 33
 
6.8%
- 31
 
6.4%
9 29
 
6.0%
0 28
 
5.8%
7 28
 
5.8%
6 23
 
4.8%
Other values (2) 40
 
8.3%
Latin
ValueCountFrequency (%)
D 5
33.3%
M 5
33.3%
C 5
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 498
55.2%
Hangul 404
44.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
101
20.3%
1 69
13.9%
2 53
10.6%
3 48
9.6%
4 33
 
6.6%
- 31
 
6.2%
9 29
 
5.8%
0 28
 
5.6%
7 28
 
5.6%
6 23
 
4.6%
Other values (5) 55
11.0%
Hangul
ValueCountFrequency (%)
68
 
16.8%
31
 
7.7%
29
 
7.2%
18
 
4.5%
18
 
4.5%
14
 
3.5%
13
 
3.2%
11
 
2.7%
11
 
2.7%
9
 
2.2%
Other values (60) 182
45.0%

염화칼숨(대) 보유수량
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size996.0 B
10
63 
5
36 
20

Length

Max length2
Median length2
Mean length1.6666667
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
10 63
58.3%
5 36
33.3%
20 9
 
8.3%

Length

2023-12-12T09:17:19.838160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:17:19.979879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10 63
58.3%
5 36
33.3%
20 9
 
8.3%

모래주머니(대) 보유수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size996.0 B
0
99 
10
 
9

Length

Max length2
Median length1
Mean length1.0833333
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row10
2nd row10
3rd row10
4th row10
5th row10

Common Values

ValueCountFrequency (%)
0 99
91.7%
10 9
 
8.3%

Length

2023-12-12T09:17:20.123013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:17:20.265275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 99
91.7%
10 9
 
8.3%

Interactions

2023-12-12T09:17:17.452380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:17:17.294937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:17:17.529551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:17:17.366758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:17:20.359416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번제설함규격(FRP)제설함크기별 번호동별 제설함 위치염화칼숨(대) 보유수량모래주머니(대) 보유수량
연번1.0000.8840.9560.7480.9350.983
제설함규격(FRP)0.8841.0000.4380.3011.0001.000
제설함크기별 번호0.9560.4381.0000.7270.5250.513
동별 제설함 위치0.7480.3010.7271.0000.5130.507
염화칼숨(대) 보유수량0.9351.0000.5250.5131.0001.000
모래주머니(대) 보유수량0.9831.0000.5130.5071.0001.000
2023-12-12T09:17:20.502645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
모래주머니(대) 보유수량제설함규격(FRP)동별 제설함 위치염화칼숨(대) 보유수량
모래주머니(대) 보유수량1.0000.9910.3950.995
제설함규격(FRP)0.9911.0000.1420.995
동별 제설함 위치0.3950.1421.0000.277
염화칼숨(대) 보유수량0.9950.9950.2771.000
2023-12-12T09:17:20.622582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번제설함크기별 번호제설함규격(FRP)동별 제설함 위치염화칼숨(대) 보유수량모래주머니(대) 보유수량
연번1.0000.3640.7320.3620.8840.851
제설함크기별 번호0.3641.0000.2670.3430.3570.379
제설함규격(FRP)0.7320.2671.0000.1420.9950.991
동별 제설함 위치0.3620.3430.1421.0000.2770.395
염화칼숨(대) 보유수량0.8840.3570.9950.2771.0000.995
모래주머니(대) 보유수량0.8510.3790.9910.3950.9951.000

Missing values

2023-12-12T09:17:17.725305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:17:17.836816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번제설함규격(FRP)제설함크기별 번호동별 제설함 위치제설함위치(주소)염화칼숨(대) 보유수량모래주머니(대) 보유수량
011홍은1동홍은동 440-52010
122홍제동세검정1길 952010
233홍은2동백련사길 1792010
344연희동연희로32길 192010
455연희동연희로11마길 852010
566봉원동봉원동 50-32010
677북아현동북아현로 1402010
788천연동독립문8길 1072010
899북아현동북아현로14길 692010
9101연희동연희로36길 61100
연번제설함규격(FRP)제설함크기별 번호동별 제설함 위치제설함위치(주소)염화칼숨(대) 보유수량모래주머니(대) 보유수량
989927현저동현저동 21050
9910028연희동연희로36길 2750
10010129창천동연희로2길 6050
10110230홍은2동가좌로 90-850
10210331홍은2동명지길 3050
10310432남가좌2동증가로 103-250
10410533홍제3동세검정로4길 9150
10510634북가좌1동북가좌동 477-150
10610735북가좌1동북가좌동 47850
10710836북아현동북아현동1-161250