Overview

Dataset statistics

Number of variables10
Number of observations826
Missing cells10
Missing cells (%)0.1%
Duplicate rows9
Duplicate rows (%)1.1%
Total size in memory67.9 KiB
Average record size in memory84.2 B

Variable types

Text5
Numeric3
Categorical2

Dataset

Description인증번호,인증회차,분야코드,분야명,제품명,모델명,준공일자,설치위치,사업명,설치수량
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-2204/S/1/datasetView.do

Alerts

분야코드 has constant value ""Constant
분야명 has constant value ""Constant
Dataset has 9 (1.1%) duplicate rowsDuplicates
인증회차 is highly overall correlated with 준공일자High correlation
준공일자 is highly overall correlated with 인증회차High correlation
사업명 has 10 (1.2%) missing valuesMissing

Reproduction

Analysis started2024-05-04 00:36:15.277250
Analysis finished2024-05-04 00:36:21.785487
Duration6.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct182
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2024-05-04T00:36:22.306439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters8260
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)8.0%

Sample

1st rowSGPD-00971
2nd rowSGPD-01026
3rd rowSGPD-01026
4th rowSGPD-01045
5th rowSGPD-01045
ValueCountFrequency (%)
sgpd-00210 91
 
11.0%
sgpd-00360 44
 
5.3%
sgpd-00319 35
 
4.2%
sgpd-00357 30
 
3.6%
sgpd-00818 26
 
3.1%
sgpd-00136 22
 
2.7%
sgpd-00513 18
 
2.2%
sgpd-00480 16
 
1.9%
sgpd-00034 12
 
1.5%
sgpd-00797 12
 
1.5%
Other values (172) 520
63.0%
2024-05-04T00:36:23.654611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2004
24.3%
S 826
10.0%
P 826
10.0%
D 826
10.0%
- 826
10.0%
G 826
10.0%
1 382
 
4.6%
3 333
 
4.0%
2 249
 
3.0%
8 240
 
2.9%
Other values (7) 922
11.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4121
49.9%
Uppercase Letter 3313
40.1%
Dash Punctuation 826
 
10.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2004
48.6%
1 382
 
9.3%
3 333
 
8.1%
2 249
 
6.0%
8 240
 
5.8%
6 212
 
5.1%
5 195
 
4.7%
4 184
 
4.5%
7 162
 
3.9%
9 160
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
S 826
24.9%
P 826
24.9%
D 826
24.9%
G 826
24.9%
B 5
 
0.2%
A 4
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 826
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4947
59.9%
Latin 3313
40.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2004
40.5%
- 826
16.7%
1 382
 
7.7%
3 333
 
6.7%
2 249
 
5.0%
8 240
 
4.9%
6 212
 
4.3%
5 195
 
3.9%
4 184
 
3.7%
7 162
 
3.3%
Latin
ValueCountFrequency (%)
S 826
24.9%
P 826
24.9%
D 826
24.9%
G 826
24.9%
B 5
 
0.2%
A 4
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8260
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2004
24.3%
S 826
10.0%
P 826
10.0%
D 826
10.0%
- 826
10.0%
G 826
10.0%
1 382
 
4.6%
3 333
 
4.0%
2 249
 
3.0%
8 240
 
2.9%
Other values (7) 922
11.2%

인증회차
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.430993
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-05-04T00:36:24.061754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q15
median15
Q325
95-th percentile28
Maximum28
Range27
Interquartile range (IQR)20

Descriptive statistics

Standard deviation9.4505133
Coefficient of variation (CV)0.65487617
Kurtosis-1.6558655
Mean14.430993
Median Absolute Deviation (MAD)10
Skewness0.17457074
Sum11920
Variance89.312202
MonotonicityNot monotonic
2024-05-04T00:36:24.817481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
5 116
14.0%
27 87
10.5%
7 84
10.2%
4 72
 
8.7%
26 71
 
8.6%
6 52
 
6.3%
28 47
 
5.7%
17 40
 
4.8%
18 37
 
4.5%
19 35
 
4.2%
Other values (15) 185
22.4%
ValueCountFrequency (%)
1 16
 
1.9%
2 1
 
0.1%
3 27
 
3.3%
4 72
8.7%
5 116
14.0%
6 52
6.3%
7 84
10.2%
8 32
 
3.9%
9 1
 
0.1%
12 6
 
0.7%
ValueCountFrequency (%)
28 47
5.7%
27 87
10.5%
26 71
8.6%
25 16
 
1.9%
24 9
 
1.1%
23 12
 
1.5%
22 10
 
1.2%
21 28
 
3.4%
20 19
 
2.3%
19 35
4.2%

분야코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2
826 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 826
100.0%

Length

2024-05-04T00:36:25.389887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T00:36:25.682013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 826
100.0%

분야명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
공공시설물
826 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공시설물
2nd row공공시설물
3rd row공공시설물
4th row공공시설물
5th row공공시설물

Common Values

ValueCountFrequency (%)
공공시설물 826
100.0%

Length

2024-05-04T00:36:26.423510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T00:36:27.088903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공시설물 826
100.0%
Distinct123
Distinct (%)14.9%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2024-05-04T00:36:28.022768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length25
Mean length8.2009685
Min length2

Characters and Unicode

Total characters6774
Distinct characters201
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)3.5%

Sample

1st row보행자용휀스
2nd row바론형
3rd row바론형
4th row디자인형울타리
5th row디자인형울타리
ValueCountFrequency (%)
분전함 92
 
6.9%
가로등 92
 
6.9%
기능형 91
 
6.9%
펜스 56
 
4.2%
디자인펜스 51
 
3.8%
보행자용 47
 
3.5%
3s관절형방호책 44
 
3.3%
보행자용휀스 36
 
2.7%
볼라드 35
 
2.6%
차량용 33
 
2.5%
Other values (147) 751
56.6%
2024-05-04T00:36:29.454046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
502
 
7.4%
211
 
3.1%
208
 
3.1%
200
 
3.0%
- 181
 
2.7%
0 181
 
2.7%
174
 
2.6%
147
 
2.2%
144
 
2.1%
135
 
2.0%
Other values (191) 4691
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3967
58.6%
Uppercase Letter 741
 
10.9%
Decimal Number 621
 
9.2%
Space Separator 502
 
7.4%
Lowercase Letter 464
 
6.8%
Dash Punctuation 181
 
2.7%
Open Punctuation 126
 
1.9%
Close Punctuation 126
 
1.9%
Math Symbol 18
 
0.3%
Control 8
 
0.1%
Other values (3) 20
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
211
 
5.3%
208
 
5.2%
200
 
5.0%
174
 
4.4%
147
 
3.7%
144
 
3.6%
135
 
3.4%
133
 
3.4%
132
 
3.3%
111
 
2.8%
Other values (129) 2372
59.8%
Uppercase Letter
ValueCountFrequency (%)
S 117
15.8%
B 87
11.7%
D 70
 
9.4%
A 50
 
6.7%
C 47
 
6.3%
K 46
 
6.2%
P 40
 
5.4%
M 38
 
5.1%
G 35
 
4.7%
H 30
 
4.0%
Other values (13) 181
24.4%
Lowercase Letter
ValueCountFrequency (%)
n 101
21.8%
e 81
17.5%
o 50
10.8%
i 47
10.1%
c 47
10.1%
r 29
 
6.2%
t 18
 
3.9%
a 18
 
3.9%
g 15
 
3.2%
y 13
 
2.8%
Other values (8) 45
9.7%
Decimal Number
ValueCountFrequency (%)
0 181
29.1%
3 109
17.6%
1 91
14.7%
2 64
 
10.3%
4 47
 
7.6%
5 46
 
7.4%
6 32
 
5.2%
8 27
 
4.3%
9 15
 
2.4%
7 9
 
1.4%
Other Punctuation
ValueCountFrequency (%)
, 3
42.9%
/ 2
28.6%
: 2
28.6%
Space Separator
ValueCountFrequency (%)
502
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 181
100.0%
Open Punctuation
ValueCountFrequency (%)
( 126
100.0%
Close Punctuation
ValueCountFrequency (%)
) 126
100.0%
Math Symbol
ValueCountFrequency (%)
+ 18
100.0%
Control
ValueCountFrequency (%)
8
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Letter Number
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3967
58.6%
Common 1597
23.6%
Latin 1210
 
17.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
211
 
5.3%
208
 
5.2%
200
 
5.0%
174
 
4.4%
147
 
3.7%
144
 
3.6%
135
 
3.4%
133
 
3.4%
132
 
3.3%
111
 
2.8%
Other values (129) 2372
59.8%
Latin
ValueCountFrequency (%)
S 117
 
9.7%
n 101
 
8.3%
B 87
 
7.2%
e 81
 
6.7%
D 70
 
5.8%
o 50
 
4.1%
A 50
 
4.1%
i 47
 
3.9%
c 47
 
3.9%
C 47
 
3.9%
Other values (32) 513
42.4%
Common
ValueCountFrequency (%)
502
31.4%
- 181
 
11.3%
0 181
 
11.3%
( 126
 
7.9%
) 126
 
7.9%
3 109
 
6.8%
1 91
 
5.7%
2 64
 
4.0%
4 47
 
2.9%
5 46
 
2.9%
Other values (10) 124
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3967
58.6%
ASCII 2802
41.4%
Number Forms 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
502
17.9%
- 181
 
6.5%
0 181
 
6.5%
( 126
 
4.5%
) 126
 
4.5%
S 117
 
4.2%
3 109
 
3.9%
n 101
 
3.6%
1 91
 
3.2%
B 87
 
3.1%
Other values (51) 1181
42.1%
Hangul
ValueCountFrequency (%)
211
 
5.3%
208
 
5.2%
200
 
5.0%
174
 
4.4%
147
 
3.7%
144
 
3.6%
135
 
3.4%
133
 
3.4%
132
 
3.3%
111
 
2.8%
Other values (129) 2372
59.8%
Number Forms
ValueCountFrequency (%)
5
100.0%
Distinct182
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2024-05-04T00:36:30.110125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length21
Mean length10.112591
Min length2

Characters and Unicode

Total characters8353
Distinct characters100
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)7.9%

Sample

1st rowKgrim-39
2nd rowACP-2004, ACPL-SC050N
3rd rowACP-2004, ACPL-SC050N
4th rowSN-103
5th rowSN-103
ValueCountFrequency (%)
acero-a(ace-02 91
 
10.0%
3s-ag 44
 
4.8%
jsf-157 35
 
3.8%
kw-e710sb4(p 30
 
3.3%
mdsp39-05a16(등주 26
 
2.8%
mdlm-50-35(등기구 26
 
2.8%
dkm-00301 22
 
2.4%
gr-208 18
 
2.0%
bp-036 16
 
1.8%
디자인아이엠유맨홀 12
 
1.3%
Other values (189) 593
65.0%
2024-05-04T00:36:31.377137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1089
 
13.0%
0 732
 
8.8%
1 535
 
6.4%
A 490
 
5.9%
S 432
 
5.2%
2 327
 
3.9%
C 276
 
3.3%
E 267
 
3.2%
3 255
 
3.1%
5 250
 
3.0%
Other values (90) 3700
44.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 3544
42.4%
Decimal Number 2641
31.6%
Dash Punctuation 1089
 
13.0%
Other Letter 414
 
5.0%
Open Punctuation 215
 
2.6%
Close Punctuation 215
 
2.6%
Space Separator 87
 
1.0%
Lowercase Letter 76
 
0.9%
Other Punctuation 64
 
0.8%
Control 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
15.0%
31
 
7.5%
31
 
7.5%
31
 
7.5%
22
 
5.3%
14
 
3.4%
14
 
3.4%
14
 
3.4%
13
 
3.1%
13
 
3.1%
Other values (37) 169
40.8%
Uppercase Letter
ValueCountFrequency (%)
A 490
13.8%
S 432
12.2%
C 276
 
7.8%
E 267
 
7.5%
D 212
 
6.0%
B 210
 
5.9%
R 202
 
5.7%
P 184
 
5.2%
M 158
 
4.5%
F 134
 
3.8%
Other values (14) 979
27.6%
Lowercase Letter
ValueCountFrequency (%)
g 13
17.1%
a 10
13.2%
i 10
13.2%
r 10
13.2%
s 8
10.5%
e 5
 
6.6%
l 4
 
5.3%
t 4
 
5.3%
m 3
 
3.9%
c 3
 
3.9%
Other values (3) 6
7.9%
Decimal Number
ValueCountFrequency (%)
0 732
27.7%
1 535
20.3%
2 327
12.4%
3 255
 
9.7%
5 250
 
9.5%
6 166
 
6.3%
4 109
 
4.1%
9 99
 
3.7%
7 93
 
3.5%
8 75
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 1089
100.0%
Open Punctuation
ValueCountFrequency (%)
( 215
100.0%
Close Punctuation
ValueCountFrequency (%)
) 215
100.0%
Space Separator
ValueCountFrequency (%)
87
100.0%
Other Punctuation
ValueCountFrequency (%)
, 64
100.0%
Control
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4319
51.7%
Latin 3620
43.3%
Hangul 414
 
5.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
15.0%
31
 
7.5%
31
 
7.5%
31
 
7.5%
22
 
5.3%
14
 
3.4%
14
 
3.4%
14
 
3.4%
13
 
3.1%
13
 
3.1%
Other values (37) 169
40.8%
Latin
ValueCountFrequency (%)
A 490
13.5%
S 432
11.9%
C 276
 
7.6%
E 267
 
7.4%
D 212
 
5.9%
B 210
 
5.8%
R 202
 
5.6%
P 184
 
5.1%
M 158
 
4.4%
F 134
 
3.7%
Other values (27) 1055
29.1%
Common
ValueCountFrequency (%)
- 1089
25.2%
0 732
16.9%
1 535
12.4%
2 327
 
7.6%
3 255
 
5.9%
5 250
 
5.8%
( 215
 
5.0%
) 215
 
5.0%
6 166
 
3.8%
4 109
 
2.5%
Other values (6) 426
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7939
95.0%
Hangul 414
 
5.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1089
 
13.7%
0 732
 
9.2%
1 535
 
6.7%
A 490
 
6.2%
S 432
 
5.4%
2 327
 
4.1%
C 276
 
3.5%
E 267
 
3.4%
3 255
 
3.2%
5 250
 
3.1%
Other values (43) 3286
41.4%
Hangul
ValueCountFrequency (%)
62
 
15.0%
31
 
7.5%
31
 
7.5%
31
 
7.5%
22
 
5.3%
14
 
3.4%
14
 
3.4%
14
 
3.4%
13
 
3.1%
13
 
3.1%
Other values (37) 169
40.8%

준공일자
Real number (ℝ)

HIGH CORRELATION 

Distinct622
Distinct (%)75.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20074942
Minimum2013
Maximum20220320
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-05-04T00:36:31.812379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013
5-th percentile20101126
Q120111201
median20130630
Q320180960
95-th percentile20201217
Maximum20220320
Range20218307
Interquartile range (IQR)69759

Descriptive statistics

Standard deviation1169800.7
Coefficient of variation (CV)0.058271688
Kurtosis273.79572
Mean20074942
Median Absolute Deviation (MAD)20119
Skewness-16.561831
Sum1.6581902 × 1010
Variance1.3684338 × 1012
MonotonicityNot monotonic
2024-05-04T00:36:32.312811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20121031 6
 
0.7%
20111215 5
 
0.6%
20120630 5
 
0.6%
20201221 5
 
0.6%
20200427 5
 
0.6%
20120731 4
 
0.5%
20120507 4
 
0.5%
20200630 4
 
0.5%
20181114 4
 
0.5%
20200609 4
 
0.5%
Other values (612) 780
94.4%
ValueCountFrequency (%)
2013 1
0.1%
201214 1
0.1%
2022831 1
0.1%
20090930 2
0.2%
20091025 2
0.2%
20091030 1
0.1%
20091230 1
0.1%
20091231 1
0.1%
20100331 1
0.1%
20100413 1
0.1%
ValueCountFrequency (%)
20220320 2
0.2%
20220317 1
0.1%
20220216 1
0.1%
20220113 1
0.1%
20211231 1
0.1%
20211229 1
0.1%
20211207 1
0.1%
20211206 1
0.1%
20211203 1
0.1%
20211029 1
0.1%
Distinct678
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2024-05-04T00:36:33.161981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length34
Mean length14.404358
Min length1

Characters and Unicode

Total characters11898
Distinct characters392
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique606 ?
Unique (%)73.4%

Sample

1st row서울특별시 구로구 구로동 69-7 일원
2nd row서울 동작구 동작동 102-18 이수스위첸포레힐즈아파트
3rd row서울 동작구 신대방동 686-48 협성휴포레시그니처 입구
4th row서울 중랑구 용마산로 715
5th row서울특별시 중랑구 면목천교
ValueCountFrequency (%)
서울 490
 
16.6%
서울특별시 50
 
1.7%
성북구 43
 
1.5%
영등포구 36
 
1.2%
서초구 36
 
1.2%
강남구 34
 
1.2%
노원구 34
 
1.2%
33
 
1.1%
양천구 33
 
1.1%
경기 32
 
1.1%
Other values (1135) 2126
72.1%
2024-05-04T00:36:34.727762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2150
 
18.1%
730
 
6.1%
695
 
5.8%
668
 
5.6%
588
 
4.9%
1 235
 
2.0%
200
 
1.7%
- 181
 
1.5%
2 169
 
1.4%
167
 
1.4%
Other values (382) 6115
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8309
69.8%
Space Separator 2150
 
18.1%
Decimal Number 1116
 
9.4%
Dash Punctuation 181
 
1.5%
Open Punctuation 47
 
0.4%
Close Punctuation 46
 
0.4%
Math Symbol 15
 
0.1%
Uppercase Letter 15
 
0.1%
Other Punctuation 11
 
0.1%
Lowercase Letter 5
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
730
 
8.8%
695
 
8.4%
668
 
8.0%
588
 
7.1%
200
 
2.4%
167
 
2.0%
142
 
1.7%
129
 
1.6%
124
 
1.5%
109
 
1.3%
Other values (347) 4757
57.3%
Uppercase Letter
ValueCountFrequency (%)
H 2
13.3%
T 2
13.3%
S 2
13.3%
B 2
13.3%
R 1
6.7%
G 1
6.7%
C 1
6.7%
M 1
6.7%
D 1
6.7%
U 1
6.7%
Decimal Number
ValueCountFrequency (%)
1 235
21.1%
2 169
15.1%
3 133
11.9%
4 110
9.9%
5 91
 
8.2%
8 90
 
8.1%
6 81
 
7.3%
0 81
 
7.3%
7 64
 
5.7%
9 62
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 6
54.5%
' 2
 
18.2%
/ 2
 
18.2%
. 1
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
e 3
60.0%
p 1
 
20.0%
y 1
 
20.0%
Space Separator
ValueCountFrequency (%)
2150
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 181
100.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Math Symbol
ValueCountFrequency (%)
~ 15
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8308
69.8%
Common 3569
30.0%
Latin 20
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
730
 
8.8%
695
 
8.4%
668
 
8.0%
588
 
7.1%
200
 
2.4%
167
 
2.0%
142
 
1.7%
129
 
1.6%
124
 
1.5%
109
 
1.3%
Other values (346) 4756
57.2%
Common
ValueCountFrequency (%)
2150
60.2%
1 235
 
6.6%
- 181
 
5.1%
2 169
 
4.7%
3 133
 
3.7%
4 110
 
3.1%
5 91
 
2.5%
8 90
 
2.5%
6 81
 
2.3%
0 81
 
2.3%
Other values (11) 248
 
6.9%
Latin
ValueCountFrequency (%)
e 3
15.0%
H 2
10.0%
T 2
10.0%
S 2
10.0%
B 2
10.0%
R 1
 
5.0%
G 1
 
5.0%
C 1
 
5.0%
M 1
 
5.0%
D 1
 
5.0%
Other values (4) 4
20.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8308
69.8%
ASCII 3589
30.2%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2150
59.9%
1 235
 
6.5%
- 181
 
5.0%
2 169
 
4.7%
3 133
 
3.7%
4 110
 
3.1%
5 91
 
2.5%
8 90
 
2.5%
6 81
 
2.3%
0 81
 
2.3%
Other values (25) 268
 
7.5%
Hangul
ValueCountFrequency (%)
730
 
8.8%
695
 
8.4%
668
 
8.0%
588
 
7.1%
200
 
2.4%
167
 
2.0%
142
 
1.7%
129
 
1.6%
124
 
1.5%
109
 
1.3%
Other values (346) 4756
57.2%
CJK
ValueCountFrequency (%)
1
100.0%

사업명
Text

MISSING 

Distinct756
Distinct (%)92.6%
Missing10
Missing (%)1.2%
Memory size6.6 KiB
2024-05-04T00:36:35.581592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length33
Mean length19.707108
Min length3

Characters and Unicode

Total characters16081
Distinct characters473
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique714 ?
Unique (%)87.5%

Sample

1st row관급자재구매(교통사고 잦은 곳 개선사업(거리공원오거리)-보행자안전휀스
2nd row동작1 주택재건축정비사업 중 가로등 설비 전기공사
3rd row신대방 협성휴포레 아파트 신축공사 중 가로등 설비 전기공사
4th row디자인형울타리 구매(노후 보행자방호울타리 교체사업)
5th row관급자재구매(디자인형울타리-2022년하천시설물 유지보수공사)
ValueCountFrequency (%)
구매 103
 
3.5%
관급자재 86
 
3.0%
설치공사 61
 
2.1%
설치 50
 
1.7%
45
 
1.5%
가로등 29
 
1.0%
디자인형울타리 29
 
1.0%
정비공사 28
 
1.0%
조성공사 27
 
0.9%
25
 
0.9%
Other values (1435) 2426
83.4%
2024-05-04T00:36:37.327858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2120
 
13.2%
581
 
3.6%
577
 
3.6%
444
 
2.8%
362
 
2.3%
354
 
2.2%
333
 
2.1%
273
 
1.7%
254
 
1.6%
237
 
1.5%
Other values (463) 10546
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12547
78.0%
Space Separator 2120
 
13.2%
Decimal Number 691
 
4.3%
Dash Punctuation 213
 
1.3%
Open Punctuation 174
 
1.1%
Close Punctuation 168
 
1.0%
Other Punctuation 77
 
0.5%
Uppercase Letter 71
 
0.4%
Math Symbol 12
 
0.1%
Connector Punctuation 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
581
 
4.6%
577
 
4.6%
444
 
3.5%
362
 
2.9%
354
 
2.8%
333
 
2.7%
273
 
2.2%
254
 
2.0%
237
 
1.9%
231
 
1.8%
Other values (425) 8901
70.9%
Uppercase Letter
ValueCountFrequency (%)
D 14
19.7%
T 10
14.1%
L 9
12.7%
C 8
11.3%
S 8
11.3%
E 6
8.5%
H 5
 
7.0%
W 3
 
4.2%
V 3
 
4.2%
M 2
 
2.8%
Other values (3) 3
 
4.2%
Decimal Number
ValueCountFrequency (%)
2 185
26.8%
1 166
24.0%
0 107
15.5%
3 56
 
8.1%
7 43
 
6.2%
5 32
 
4.6%
6 31
 
4.5%
4 25
 
3.6%
9 23
 
3.3%
8 23
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 41
53.2%
/ 14
 
18.2%
' 10
 
13.0%
. 9
 
11.7%
? 2
 
2.6%
: 1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 163
93.7%
[ 11
 
6.3%
Close Punctuation
ValueCountFrequency (%)
) 158
94.0%
] 10
 
6.0%
Space Separator
ValueCountFrequency (%)
2120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 213
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12547
78.0%
Common 3463
 
21.5%
Latin 71
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
581
 
4.6%
577
 
4.6%
444
 
3.5%
362
 
2.9%
354
 
2.8%
333
 
2.7%
273
 
2.2%
254
 
2.0%
237
 
1.9%
231
 
1.8%
Other values (425) 8901
70.9%
Common
ValueCountFrequency (%)
2120
61.2%
- 213
 
6.2%
2 185
 
5.3%
1 166
 
4.8%
( 163
 
4.7%
) 158
 
4.6%
0 107
 
3.1%
3 56
 
1.6%
7 43
 
1.2%
, 41
 
1.2%
Other values (15) 211
 
6.1%
Latin
ValueCountFrequency (%)
D 14
19.7%
T 10
14.1%
L 9
12.7%
C 8
11.3%
S 8
11.3%
E 6
8.5%
H 5
 
7.0%
W 3
 
4.2%
V 3
 
4.2%
M 2
 
2.8%
Other values (3) 3
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12547
78.0%
ASCII 3534
 
22.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2120
60.0%
- 213
 
6.0%
2 185
 
5.2%
1 166
 
4.7%
( 163
 
4.6%
) 158
 
4.5%
0 107
 
3.0%
3 56
 
1.6%
7 43
 
1.2%
, 41
 
1.2%
Other values (28) 282
 
8.0%
Hangul
ValueCountFrequency (%)
581
 
4.6%
577
 
4.6%
444
 
3.5%
362
 
2.9%
354
 
2.8%
333
 
2.7%
273
 
2.2%
254
 
2.0%
237
 
1.9%
231
 
1.8%
Other values (425) 8901
70.9%

설치수량
Real number (ℝ)

Distinct238
Distinct (%)28.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean89.74092
Minimum1
Maximum2240
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-05-04T00:36:37.826856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q17
median35
Q3106
95-th percentile319.5
Maximum2240
Range2239
Interquartile range (IQR)99

Descriptive statistics

Standard deviation171.1583
Coefficient of variation (CV)1.9072492
Kurtosis52.324343
Mean89.74092
Median Absolute Deviation (MAD)33
Skewness5.9444407
Sum74126
Variance29295.164
MonotonicityNot monotonic
2024-05-04T00:36:38.265228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 74
 
9.0%
2 41
 
5.0%
4 28
 
3.4%
3 28
 
3.4%
6 21
 
2.5%
10 19
 
2.3%
8 14
 
1.7%
80 13
 
1.6%
14 13
 
1.6%
12 13
 
1.6%
Other values (228) 562
68.0%
ValueCountFrequency (%)
1 74
9.0%
2 41
5.0%
3 28
 
3.4%
4 28
 
3.4%
5 11
 
1.3%
6 21
 
2.5%
7 10
 
1.2%
8 14
 
1.7%
9 11
 
1.3%
10 19
 
2.3%
ValueCountFrequency (%)
2240 1
0.1%
1800 1
0.1%
1300 1
0.1%
1211 1
0.1%
1181 1
0.1%
1068 1
0.1%
1012 1
0.1%
1002 1
0.1%
905 1
0.1%
812 1
0.1%

Interactions

2024-05-04T00:36:19.426356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:17.250610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:18.186228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:19.831628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:17.605690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:18.528906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:20.249563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:17.992999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:36:19.069522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-04T00:36:38.523192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증회차준공일자설치수량
인증회차1.0000.0000.000
준공일자0.0001.0000.000
설치수량0.0000.0001.000
2024-05-04T00:36:38.874432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증회차준공일자설치수량
인증회차1.0000.6670.281
준공일자0.6671.0000.127
설치수량0.2810.1271.000

Missing values

2024-05-04T00:36:20.802763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-04T00:36:21.508815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인증번호인증회차분야코드분야명제품명모델명준공일자설치위치사업명설치수량
0SGPD-00971272공공시설물보행자용휀스Kgrim-392022831서울특별시 구로구 구로동 69-7 일원관급자재구매(교통사고 잦은 곳 개선사업(거리공원오거리)-보행자안전휀스74
1SGPD-01026282공공시설물바론형ACP-2004, ACPL-SC050N20220320서울 동작구 동작동 102-18 이수스위첸포레힐즈아파트동작1 주택재건축정비사업 중 가로등 설비 전기공사10
2SGPD-01026282공공시설물바론형ACP-2004, ACPL-SC050N20220320서울 동작구 신대방동 686-48 협성휴포레시그니처 입구신대방 협성휴포레 아파트 신축공사 중 가로등 설비 전기공사16
3SGPD-01045282공공시설물디자인형울타리SN-10320220317서울 중랑구 용마산로 715디자인형울타리 구매(노후 보행자방호울타리 교체사업)215
4SGPD-01045282공공시설물디자인형울타리SN-10320220216서울특별시 중랑구 면목천교관급자재구매(디자인형울타리-2022년하천시설물 유지보수공사)95
5SGPD-01041282공공시설물보행자용 펜스Kgrim-4420220113서울 구로구 구로동 701-2 일원관급자재구매(롯데광명물류센터 구간 간선도로 안전휀스 정비103
6SGPD-01018282공공시설물남대문로형ACP-261, ACPL-STL100B20211231서울 종로구 와룡동 2-2울곡로 도로구조 개선공사3
7SGPD-01045282공공시설물디자인형울타리SN-10320211229서울특별시 중랑구 이화교디자인형울타리(에스앤)-묵동천 생태하천 조성공사 3차234
8SGPD-01045282공공시설물디자인형울타리SN-10320211207중랑구 상봉역 2번출구디자인형울타리 구매 -2021년 도로굴착복구공사144
9SGPD-01045282공공시설물디자인형울타리SN-10320211206서울 양천구 목동중앙로 70021년 보도포장 유지보수공사(연간단가) 관급자재 구매(디자인형울타리)114
인증번호인증회차분야코드분야명제품명모델명준공일자설치위치사업명설치수량
816SGPD-0000812공공시설물Frigg(SP-2)Frigg(SP-2)20100430서울 마포구 성산동가로등 유지보수공사51
817SGPD-0004832공공시설물CIP-370CIP-37020100413경기 수원시 영통구 망포동 66-1번지퍼걸러 관급자재 구입1
818SGPD-0007042공공시설물HBE-1002HBE-100220100331서울 송파구 방이동올림픽공원 휴게편익시설 제조 설치60
819SGPD-0000812공공시설물Frigg(SP-2)Frigg(SP-2)20091231서울 노원구 공릉동보안등 및 분전반 구매34
820SGPD-0000912공공시설물Strass e(SP-1)Strass e(SP-1)20091230서울 강동구 천호동한강공원 자전거도로 산책로 분리조성 전기공사1181
821SGPD-00034272공공시설물디자인아이엠유맨홀디자인아이엠유맨홀20091030광진구 군자역~아차산역삼거리 일대 보도설치건강테마 보행벨트 조성공사117
822SGPD-0000812공공시설물Frigg(SP-2)Frigg(SP-2)20091025서울 송파구 잠실3동삼학사길 도로조명 개선공사20
823SGPD-0000812공공시설물Frigg(SP-2)Frigg(SP-2)20091025서울 송파구 잠실3동석촌호수길 도로조명 개선공사43
824SGPD-00034272공공시설물디자인아이엠유맨홀디자인아이엠유맨홀20090930종로구 광화문역 광화문광장 일대 보도설치광화문광장 조성공사171
825SGPD-00034272공공시설물디자인아이엠유맨홀디자인아이엠유맨홀20090930종로구 안국역사거리 일대 보도설치고궁로 보도정비공사135

Duplicate rows

Most frequently occurring

인증번호인증회차분야코드분야명제품명모델명준공일자설치위치사업명설치수량# duplicates
1SGPD-00355262공공시설물WT0120200427경기도 광명시 영당안로 49개운어린이공원 지하주차장 설치공사 중 음수대 제작 납품34
3SGPD-00808172공공시설물미래도시SH1212-Q1120181228서울 영등포구 문래동3가가로수 생육환경 개선사업2004
6SGPD-00884202공공시설물조화SH1212-Q1820181114서울 동작구 흑석동143-80흑석 7구역 재개발사업804
5SGPD-00867192공공시설물Design FenceBRF-12-0620190601서울 동작구 동작동 현충원안전휀스 구매1643
8SGPD-00962222공공시설물RiverAGT-1212-A120191007서울 노원구 월계동인덕아파트재건축1203
0SGPD-00143182공공시설물ACP-261(남대문로형)ACP-261(남대문로형)20120621서울 마포구 동교동가로등유지보수공사12
2SGPD-00530282공공시설물횡단보도 편의시설횡단보도 쉘타20170410성동구 고산자로 202횡단보도 안전쉘타 설치12
4SGPD-00863192공공시설물보행자용펜스KL-WP01B20181210서울 강서구 화곡동 18-5 우장초교 앞우장초교 통학로 조성공사802
7SGPD-00902242공공시설물공공시설물 펜스(자전거도로용)SSBF-0120190829서울 양천구 목동동로 323관내방호울타리 설치공사 관급자재구매2512