Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells1570
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory966.8 KiB
Average record size in memory99.0 B

Variable types

Numeric3
Text4
Categorical3
DateTime1

Dataset

Description연면적 500㎡ 이상의 공공건축물의 대한 석면건축물 현황입니다.조사기관, 주소, 기관명, 석면건축물여부(석면건축물 50㎡ 이상 건축물), 석면자재면적 등의 정보를 제공합니다.
Author한국환경공단
URLhttps://www.data.go.kr/data/15092323/fileData.do

Alerts

건축물구분-대분류 has constant value ""Constant
건축물_동명 has 1564 (15.6%) missing valuesMissing
석면자재 면적 is highly skewed (γ1 = 42.34282813)Skewed
석면건축물_일련번호 has unique valuesUnique
석면자재 면적 has 6282 (62.8%) zerosZeros

Reproduction

Analysis started2024-04-21 02:02:56.104460
Analysis finished2024-04-21 02:03:00.579104
Duration4.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

석면건축물_일련번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71419.774
Minimum16
Maximum205676
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T11:03:00.667546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile2794.9
Q115913.25
median34208.5
Q3128072.5
95-th percentile190829.25
Maximum205676
Range205660
Interquartile range (IQR)112159.25

Descriptive statistics

Standard deviation66323.678
Coefficient of variation (CV)0.92864586
Kurtosis-1.2465667
Mean71419.774
Median Absolute Deviation (MAD)26613
Skewness0.59781972
Sum7.1419774 × 108
Variance4.3988302 × 109
MonotonicityNot monotonic
2024-04-21T11:03:00.802978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33602 1
 
< 0.1%
49121 1
 
< 0.1%
1507 1
 
< 0.1%
161812 1
 
< 0.1%
4007 1
 
< 0.1%
122323 1
 
< 0.1%
4396 1
 
< 0.1%
200676 1
 
< 0.1%
13982 1
 
< 0.1%
19476 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
16 1
< 0.1%
19 1
< 0.1%
21 1
< 0.1%
26 1
< 0.1%
29 1
< 0.1%
32 1
< 0.1%
35 1
< 0.1%
38 1
< 0.1%
43 1
< 0.1%
46 1
< 0.1%
ValueCountFrequency (%)
205676 1
< 0.1%
205674 1
< 0.1%
205671 1
< 0.1%
205663 1
< 0.1%
205649 1
< 0.1%
205634 1
< 0.1%
205633 1
< 0.1%
205632 1
< 0.1%
205629 1
< 0.1%
205627 1
< 0.1%
Distinct8024
Distinct (%)80.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T11:03:01.106345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length49
Mean length23.4058
Min length8

Characters and Unicode

Total characters234058
Distinct characters548
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7081 ?
Unique (%)70.8%

Sample

1st row충청북도 진천군 진천읍 문화로 79
2nd row대전광역시 유성구 대덕대로989번길 242 (덕진동)
3rd row전라남도 장성군 삼계면 영장로 1628
4th row경상북도 청도군 화양읍 청려로 1844
5th row서울특별시 송파구 올림픽로43길 88 (풍납동)
ValueCountFrequency (%)
경기도 1572
 
3.0%
서울특별시 1189
 
2.2%
경상남도 936
 
1.8%
경상북도 908
 
1.7%
강원도 795
 
1.5%
전라남도 656
 
1.2%
충청남도 630
 
1.2%
전라북도 545
 
1.0%
부산광역시 545
 
1.0%
0 528
 
1.0%
Other values (9912) 44698
84.3%
2024-04-21T11:03:01.751328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43179
 
18.4%
8428
 
3.6%
8050
 
3.4%
7378
 
3.2%
7248
 
3.1%
1 6828
 
2.9%
( 5874
 
2.5%
) 5873
 
2.5%
5200
 
2.2%
2 4666
 
2.0%
Other values (538) 131334
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 145360
62.1%
Space Separator 43179
 
18.4%
Decimal Number 33582
 
14.3%
Open Punctuation 5874
 
2.5%
Close Punctuation 5873
 
2.5%
Other Punctuation 169
 
0.1%
Uppercase Letter 20
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8428
 
5.8%
8050
 
5.5%
7378
 
5.1%
7248
 
5.0%
5200
 
3.6%
3958
 
2.7%
3606
 
2.5%
3508
 
2.4%
3216
 
2.2%
2777
 
1.9%
Other values (510) 91991
63.3%
Decimal Number
ValueCountFrequency (%)
1 6828
20.3%
2 4666
13.9%
3 3730
11.1%
4 3063
9.1%
5 2931
8.7%
0 2898
8.6%
6 2567
 
7.6%
7 2449
 
7.3%
9 2243
 
6.7%
8 2207
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
C 5
25.0%
A 3
15.0%
P 3
15.0%
K 2
 
10.0%
E 2
 
10.0%
B 1
 
5.0%
S 1
 
5.0%
F 1
 
5.0%
O 1
 
5.0%
I 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 141
83.4%
. 18
 
10.7%
· 9
 
5.3%
/ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
43179
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5874
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5873
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 145360
62.1%
Common 88678
37.9%
Latin 20
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8428
 
5.8%
8050
 
5.5%
7378
 
5.1%
7248
 
5.0%
5200
 
3.6%
3958
 
2.7%
3606
 
2.5%
3508
 
2.4%
3216
 
2.2%
2777
 
1.9%
Other values (510) 91991
63.3%
Common
ValueCountFrequency (%)
43179
48.7%
1 6828
 
7.7%
( 5874
 
6.6%
) 5873
 
6.6%
2 4666
 
5.3%
3 3730
 
4.2%
4 3063
 
3.5%
5 2931
 
3.3%
0 2898
 
3.3%
6 2567
 
2.9%
Other values (8) 7069
 
8.0%
Latin
ValueCountFrequency (%)
C 5
25.0%
A 3
15.0%
P 3
15.0%
K 2
 
10.0%
E 2
 
10.0%
B 1
 
5.0%
S 1
 
5.0%
F 1
 
5.0%
O 1
 
5.0%
I 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 145360
62.1%
ASCII 88689
37.9%
None 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43179
48.7%
1 6828
 
7.7%
( 5874
 
6.6%
) 5873
 
6.6%
2 4666
 
5.3%
3 3730
 
4.2%
4 3063
 
3.5%
5 2931
 
3.3%
0 2898
 
3.3%
6 2567
 
2.9%
Other values (17) 7080
 
8.0%
Hangul
ValueCountFrequency (%)
8428
 
5.8%
8050
 
5.5%
7378
 
5.1%
7248
 
5.0%
5200
 
3.6%
3958
 
2.7%
3606
 
2.5%
3508
 
2.4%
3216
 
2.2%
2777
 
1.9%
Other values (510) 91991
63.3%
None
ValueCountFrequency (%)
· 9
100.0%
Distinct8175
Distinct (%)81.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T11:03:01.989484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length33
Mean length9.1449
Min length2

Characters and Unicode

Total characters91449
Distinct characters647
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7381 ?
Unique (%)73.8%

Sample

1st row진천문화원
2nd row한전원자력연료
3rd row삼계하수종말처리장
4th row청도군 선거관리위원회
5th row한국전력공사 풍납변전소
ValueCountFrequency (%)
한국전력공사 175
 
1.3%
한국도로공사 101
 
0.7%
주민센터 63
 
0.5%
본점 54
 
0.4%
한울원자력본부 44
 
0.3%
ibk기업은행 40
 
0.3%
한국마사회 38
 
0.3%
한국농어촌공사 37
 
0.3%
35
 
0.3%
농업기술센터 34
 
0.3%
Other values (8986) 12846
95.4%
2024-04-21T11:03:02.348905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3493
 
3.8%
2562
 
2.8%
2275
 
2.5%
2107
 
2.3%
2030
 
2.2%
1875
 
2.1%
1688
 
1.8%
1586
 
1.7%
1509
 
1.7%
1462
 
1.6%
Other values (637) 70862
77.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83270
91.1%
Space Separator 3493
 
3.8%
Decimal Number 1564
 
1.7%
Close Punctuation 1129
 
1.2%
Open Punctuation 1105
 
1.2%
Uppercase Letter 586
 
0.6%
Other Punctuation 214
 
0.2%
Lowercase Letter 58
 
0.1%
Other Symbol 15
 
< 0.1%
Math Symbol 8
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2562
 
3.1%
2275
 
2.7%
2107
 
2.5%
2030
 
2.4%
1875
 
2.3%
1688
 
2.0%
1586
 
1.9%
1509
 
1.8%
1462
 
1.8%
1426
 
1.7%
Other values (571) 64750
77.8%
Uppercase Letter
ValueCountFrequency (%)
B 99
16.9%
K 79
13.5%
A 65
11.1%
I 58
9.9%
S 54
9.2%
C 36
 
6.1%
O 25
 
4.3%
T 24
 
4.1%
N 21
 
3.6%
P 20
 
3.4%
Other values (14) 105
17.9%
Lowercase Letter
ValueCountFrequency (%)
k 9
15.5%
a 7
12.1%
c 6
10.3%
d 5
8.6%
e 5
8.6%
m 5
8.6%
b 4
6.9%
y 4
6.9%
o 3
 
5.2%
v 3
 
5.2%
Other values (6) 7
12.1%
Decimal Number
ValueCountFrequency (%)
1 571
36.5%
2 316
20.2%
3 138
 
8.8%
9 127
 
8.1%
4 109
 
7.0%
5 82
 
5.2%
7 66
 
4.2%
0 58
 
3.7%
6 54
 
3.5%
8 43
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 125
58.4%
. 54
25.2%
/ 28
 
13.1%
: 3
 
1.4%
· 3
 
1.4%
1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 1125
99.6%
] 4
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 1101
99.6%
[ 4
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 7
87.5%
> 1
 
12.5%
Space Separator
ValueCountFrequency (%)
3493
100.0%
Other Symbol
ValueCountFrequency (%)
15
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Control
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83283
91.1%
Common 7520
 
8.2%
Latin 644
 
0.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2562
 
3.1%
2275
 
2.7%
2107
 
2.5%
2030
 
2.4%
1875
 
2.3%
1688
 
2.0%
1586
 
1.9%
1509
 
1.8%
1462
 
1.8%
1426
 
1.7%
Other values (571) 64763
77.8%
Latin
ValueCountFrequency (%)
B 99
15.4%
K 79
12.3%
A 65
 
10.1%
I 58
 
9.0%
S 54
 
8.4%
C 36
 
5.6%
O 25
 
3.9%
T 24
 
3.7%
N 21
 
3.3%
P 20
 
3.1%
Other values (30) 163
25.3%
Common
ValueCountFrequency (%)
3493
46.4%
) 1125
 
15.0%
( 1101
 
14.6%
1 571
 
7.6%
2 316
 
4.2%
3 138
 
1.8%
9 127
 
1.7%
, 125
 
1.7%
4 109
 
1.4%
5 82
 
1.1%
Other values (15) 333
 
4.4%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83264
91.0%
ASCII 8160
 
8.9%
None 19
 
< 0.1%
Compat Jamo 4
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3493
42.8%
) 1125
 
13.8%
( 1101
 
13.5%
1 571
 
7.0%
2 316
 
3.9%
3 138
 
1.7%
9 127
 
1.6%
, 125
 
1.5%
4 109
 
1.3%
B 99
 
1.2%
Other values (53) 956
 
11.7%
Hangul
ValueCountFrequency (%)
2562
 
3.1%
2275
 
2.7%
2107
 
2.5%
2030
 
2.4%
1875
 
2.3%
1688
 
2.0%
1586
 
1.9%
1509
 
1.8%
1462
 
1.8%
1426
 
1.7%
Other values (569) 64744
77.8%
None
ValueCountFrequency (%)
15
78.9%
· 3
 
15.8%
1
 
5.3%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
2
100.0%

건축물_동명
Text

MISSING 

Distinct6713
Distinct (%)79.6%
Missing1564
Missing (%)15.6%
Memory size156.2 KiB
2024-04-21T11:03:02.616894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length37
Mean length7.1188952
Min length1

Characters and Unicode

Total characters60055
Distinct characters632
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6363 ?
Unique (%)75.4%

Sample

1st row한마음관
2nd row청도군 선거관리위원회
3rd row한국전력공사 풍납변전소
4th row수원지방법원 광주등기소
5th row부안변전소
ValueCountFrequency (%)
본관 393
 
3.7%
본관동 107
 
1.0%
1 82
 
0.8%
관리동 80
 
0.8%
별관 74
 
0.7%
한국전력공사 62
 
0.6%
본점 54
 
0.5%
52
 
0.5%
1동 48
 
0.5%
창고 48
 
0.5%
Other values (7154) 9644
90.6%
2024-04-21T11:03:03.029112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2843
 
4.7%
2358
 
3.9%
2046
 
3.4%
1599
 
2.7%
1297
 
2.2%
1290
 
2.1%
1072
 
1.8%
) 883
 
1.5%
( 871
 
1.5%
1 850
 
1.4%
Other values (622) 44946
74.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52425
87.3%
Decimal Number 2509
 
4.2%
Space Separator 2358
 
3.9%
Close Punctuation 885
 
1.5%
Open Punctuation 872
 
1.5%
Uppercase Letter 636
 
1.1%
Other Punctuation 275
 
0.5%
Math Symbol 39
 
0.1%
Lowercase Letter 24
 
< 0.1%
Other Symbol 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2843
 
5.4%
2046
 
3.9%
1599
 
3.1%
1297
 
2.5%
1290
 
2.5%
1072
 
2.0%
822
 
1.6%
803
 
1.5%
752
 
1.4%
744
 
1.4%
Other values (559) 39157
74.7%
Uppercase Letter
ValueCountFrequency (%)
B 125
19.7%
A 124
19.5%
C 58
9.1%
K 45
 
7.1%
I 34
 
5.3%
S 32
 
5.0%
D 29
 
4.6%
T 26
 
4.1%
F 26
 
4.1%
E 25
 
3.9%
Other values (15) 112
17.6%
Lowercase Letter
ValueCountFrequency (%)
k 6
25.0%
a 3
12.5%
b 3
12.5%
m 3
12.5%
v 2
 
8.3%
s 1
 
4.2%
t 1
 
4.2%
w 1
 
4.2%
e 1
 
4.2%
r 1
 
4.2%
Other values (2) 2
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 850
33.9%
2 508
20.2%
3 274
 
10.9%
4 202
 
8.1%
9 138
 
5.5%
0 132
 
5.3%
5 129
 
5.1%
6 118
 
4.7%
7 85
 
3.4%
8 73
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 165
60.0%
. 82
29.8%
/ 21
 
7.6%
: 5
 
1.8%
1
 
0.4%
· 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 883
99.8%
] 2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 871
99.9%
[ 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 38
97.4%
> 1
 
2.6%
Other Symbol
ValueCountFrequency (%)
21
91.3%
2
 
8.7%
Space Separator
ValueCountFrequency (%)
2358
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52423
87.3%
Common 6968
 
11.6%
Latin 660
 
1.1%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2843
 
5.4%
2046
 
3.9%
1599
 
3.1%
1297
 
2.5%
1290
 
2.5%
1072
 
2.0%
822
 
1.6%
803
 
1.5%
752
 
1.4%
744
 
1.4%
Other values (559) 39155
74.7%
Latin
ValueCountFrequency (%)
B 125
18.9%
A 124
18.8%
C 58
8.8%
K 45
 
6.8%
I 34
 
5.2%
S 32
 
4.8%
D 29
 
4.4%
T 26
 
3.9%
F 26
 
3.9%
E 25
 
3.8%
Other values (27) 136
20.6%
Common
ValueCountFrequency (%)
2358
33.8%
) 883
 
12.7%
( 871
 
12.5%
1 850
 
12.2%
2 508
 
7.3%
3 274
 
3.9%
4 202
 
2.9%
, 165
 
2.4%
9 138
 
2.0%
0 132
 
1.9%
Other values (15) 587
 
8.4%
Han
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52418
87.3%
ASCII 7605
 
12.7%
CJK Compat 21
 
< 0.1%
CJK 4
 
< 0.1%
None 4
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2843
 
5.4%
2046
 
3.9%
1599
 
3.1%
1297
 
2.5%
1290
 
2.5%
1072
 
2.0%
822
 
1.6%
803
 
1.5%
752
 
1.4%
744
 
1.4%
Other values (557) 39150
74.7%
ASCII
ValueCountFrequency (%)
2358
31.0%
) 883
 
11.6%
( 871
 
11.5%
1 850
 
11.2%
2 508
 
6.7%
3 274
 
3.6%
4 202
 
2.7%
, 165
 
2.2%
9 138
 
1.8%
0 132
 
1.7%
Other values (49) 1224
16.1%
CJK Compat
ValueCountFrequency (%)
21
100.0%
CJK
ValueCountFrequency (%)
4
100.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
None
ValueCountFrequency (%)
2
50.0%
1
25.0%
· 1
25.0%

건축물구분-대분류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공공건축물
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공건축물
2nd row공공건축물
3rd row공공건축물
4th row공공건축물
5th row공공건축물

Common Values

ValueCountFrequency (%)
공공건축물 10000
100.0%

Length

2024-04-21T11:03:03.159399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:03:03.238978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공건축물 10000
100.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
행정기관
5971 
공공기관
2234 
특수법인
1301 
지방공사/공단
 
494

Length

Max length7
Median length4
Mean length4.1482
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row행정기관
2nd row공공기관
3rd row행정기관
4th row행정기관
5th row공공기관

Common Values

ValueCountFrequency (%)
행정기관 5971
59.7%
공공기관 2234
 
22.3%
특수법인 1301
 
13.0%
지방공사/공단 494
 
4.9%

Length

2024-04-21T11:03:03.336988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:03:03.444207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
행정기관 5971
59.7%
공공기관 2234
 
22.3%
특수법인 1301
 
13.0%
지방공사/공단 494
 
4.9%

연면적
Real number (ℝ)

Distinct8727
Distinct (%)87.3%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean3728.7637
Minimum0
Maximum378705
Zeros11
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T11:03:03.576281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile523.018
Q1769.34
median1350
Q33086.84
95-th percentile11794.492
Maximum378705
Range378705
Interquartile range (IQR)2317.5

Descriptive statistics

Standard deviation11833.658
Coefficient of variation (CV)3.1736142
Kurtosis364.16947
Mean3728.7637
Median Absolute Deviation (MAD)726.31
Skewness16.403966
Sum37283909
Variance1.4003545 × 108
MonotonicityNot monotonic
2024-04-21T11:03:03.706732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.0 19
 
0.2%
660.0 14
 
0.1%
500.0 13
 
0.1%
538.05 12
 
0.1%
540.0 11
 
0.1%
600.0 11
 
0.1%
0.0 11
 
0.1%
630.0 10
 
0.1%
576.0 10
 
0.1%
567.0 9
 
0.1%
Other values (8717) 9879
98.8%
ValueCountFrequency (%)
0.0 11
0.1%
1.0 19
0.2%
8.5 1
 
< 0.1%
8.62 1
 
< 0.1%
9.61 1
 
< 0.1%
15.0 2
 
< 0.1%
15.48 1
 
< 0.1%
16.2 2
 
< 0.1%
16.3 1
 
< 0.1%
17.4 1
 
< 0.1%
ValueCountFrequency (%)
378705.0 1
 
< 0.1%
279279.9 6
0.1%
269129.0 1
 
< 0.1%
260447.0 1
 
< 0.1%
232700.0 1
 
< 0.1%
197086.0 1
 
< 0.1%
175597.2 1
 
< 0.1%
161593.06 1
 
< 0.1%
158938.0 1
 
< 0.1%
134076.0 1
 
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미해당
7462 
해당
2538 

Length

Max length3
Median length3
Mean length2.7462
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미해당
2nd row해당
3rd row미해당
4th row미해당
5th row해당

Common Values

ValueCountFrequency (%)
미해당 7462
74.6%
해당 2538
 
25.4%

Length

2024-04-21T11:03:03.850080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:03:03.936884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미해당 7462
74.6%
해당 2538
 
25.4%

석면자재 면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct3183
Distinct (%)31.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean256.25479
Minimum0
Maximum104589.11
Zeros6282
Zeros (%)62.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T11:03:04.039620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q358.05
95-th percentile1219.396
Maximum104589.11
Range104589.11
Interquartile range (IQR)58.05

Descriptive statistics

Standard deviation1488.2886
Coefficient of variation (CV)5.807847
Kurtosis2625.34
Mean256.25479
Median Absolute Deviation (MAD)0
Skewness42.342828
Sum2562547.9
Variance2215003
MonotonicityNot monotonic
2024-04-21T11:03:04.164676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 6282
62.8%
1.0 56
 
0.6%
5.0 12
 
0.1%
0.03 12
 
0.1%
0.1 12
 
0.1%
0.12 12
 
0.1%
0.04 12
 
0.1%
0.06 12
 
0.1%
0.14 11
 
0.1%
0.2 9
 
0.1%
Other values (3173) 3570
35.7%
ValueCountFrequency (%)
0.0 6282
62.8%
0.01 9
 
0.1%
0.02 9
 
0.1%
0.03 12
 
0.1%
0.04 12
 
0.1%
0.05 3
 
< 0.1%
0.06 12
 
0.1%
0.07 8
 
0.1%
0.08 9
 
0.1%
0.09 7
 
0.1%
ValueCountFrequency (%)
104589.11 1
< 0.1%
52451.19 1
< 0.1%
34632.5 1
< 0.1%
32270.1 1
< 0.1%
20038.0 1
< 0.1%
16715.34 1
< 0.1%
13141.0 1
< 0.1%
12839.87 1
< 0.1%
12756.79 1
< 0.1%
12391.87 1
< 0.1%
Distinct999
Distinct (%)10.0%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
Minimum2008-05-13 00:00:00
Maximum2023-10-12 00:00:00
2024-04-21T11:03:04.291324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:03:04.412539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct486
Distinct (%)4.9%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-21T11:03:04.640008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length10.453481
Min length1

Characters and Unicode

Total characters104493
Distinct characters262
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)1.4%

Sample

1st row(주)한국환경시험연구소
2nd row(주)어반환경
3rd row(주)대한환경컨설팅
4th row(사)대한산업보건협회대구산업보건센타
5th row(주)산업공해연구소
ValueCountFrequency (%)
주)대한석면환경컨설팅 210
 
2.1%
이티에스컨설팅(주 178
 
1.8%
주)누리환경기술센터 162
 
1.6%
대한석면조사기관(주 159
 
1.6%
한국석면연구원(주 158
 
1.6%
충청산업보건연구원(주) 144
 
1.4%
주)정진이엔씨 140
 
1.4%
주)에코에이엔티 139
 
1.4%
주)테크월 135
 
1.3%
푸른환경연구소주식회사 131
 
1.3%
Other values (483) 8606
84.7%
2024-04-21T11:03:05.023460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8861
 
8.5%
( 7019
 
6.7%
) 6878
 
6.6%
5131
 
4.9%
4793
 
4.6%
3608
 
3.5%
3465
 
3.3%
3138
 
3.0%
2756
 
2.6%
2571
 
2.5%
Other values (252) 56273
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88055
84.3%
Close Punctuation 7660
 
7.3%
Open Punctuation 7446
 
7.1%
Uppercase Letter 1138
 
1.1%
Space Separator 172
 
0.2%
Other Symbol 16
 
< 0.1%
Lowercase Letter 3
 
< 0.1%
Connector Punctuation 2
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8861
 
10.1%
5131
 
5.8%
4793
 
5.4%
3608
 
4.1%
3465
 
3.9%
3138
 
3.6%
2756
 
3.1%
2571
 
2.9%
2552
 
2.9%
2068
 
2.3%
Other values (229) 49112
55.8%
Uppercase Letter
ValueCountFrequency (%)
G 250
22.0%
S 204
17.9%
I 173
15.2%
A 134
11.8%
M 112
9.8%
R 107
9.4%
T 60
 
5.3%
F 53
 
4.7%
E 25
 
2.2%
H 18
 
1.6%
Other values (2) 2
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
a 1
33.3%
n 1
33.3%
d 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 7019
94.3%
427
 
5.7%
Close Punctuation
ValueCountFrequency (%)
) 6878
89.8%
782
 
10.2%
Space Separator
ValueCountFrequency (%)
172
100.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Decimal Number
ValueCountFrequency (%)
0 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88071
84.3%
Common 15281
 
14.6%
Latin 1141
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8861
 
10.1%
5131
 
5.8%
4793
 
5.4%
3608
 
4.1%
3465
 
3.9%
3138
 
3.6%
2756
 
3.1%
2571
 
2.9%
2552
 
2.9%
2068
 
2.3%
Other values (230) 49128
55.8%
Latin
ValueCountFrequency (%)
G 250
21.9%
S 204
17.9%
I 173
15.2%
A 134
11.7%
M 112
9.8%
R 107
9.4%
T 60
 
5.3%
F 53
 
4.6%
E 25
 
2.2%
H 18
 
1.6%
Other values (5) 5
 
0.4%
Common
ValueCountFrequency (%)
( 7019
45.9%
) 6878
45.0%
782
 
5.1%
427
 
2.8%
172
 
1.1%
_ 2
 
< 0.1%
0 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88055
84.3%
ASCII 15213
 
14.6%
None 1225
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8861
 
10.1%
5131
 
5.8%
4793
 
5.4%
3608
 
4.1%
3465
 
3.9%
3138
 
3.6%
2756
 
3.1%
2571
 
2.9%
2552
 
2.9%
2068
 
2.3%
Other values (229) 49112
55.8%
ASCII
ValueCountFrequency (%)
( 7019
46.1%
) 6878
45.2%
G 250
 
1.6%
S 204
 
1.3%
I 173
 
1.1%
172
 
1.1%
A 134
 
0.9%
M 112
 
0.7%
R 107
 
0.7%
T 60
 
0.4%
Other values (10) 104
 
0.7%
None
ValueCountFrequency (%)
782
63.8%
427
34.9%
16
 
1.3%

Interactions

2024-04-21T11:02:59.899608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.297969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.619116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.976635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.433500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.709586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:03:00.100600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.537295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:02:59.817191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T11:03:05.112296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
석면건축물_일련번호건축물구분-소분류연면적석면건축물 여부석면자재 면적
석면건축물_일련번호1.0000.1550.1110.0330.024
건축물구분-소분류0.1551.0000.1030.2300.025
연면적0.1110.1031.0000.0340.311
석면건축물 여부0.0330.2300.0341.0000.052
석면자재 면적0.0240.0250.3110.0521.000
2024-04-21T11:03:05.220832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축물구분-소분류석면건축물 여부
건축물구분-소분류1.0000.153
석면건축물 여부0.1531.000
2024-04-21T11:03:05.292932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
석면건축물_일련번호연면적석면자재 면적건축물구분-소분류석면건축물 여부
석면건축물_일련번호1.0000.015-0.0250.0990.033
연면적0.0151.0000.1290.0660.034
석면자재 면적-0.0250.1291.0000.0210.063
건축물구분-소분류0.0990.0660.0211.0000.153
석면건축물 여부0.0330.0340.0630.1531.000

Missing values

2024-04-21T11:03:00.237164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:03:00.379939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T11:03:00.499336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

석면건축물_일련번호도로명주소건축물명건축물_동명건축물구분-대분류건축물구분-소분류연면적석면건축물 여부석면자재 면적조사일자조사기관
1575633602충청북도 진천군 진천읍 문화로 79진천문화원<NA>공공건축물행정기관729.78미해당0.02013-06-11(주)한국환경시험연구소
2328432568대전광역시 유성구 대덕대로989번길 242 (덕진동)한전원자력연료한마음관공공건축물공공기관4466.0해당73.472013-12-18(주)어반환경
1142047231전라남도 장성군 삼계면 영장로 1628삼계하수종말처리장<NA>공공건축물행정기관777.65미해당0.02013-05-29(주)대한환경컨설팅
8870160430경상북도 청도군 화양읍 청려로 1844청도군 선거관리위원회청도군 선거관리위원회공공건축물행정기관595.54미해당0.02013-06-18(사)대한산업보건협회대구산업보건센타
2945030351서울특별시 송파구 올림픽로43길 88 (풍납동)한국전력공사 풍납변전소한국전력공사 풍납변전소공공건축물공공기관3560.02해당1428.122013-10-18(주)산업공해연구소
3057828116서울특별시 구로구 디지털로32길 55 (구로동)IBK기업은행 구로동지점<NA>공공건축물공공기관5903.66미해당33.02014-04-28(주)우리환경컨설팅
17723127579경기도 광주시 행정타운로 49 15 (송정동)수원지방법원 광주등기소수원지방법원 광주등기소공공건축물행정기관1584.0미해당0.02013-11-29주식회사에코석면환경연구원
11190827전라북도 부안군 행안면 남산길 16 21부안변전소부안변전소공공건축물지방공사/공단973.0해당189.362013-06-26(유한)와이에스산업
10607163251경상북도 김천시 공단2길 30 22 (대광동)김천시평생교육원나동(대강당)공공건축물행정기관947.95해당188.422013-08-22(주)테크월
20663143924경기도 평택시 평택로 51 (평택동)AK PLAZA평택점영화관공공건축물행정기관1.0미해당0.02014-03-11(사)대한산업안전협회
석면건축물_일련번호도로명주소건축물명건축물_동명건축물구분-대분류건축물구분-소분류연면적석면건축물 여부석면자재 면적조사일자조사기관
2968928075서울특별시 강남구 밤고개로5길 46 13 (수서동)수서차량기지수서차량기지공공건축물지방공사/공단14661.93해당7580.092009-06-30서울메트로
2650918863대구광역시 달서구 월배로5길 39 (유천동)월배차량기지변전실공공건축물지방공사/공단720.0미해당0.052013-08-12아스텍주식회사
2877315375부산광역시 부산진구 신천대로 145 (범천동)부산철도차량정비단 일반기지(부산정비창청사)부산철도차량정비단 일반기지(부산정비창청사)공공건축물공공기관3480.0해당1896.842013-12-17실내환경연구소(주)
28338537강원도 횡성군 우천면 우항3길 6횡성축협한우프라자우천점<NA>공공건축물특수법인697.19미해당23.672014-04-25미래엔텍코리아(주)
2452545228광주광역시 서구 풍금로 135 (금호동)서구청금호2동주민센터서구청금호2동주민센터공공건축물행정기관905.71미해당0.02013-04-23주식회사에코석면환경연구원
2500111117인천광역시 서구 환경로 42 (경서동)국립환경과학원파워플랜트동공공건축물행정기관52684.0미해당46.882012-11-22한국환경공단
26124205325대구광역시 군위군 경북대로 3291 0군위지사 본관동군위지사 본관동공공건축물지방공사/공단711.9미해당0.02013-07-30(주)GG석면연구소
3196722419서울특별시 중랑구 면목로 298 (면목동)면목제7동주민센터면목제7동주민센터공공건축물행정기관523.81해당227.092011-06-08(주)푸른환경산업연구소
10095161921경상북도 구미시 옥계북로 43 48(옥계동)구미옥계휴먼시아1단지<NA>공공건축물공공기관1055.0미해당0.02014-04-28(주)신성에코텍
434143623전라북도 장수군 장수읍 장천로 247보건의료원(구관동)보건의료원(구관동)공공건축물행정기관2055.17미해당0.02013-05-02푸른환경연구소주식회사