Overview

Dataset statistics

Number of variables4
Number of observations3723
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory120.1 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description안전보건공단에서 제공하는 석면해체사업장 정보로 업체명, 관할지청, 공단 지사, 소재지, 전화번호에 대한 내용을 제공합니다.
URLhttps://www.data.go.kr/data/15087439/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:29:16.789383
Analysis finished2023-12-12 01:29:17.859784
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct3723
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1862
Minimum1
Maximum3723
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.9 KiB
2023-12-12T10:29:17.950938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile187.1
Q1931.5
median1862
Q32792.5
95-th percentile3536.9
Maximum3723
Range3722
Interquartile range (IQR)1861

Descriptive statistics

Standard deviation1074.8819
Coefficient of variation (CV)0.57727275
Kurtosis-1.2
Mean1862
Median Absolute Deviation (MAD)931
Skewness0
Sum6932226
Variance1155371
MonotonicityStrictly increasing
2023-12-12T10:29:18.171865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2488 1
 
< 0.1%
2476 1
 
< 0.1%
2477 1
 
< 0.1%
2478 1
 
< 0.1%
2479 1
 
< 0.1%
2480 1
 
< 0.1%
2481 1
 
< 0.1%
2482 1
 
< 0.1%
2483 1
 
< 0.1%
Other values (3713) 3713
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3723 1
< 0.1%
3722 1
< 0.1%
3721 1
< 0.1%
3720 1
< 0.1%
3719 1
< 0.1%
3718 1
< 0.1%
3717 1
< 0.1%
3716 1
< 0.1%
3715 1
< 0.1%
3714 1
< 0.1%
Distinct3653
Distinct (%)98.1%
Missing1
Missing (%)< 0.1%
Memory size29.2 KiB
2023-12-12T10:29:18.470165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length8.0045674
Min length1

Characters and Unicode

Total characters29793
Distinct characters503
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3588 ?
Unique (%)96.4%

Sample

1st row(주)리아스테크
2nd row성도건설산업(주)
3rd row에스엠개발
4th row케이디건설산업(주)
5th row(주)옥당산업
ValueCountFrequency (%)
주식회사 25
 
0.7%
주)대영건설 3
 
0.1%
대성건설(주 3
 
0.1%
주식회사진흥건설 3
 
0.1%
금강산업 3
 
0.1%
3
 
0.1%
늘찬건설산업(유 2
 
0.1%
주)신성건설 2
 
0.1%
주)그린환경 2
 
0.1%
주식회사대현건설 2
 
0.1%
Other values (3648) 3707
98.7%
2023-12-12T10:29:18.968561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3154
 
10.6%
( 2131
 
7.2%
) 2116
 
7.1%
1557
 
5.2%
1412
 
4.7%
1287
 
4.3%
1184
 
4.0%
1082
 
3.6%
800
 
2.7%
786
 
2.6%
Other values (493) 14284
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25048
84.1%
Close Punctuation 2257
 
7.6%
Open Punctuation 2256
 
7.6%
Uppercase Letter 85
 
0.3%
Other Punctuation 48
 
0.2%
Space Separator 34
 
0.1%
Lowercase Letter 21
 
0.1%
Decimal Number 20
 
0.1%
Math Symbol 12
 
< 0.1%
Connector Punctuation 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3154
 
12.6%
1557
 
6.2%
1412
 
5.6%
1287
 
5.1%
1184
 
4.7%
1082
 
4.3%
800
 
3.2%
786
 
3.1%
651
 
2.6%
596
 
2.4%
Other values (439) 12539
50.1%
Uppercase Letter
ValueCountFrequency (%)
E 14
16.5%
N 13
15.3%
C 10
11.8%
S 8
9.4%
H 7
8.2%
T 6
7.1%
G 5
 
5.9%
D 4
 
4.7%
B 3
 
3.5%
M 3
 
3.5%
Other values (7) 12
14.1%
Lowercase Letter
ValueCountFrequency (%)
a 3
14.3%
e 3
14.3%
t 2
9.5%
c 2
9.5%
r 2
9.5%
n 2
9.5%
l 1
 
4.8%
z 1
 
4.8%
i 1
 
4.8%
v 1
 
4.8%
Other values (3) 3
14.3%
Decimal Number
ValueCountFrequency (%)
1 9
45.0%
9 3
 
15.0%
2 3
 
15.0%
5 2
 
10.0%
4 1
 
5.0%
0 1
 
5.0%
3 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
/ 33
68.8%
. 8
 
16.7%
& 3
 
6.2%
, 3
 
6.2%
1
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 2131
94.5%
115
 
5.1%
[ 10
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 2116
93.8%
131
 
5.8%
] 10
 
0.4%
Space Separator
ValueCountFrequency (%)
33
97.1%
  1
 
2.9%
Math Symbol
ValueCountFrequency (%)
> 6
50.0%
< 6
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25048
84.1%
Common 4639
 
15.6%
Latin 106
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3154
 
12.6%
1557
 
6.2%
1412
 
5.6%
1287
 
5.1%
1184
 
4.7%
1082
 
4.3%
800
 
3.2%
786
 
3.1%
651
 
2.6%
596
 
2.4%
Other values (439) 12539
50.1%
Latin
ValueCountFrequency (%)
E 14
13.2%
N 13
 
12.3%
C 10
 
9.4%
S 8
 
7.5%
H 7
 
6.6%
T 6
 
5.7%
G 5
 
4.7%
D 4
 
3.8%
B 3
 
2.8%
M 3
 
2.8%
Other values (20) 33
31.1%
Common
ValueCountFrequency (%)
( 2131
45.9%
) 2116
45.6%
131
 
2.8%
115
 
2.5%
/ 33
 
0.7%
33
 
0.7%
[ 10
 
0.2%
] 10
 
0.2%
1 9
 
0.2%
. 8
 
0.2%
Other values (14) 43
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25048
84.1%
ASCII 4497
 
15.1%
None 248
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3154
 
12.6%
1557
 
6.2%
1412
 
5.6%
1287
 
5.1%
1184
 
4.7%
1082
 
4.3%
800
 
3.2%
786
 
3.1%
651
 
2.6%
596
 
2.4%
Other values (439) 12539
50.1%
ASCII
ValueCountFrequency (%)
( 2131
47.4%
) 2116
47.1%
/ 33
 
0.7%
33
 
0.7%
E 14
 
0.3%
N 13
 
0.3%
C 10
 
0.2%
[ 10
 
0.2%
] 10
 
0.2%
1 9
 
0.2%
Other values (40) 118
 
2.6%
None
ValueCountFrequency (%)
131
52.8%
115
46.4%
  1
 
0.4%
1
 
0.4%

지정관서
Categorical

Distinct49
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
광 주 청
 
269
전 주
 
236
대 전 청
 
180
여 수
 
151
창 원
 
130
Other values (44)
2757 

Length

Max length6
Median length6
Mean length5.5135643
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울북부
2nd row서울남부
3rd row대구서부
4th row서울남부
5th row서울남부

Common Values

ValueCountFrequency (%)
광 주 청 269
 
7.2%
전 주 236
 
6.3%
대 전 청 180
 
4.8%
여 수 151
 
4.1%
창 원 130
 
3.5%
청 주 119
 
3.2%
목 포 119
 
3.2%
경 기 111
 
3.0%
포 항 109
 
2.9%
대 구 청 101
 
2.7%
Other values (39) 2198
59.0%

Length

2023-12-12T10:29:19.182596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
941
 
12.1%
847
 
10.9%
430
 
5.5%
416
 
5.3%
281
 
3.6%
278
 
3.6%
269
 
3.5%
262
 
3.4%
242
 
3.1%
228
 
2.9%
Other values (46) 3590
46.1%
Distinct3691
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
2023-12-12T10:29:19.672381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length68
Mean length34.107172
Min length1

Characters and Unicode

Total characters126981
Distinct characters604
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3662 ?
Unique (%)98.4%

Sample

1st row서울특별시 성북구 화랑로40길 12-7 (석관동) 미르빌 3층 301호(58-273번지)
2nd row경기 김포시 하성면 원산리 7번지
3rd row대구 서구 비산동 1868-2
4th row서울특별시 영등포구 도영로7길 15-2 303호 (상가A동)
5th row서울특별시 영등포구 문래북로 83 (당산동2가, 제이씨 빌딩) 4층(53-4번지 옥당빌딩 6층)
ValueCountFrequency (%)
2층 434
 
1.8%
1층 278
 
1.1%
경기 275
 
1.1%
경기도 252
 
1.0%
전북 230
 
0.9%
3층 225
 
0.9%
전남 217
 
0.9%
강원 215
 
0.9%
경남 197
 
0.8%
경북 176
 
0.7%
Other values (9438) 22219
89.9%
2023-12-12T10:29:20.756877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21946
 
17.3%
1 6012
 
4.7%
) 4706
 
3.7%
( 4706
 
3.7%
2 4392
 
3.5%
3313
 
2.6%
3 3004
 
2.4%
2821
 
2.2%
0 2782
 
2.2%
- 2622
 
2.1%
Other values (594) 70677
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 64760
51.0%
Decimal Number 27261
21.5%
Space Separator 21946
 
17.3%
Close Punctuation 4706
 
3.7%
Open Punctuation 4706
 
3.7%
Dash Punctuation 2622
 
2.1%
Other Punctuation 732
 
0.6%
Uppercase Letter 223
 
0.2%
Lowercase Letter 13
 
< 0.1%
Other Symbol 6
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3313
 
5.1%
2821
 
4.4%
2573
 
4.0%
2236
 
3.5%
1920
 
3.0%
1814
 
2.8%
1737
 
2.7%
1650
 
2.5%
1552
 
2.4%
1469
 
2.3%
Other values (544) 43675
67.4%
Uppercase Letter
ValueCountFrequency (%)
B 54
24.2%
A 28
12.6%
C 24
10.8%
K 18
 
8.1%
S 17
 
7.6%
T 16
 
7.2%
I 13
 
5.8%
E 10
 
4.5%
D 10
 
4.5%
V 7
 
3.1%
Other values (10) 26
11.7%
Decimal Number
ValueCountFrequency (%)
1 6012
22.1%
2 4392
16.1%
3 3004
11.0%
0 2782
10.2%
4 2370
 
8.7%
5 2179
 
8.0%
6 1833
 
6.7%
7 1650
 
6.1%
8 1551
 
5.7%
9 1488
 
5.5%
Lowercase Letter
ValueCountFrequency (%)
t 3
23.1%
k 2
15.4%
w 2
15.4%
i 2
15.4%
c 1
 
7.7%
a 1
 
7.7%
n 1
 
7.7%
s 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 676
92.3%
. 43
 
5.9%
· 7
 
1.0%
/ 6
 
0.8%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
21946
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4706
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4706
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2622
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 64766
51.0%
Common 61975
48.8%
Latin 240
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3313
 
5.1%
2821
 
4.4%
2573
 
4.0%
2236
 
3.5%
1920
 
3.0%
1814
 
2.8%
1737
 
2.7%
1650
 
2.5%
1552
 
2.4%
1469
 
2.3%
Other values (545) 43681
67.4%
Latin
ValueCountFrequency (%)
B 54
22.5%
A 28
11.7%
C 24
10.0%
K 18
 
7.5%
S 17
 
7.1%
T 16
 
6.7%
I 13
 
5.4%
E 10
 
4.2%
D 10
 
4.2%
V 7
 
2.9%
Other values (19) 43
17.9%
Common
ValueCountFrequency (%)
21946
35.4%
1 6012
 
9.7%
) 4706
 
7.6%
( 4706
 
7.6%
2 4392
 
7.1%
3 3004
 
4.8%
0 2782
 
4.5%
- 2622
 
4.2%
4 2370
 
3.8%
5 2179
 
3.5%
Other values (10) 7256
 
11.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 64760
51.0%
ASCII 62203
49.0%
None 13
 
< 0.1%
Number Forms 4
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21946
35.3%
1 6012
 
9.7%
) 4706
 
7.6%
( 4706
 
7.6%
2 4392
 
7.1%
3 3004
 
4.8%
0 2782
 
4.5%
- 2622
 
4.2%
4 2370
 
3.8%
5 2179
 
3.5%
Other values (36) 7484
 
12.0%
Hangul
ValueCountFrequency (%)
3313
 
5.1%
2821
 
4.4%
2573
 
4.0%
2236
 
3.5%
1920
 
3.0%
1814
 
2.8%
1737
 
2.7%
1650
 
2.5%
1552
 
2.4%
1469
 
2.3%
Other values (544) 43675
67.4%
None
ValueCountFrequency (%)
· 7
53.8%
6
46.2%
Number Forms
ValueCountFrequency (%)
4
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T10:29:17.556282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:29:20.887111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지정관서
연번1.0000.290
지정관서0.2901.000
2023-12-12T10:29:21.001698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지정관서
연번1.0000.102
지정관서0.1021.000

Missing values

2023-12-12T10:29:17.718809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:29:17.816463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업자명지정관서소재지
01(주)리아스테크서울북부서울특별시 성북구 화랑로40길 12-7 (석관동) 미르빌 3층 301호(58-273번지)
12성도건설산업(주)서울남부경기 김포시 하성면 원산리 7번지
23에스엠개발대구서부대구 서구 비산동 1868-2
34케이디건설산업(주)서울남부서울특별시 영등포구 도영로7길 15-2 303호 (상가A동)
45(주)옥당산업서울남부서울특별시 영등포구 문래북로 83 (당산동2가, 제이씨 빌딩) 4층(53-4번지 옥당빌딩 6층)
56(유)완주환경토건전 주전북 완주군 소양면 소양로 209 (736-4번지)
67(주)성진시앤디의 정 부경기도 포천시 소흘읍 호국로484번길 66-31 (소흘읍)(350-1번지)
78(주)코리아카코서울남부서울 영등포구 디지털로 334-1 (대림동)(1087-37번지)
89주식회사경부건설익 산전북 김제시 봉남면 대송로 174 ((유)전주환경/경부건설)(157-번지)
910(주)참마루건설서울남부서울 영등포구 국제금융로6길 26 한국노총 10층 (여의도동)(35번지 한국노총 10층)
연번사업자명지정관서소재지
37133714주식회사주영건설대 구 청대구광역시 북구 조야로7길 23 (조야동)(19-2번지)
37143715주식회사광성[건설업본사]성 남경기 이천시 부발읍 무촌로 103 상가동204호
37153716석면스토리(주)전 주전라북도 전주시 덕진구 만성북로 51-25 3층 3057호 (스페이스온 지식산업센터)(50-3번지)
37163717주식회사지서환경시스템포 항경북 포항시 남구 오천읍 냉천로 440 1층
37173718건일종합건설주식회사광 주 청전남 나주시 빛가람로 679 (빛가람동) 303호
37183719(주)대천부산북부부산 북구 백양대로995번길 35 (구포동) 1층
37193720마스타케미칼(주)창 원경남 창원시 마산합포구 3·15대로 455 (상남동)(162-84번지)
37203721(주)에코이엔지보 령충남 보령시 큰오랏6길 30 2층 (동대동)(549번지 2층)
37213722에이치케이안전시스템(주)천 안충청남도 아산시 음봉면 음봉로586번길 41-12 (621-1번지)
37223723주식회사선진기술사사무소인천북부인천 부평구 경인로 771 402-2호(십정동,종로빌딩)