Overview

Dataset statistics

Number of variables: 20
Number of observations: 10000
Missing cells: 13772
Missing cells (%): 6.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 MiB
Average record size in memory: 169.0 B
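
The percentage and size figures above can be cross-checked from the raw counts. A minimal sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (observations × variables) and that 1 MiB is 1024² bytes; the function names are illustrative and are not part of ydata-profiling:

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells over all cells, in percent."""
    return 100.0 * missing_cells / (n_rows * n_cols)


def total_size_mib(avg_record_bytes: float, n_rows: int) -> float:
    """Total in-memory size implied by the average record size (assumes 1 MiB = 1024**2 B)."""
    return avg_record_bytes * n_rows / (1024 ** 2)


# Values taken from the dataset statistics above.
print(f"{missing_cell_pct(13772, 10000, 20):.1f}%")
print(f"{total_size_mib(169.0, 10000):.1f} MiB")
```

Both results round to the reported values (6.9% and 1.6 MiB), so the summary figures are internally consistent.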

Variable types

Numeric: 1
Text: 11
Categorical: 8

Dataset

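Description: Environmental Information Disclosure System list of registered organizations (as of 2020-10-22; company name, company characteristics, business site name, address, detailed industry, main business, year-by-year eligibility status, etc.)
Author: 한국환경산업기술원 (Korea Environmental Industry and Technology Institute)
URL: https://www.data.go.kr/data/15072038/fileData.do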

Alerts

2017 is highly overall correlated with 번호 and 2 other fields (High correlation)
2016 is highly overall correlated with 번호 and 2 other fields (High correlation)
번호 is highly overall correlated with 2016 and 1 other fields (High correlation)
업체분야 is highly overall correlated with 업체특성 and 1 other fields (High correlation)
업체특성 is highly overall correlated with 업체분야 (High correlation)
업체구분 is highly overall correlated with 2019 (High correlation)
업종 is highly overall correlated with 업체분야 (High correlation)
2018 is highly overall correlated with 2016 and 1 other fields (High correlation)
2019 is highly overall correlated with 업체구분 (High correlation)
업체 대표명 has 1012 (10.1%) missing values (Missing)
대표전화번호 has 1612 (16.1%) missing values (Missing)
상세주소 has 1616 (16.2%) missing values (Missing)
주요업무 has 2389 (23.9%) missing values (Missing)
홈페이지 has 7039 (70.4%) missing values (Missing)
번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 15:46:27.043177
Analysis finished: 2023-12-12 15:46:32.666174
Duration: 5.62 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
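
The reported duration can be recomputed from the two timestamps. A small sketch using only the Python standard library; the helper name and the format string are assumptions chosen to match the timestamp layout shown above:

```python
from datetime import datetime

# Timestamp layout as it appears in the Reproduction section (assumed format).
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def duration_seconds(started: str, finished: str) -> float:
    """Elapsed seconds between two timestamps written in TIMESTAMP_FORMAT."""
    start = datetime.strptime(started, TIMESTAMP_FORMAT)
    end = datetime.strptime(finished, TIMESTAMP_FORMAT)
    return (end - start).total_seconds()


elapsed = duration_seconds("2023-12-12 15:46:27.043177", "2023-12-12 15:46:32.666174")
print(f"{elapsed:.2f} seconds")
```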

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9342.0639
Minimum: 1
Maximum: 18659
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 961.7
Q1: 4670.75
Median: 9267
Q3: 14038.75
95-th percentile: 17697.2
Maximum: 18659
Range: 18658
Interquartile range (IQR): 9368

Descriptive statistics

Standard deviation: 5386.3631
Coefficient of variation (CV): 0.576571
Kurtosis: -1.2111776
Mean: 9342.0639
Median Absolute Deviation (MAD): 4686
Skewness: 0.0064197719
Sum: 93420639
Variance: 29012908
Monotonicity: Not monotonic
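
Several of the statistics above are derived from others, which gives a quick consistency check. The sketch below assumes the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation², sum = mean × count); the function name is illustrative, and small differences are expected because the inputs are already rounded:

```python
def derived_statistics(minimum, maximum, q1, q3, mean, std, n):
    """Recompute derived summary statistics from the primary ones."""
    return {
        "range": maximum - minimum,
        "iqr": q3 - q1,
        "cv": std / mean,
        "variance": std ** 2,
        "sum": mean * n,
    }


# Inputs copied from the quantile and descriptive statistics of 번호.
stats = derived_statistics(
    minimum=1, maximum=18659, q1=4670.75, q3=14038.75,
    mean=9342.0639, std=5386.3631, n=10000,
)
for name, value in stats.items():
    print(name, value)
```

The recomputed range, IQR, CV, and sum match the report, and the variance agrees to within the rounding of the standard deviation.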
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
11986  1  < 0.1%
13767  1  < 0.1%
14629  1  < 0.1%
13170  1  < 0.1%
12027  1  < 0.1%
551  1  < 0.1%
693  1  < 0.1%
5813  1  < 0.1%
17816  1  < 0.1%
5189  1  < 0.1%
Other values (9990)  9990  99.9%

Value  Count  Frequency (%)
1  1  < 0.1%
2  1  < 0.1%
4  1  < 0.1%
6  1  < 0.1%
7  1  < 0.1%
8  1  < 0.1%
9  1  < 0.1%
11  1  < 0.1%
12  1  < 0.1%
14  1  < 0.1%

Value  Count  Frequency (%)
18659  1  < 0.1%
18658  1  < 0.1%
18657  1  < 0.1%
18656  1  < 0.1%
18652  1  < 0.1%
18649  1  < 0.1%
18645  1  < 0.1%
18643  1  < 0.1%
18642  1  < 0.1%
18639  1  < 0.1%

Distinct: 2534
Distinct (%): 25.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 36
Median length: 29
Mean length: 7.7352
Min length: 2

Characters and Unicode

Total characters: 77352
Distinct characters: 549
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
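
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation; the `CATEGORY_NAMES` mapping and the function name are assumptions made for readability, and scripts and blocks are left out because the standard library does not expose them:

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Po": "Other Punctuation", "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation", "Sm": "Math Symbol", "So": "Other Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# One of the sample values of this variable.
print(category_counts(["롯데쇼핑(주) 본점(본사)"]))
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the Korean text columns.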

Unique

Unique: 1609
Unique (%): 16.1%
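
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of that distinction, assuming percentages are taken over non-missing rows (which is consistent with the figures for columns that have missing values); the function name is illustrative:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique, distinct %, unique %) over non-missing values."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    distinct = len(counts)                                 # number of different values
    unique = sum(1 for c in counts.values() if c == 1)     # values seen exactly once
    n = len(present)
    distinct_pct = 100.0 * distinct / n if n else 0.0
    unique_pct = 100.0 * unique / n if n else 0.0
    return distinct, unique, distinct_pct, unique_pct


# Toy data: "a" repeats, "b" and "c" occur once, one value is missing.
print(distinct_and_unique(["a", "a", "b", "c", None]))
```

For this variable, 2534 distinct and 1609 unique values over 10000 non-missing rows give the reported 25.3% and 16.1%.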

Sample

1st row: 임실군청
2nd row: 롯데쇼핑(주) 본점(본사)
3rd row: 한국남동발전(주) 영동화력발전처
4th row: 은평구청
5th row: 진주시청

Value  Count  Frequency (%)
본사  368  2.9%
주)이마트(본사+성수점  195  1.5%
서울특별시  141  1.1%
홈플러스(주  124  1.0%
본부  110  0.9%
부산광역시  110  0.9%
주식회사  100  0.8%
롯데쇼핑(주  96  0.8%
본점(본사  95  0.8%
안동시청  90  0.7%
Other values (2720)  11155  88.6%

Most occurring characters

ValueCountFrequency (%)
4230
 
5.5%
3459
 
4.5%
3176
 
4.1%
) 2730
 
3.5%
( 2711
 
3.5%
2586
 
3.3%
2297
 
3.0%
1969
 
2.5%
1620
 
2.1%
1475
 
1.9%
Other values (539) 51099
66.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67813
87.7%
Close Punctuation 2730
 
3.5%
Open Punctuation 2711
 
3.5%
Space Separator 2586
 
3.3%
Uppercase Letter 703
 
0.9%
Decimal Number 223
 
0.3%
Math Symbol 197
 
0.3%
Other Symbol 130
 
0.2%
Lowercase Letter 125
 
0.2%
Connector Punctuation 73
 
0.1%
Other values (2) 61
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4230
 
6.2%
3459
 
5.1%
3176
 
4.7%
2297
 
3.4%
1969
 
2.9%
1620
 
2.4%
1475
 
2.2%
1407
 
2.1%
1396
 
2.1%
1230
 
1.8%
Other values (476) 45554
67.2%
Uppercase Letter
ValueCountFrequency (%)
S 164
23.3%
K 124
17.6%
C 80
11.4%
L 48
 
6.8%
P 48
 
6.8%
D 33
 
4.7%
G 31
 
4.4%
N 24
 
3.4%
I 24
 
3.4%
J 23
 
3.3%
Other values (13) 104
14.8%
Lowercase Letter
ValueCountFrequency (%)
t 34
27.2%
k 31
24.8%
e 11
 
8.8%
s 10
 
8.0%
a 8
 
6.4%
p 7
 
5.6%
u 5
 
4.0%
m 4
 
3.2%
b 3
 
2.4%
r 3
 
2.4%
Other values (7) 9
 
7.2%
Decimal Number
ValueCountFrequency (%)
1 119
53.4%
2 42
 
18.8%
5 24
 
10.8%
3 23
 
10.3%
4 7
 
3.1%
0 3
 
1.3%
6 3
 
1.3%
9 1
 
0.4%
7 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 21
40.4%
. 14
26.9%
& 14
26.9%
/ 2
 
3.8%
· 1
 
1.9%
Math Symbol
ValueCountFrequency (%)
+ 195
99.0%
2
 
1.0%
Other Symbol
ValueCountFrequency (%)
65
50.0%
65
50.0%
Close Punctuation
ValueCountFrequency (%)
) 2730
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2711
100.0%
Space Separator
ValueCountFrequency (%)
2586
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 73
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67813
87.7%
Common 8711
 
11.3%
Latin 828
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4230
 
6.2%
3459
 
5.1%
3176
 
4.7%
2297
 
3.4%
1969
 
2.9%
1620
 
2.4%
1475
 
2.2%
1407
 
2.1%
1396
 
2.1%
1230
 
1.8%
Other values (476) 45554
67.2%
Latin
ValueCountFrequency (%)
S 164
19.8%
K 124
15.0%
C 80
 
9.7%
L 48
 
5.8%
P 48
 
5.8%
t 34
 
4.1%
D 33
 
4.0%
k 31
 
3.7%
G 31
 
3.7%
N 24
 
2.9%
Other values (30) 211
25.5%
Common
ValueCountFrequency (%)
) 2730
31.3%
( 2711
31.1%
2586
29.7%
+ 195
 
2.2%
1 119
 
1.4%
_ 73
 
0.8%
65
 
0.7%
65
 
0.7%
2 42
 
0.5%
5 24
 
0.3%
Other values (13) 101
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67813
87.7%
ASCII 9406
 
12.2%
Geometric Shapes 130
 
0.2%
Arrows 2
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4230
 
6.2%
3459
 
5.1%
3176
 
4.7%
2297
 
3.4%
1969
 
2.9%
1620
 
2.4%
1475
 
2.2%
1407
 
2.1%
1396
 
2.1%
1230
 
1.8%
Other values (476) 45554
67.2%
ASCII
ValueCountFrequency (%)
) 2730
29.0%
( 2711
28.8%
2586
27.5%
+ 195
 
2.1%
S 164
 
1.7%
K 124
 
1.3%
1 119
 
1.3%
C 80
 
0.9%
_ 73
 
0.8%
L 48
 
0.5%
Other values (49) 576
 
6.1%
Geometric Shapes
ValueCountFrequency (%)
65
50.0%
65
50.0%
Arrows
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

업체분야
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
공공행정  6327
제조  1586
기타서비스  1479
기타산업  280
교육서비스  230

Length

Max length: 5
Median length: 4
Mean length: 3.8341
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공공행정
2nd row: 기타서비스
3rd row: 제조
4th row: 공공행정
5th row: 공공행정

Common Values

Value  Count  Frequency (%)
공공행정  6327  63.3%
제조  1586  15.9%
기타서비스  1479  14.8%
기타산업  280  2.8%
교육서비스  230  2.3%
보건  98  1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
공공행정  6327  63.3%
제조  1586  15.9%
기타서비스  1479  14.8%
기타산업  280  2.8%
교육서비스  230  2.3%
보건  98  1.0%

업체특성
Categorical

HIGH CORRELATION 

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
지방자치단체
4344 
배출권할당대상업체
1791 
공공기관
1446 
온실가스목표관리업체
865 
중앙행정기관
 
417
Other values (7)
1137 

Length

Max length10
Median length9
Mean length6.3794
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방자치단체
2nd row온실가스목표관리업체
3rd row녹색기업
4th row지방자치단체
5th row지방자치단체

Common Values

ValueCountFrequency (%)
지방자치단체 4344
43.4%
배출권할당대상업체 1791
17.9%
공공기관 1446
 
14.5%
온실가스목표관리업체 865
 
8.6%
중앙행정기관 417
 
4.2%
지방공단 381
 
3.8%
<NA> 241
 
2.4%
녹색기업 201
 
2.0%
지방공사 176
 
1.8%
국공립대학 115
 
1.1%
Other values (2) 23
 
0.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
지방자치단체 4344
43.4%
배출권할당대상업체 1791
17.9%
공공기관 1446
 
14.5%
온실가스목표관리업체 865
 
8.6%
중앙행정기관 417
 
4.2%
지방공단 381
 
3.8%
na 241
 
2.4%
녹색기업 201
 
2.0%
지방공사 176
 
1.8%
국공립대학 115
 
1.1%
Other values (2) 23
 
0.2%

업체구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
사업장
6946 
대표사업장
3054 

Length

Max length5
Median length3
Mean length3.6108
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장
2nd row사업장
3rd row대표사업장
4th row사업장
5th row사업장

Common Values

ValueCountFrequency (%)
사업장 6946
69.5%
대표사업장 3054
30.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
사업장 6946
69.5%
대표사업장 3054
30.5%

Distinct: 8055
Distinct (%): 80.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length36
Median length26
Mean length9.6469
Min length1

Characters and Unicode

Total characters96469
Distinct characters661
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6910 ?
Unique (%)69.1%

Sample

1st row임실군 청웅면사무소
2nd row롯데시네마
3rd row한국남동발전(주) 영동화력발전처
4th row구립응암정보도서관
5th row사봉면
ValueCountFrequency (%)
본사 116
 
0.8%
주민센터 111
 
0.8%
주)이마트 110
 
0.8%
주)이마트(본사+성수점 88
 
0.6%
홈플러스 77
 
0.5%
농업기술센터 73
 
0.5%
보건소 67
 
0.5%
주식회사 67
 
0.5%
한국주택금융공사 65
 
0.5%
롯데마트 61
 
0.4%
Other values (7794) 13276
94.1%

Most occurring characters

ValueCountFrequency (%)
4117
 
4.3%
3670
 
3.8%
3390
 
3.5%
2273
 
2.4%
) 2246
 
2.3%
( 2237
 
2.3%
2150
 
2.2%
1997
 
2.1%
1965
 
2.0%
1637
 
1.7%
Other values (651) 70787
73.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84740
87.8%
Space Separator 4117
 
4.3%
Close Punctuation 2251
 
2.3%
Open Punctuation 2242
 
2.3%
Decimal Number 1100
 
1.1%
Uppercase Letter 951
 
1.0%
Connector Punctuation 503
 
0.5%
Lowercase Letter 149
 
0.2%
Other Symbol 133
 
0.1%
Dash Punctuation 99
 
0.1%
Other values (2) 184
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3670
 
4.3%
3390
 
4.0%
2273
 
2.7%
2150
 
2.5%
1997
 
2.4%
1965
 
2.3%
1637
 
1.9%
1607
 
1.9%
1561
 
1.8%
1494
 
1.8%
Other values (579) 62996
74.3%
Uppercase Letter
ValueCountFrequency (%)
S 169
17.8%
K 129
13.6%
C 116
12.2%
L 60
 
6.3%
P 60
 
6.3%
T 59
 
6.2%
I 50
 
5.3%
D 46
 
4.8%
G 38
 
4.0%
N 35
 
3.7%
Other values (15) 189
19.9%
Lowercase Letter
ValueCountFrequency (%)
t 35
23.5%
k 32
21.5%
e 12
 
8.1%
o 11
 
7.4%
a 11
 
7.4%
p 7
 
4.7%
s 6
 
4.0%
r 5
 
3.4%
n 5
 
3.4%
l 5
 
3.4%
Other values (10) 20
13.4%
Decimal Number
ValueCountFrequency (%)
2 358
32.5%
1 346
31.5%
3 149
13.5%
4 87
 
7.9%
5 60
 
5.5%
0 45
 
4.1%
6 21
 
1.9%
7 15
 
1.4%
9 10
 
0.9%
8 9
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 34
36.2%
· 22
23.4%
& 20
21.3%
, 16
17.0%
/ 2
 
2.1%
Other Symbol
ValueCountFrequency (%)
64
48.1%
64
48.1%
5
 
3.8%
Close Punctuation
ValueCountFrequency (%)
) 2246
99.8%
] 5
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 2237
99.8%
[ 5
 
0.2%
Math Symbol
ValueCountFrequency (%)
+ 88
97.8%
2
 
2.2%
Space Separator
ValueCountFrequency (%)
4117
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 503
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 99
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 84745
87.8%
Common 10624
 
11.0%
Latin 1100
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3670
 
4.3%
3390
 
4.0%
2273
 
2.7%
2150
 
2.5%
1997
 
2.4%
1965
 
2.3%
1637
 
1.9%
1607
 
1.9%
1561
 
1.8%
1494
 
1.8%
Other values (580) 63001
74.3%
Latin
ValueCountFrequency (%)
S 169
15.4%
K 129
 
11.7%
C 116
 
10.5%
L 60
 
5.5%
P 60
 
5.5%
T 59
 
5.4%
I 50
 
4.5%
D 46
 
4.2%
G 38
 
3.5%
t 35
 
3.2%
Other values (35) 338
30.7%
Common
ValueCountFrequency (%)
4117
38.8%
) 2246
21.1%
( 2237
21.1%
_ 503
 
4.7%
2 358
 
3.4%
1 346
 
3.3%
3 149
 
1.4%
- 99
 
0.9%
+ 88
 
0.8%
4 87
 
0.8%
Other values (16) 394
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 84740
87.8%
ASCII 11572
 
12.0%
Geometric Shapes 128
 
0.1%
None 27
 
< 0.1%
Arrows 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4117
35.6%
) 2246
19.4%
( 2237
19.3%
_ 503
 
4.3%
2 358
 
3.1%
1 346
 
3.0%
S 169
 
1.5%
3 149
 
1.3%
K 129
 
1.1%
C 116
 
1.0%
Other values (57) 1202
 
10.4%
Hangul
ValueCountFrequency (%)
3670
 
4.3%
3390
 
4.0%
2273
 
2.7%
2150
 
2.5%
1997
 
2.4%
1965
 
2.3%
1637
 
1.9%
1607
 
1.9%
1561
 
1.8%
1494
 
1.8%
Other values (579) 62996
74.3%
Geometric Shapes
ValueCountFrequency (%)
64
50.0%
64
50.0%
None
ValueCountFrequency (%)
· 22
81.5%
5
 
18.5%
Arrows
ValueCountFrequency (%)
2
100.0%

업체 대표명
Text

MISSING 

Distinct: 1600
Distinct (%): 17.8%
Missing: 1012
Missing (%): 10.1%
Memory size: 156.2 KiB

Length

Max length28
Median length24
Mean length7.4165554
Min length1

Characters and Unicode

Total characters66660
Distinct characters494
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
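
The mean length reported for this column can be recovered from the total character count and the number of non-missing rows. A minimal sketch of that relationship, assuming missing values are excluded from the average (which the figures here support); the function name is illustrative:

```python
def mean_length(total_characters: int, n_rows: int, n_missing: int) -> float:
    """Average string length over non-missing rows."""
    non_missing = n_rows - n_missing
    if non_missing == 0:
        raise ValueError("no non-missing rows")
    return total_characters / non_missing


# 66660 characters over 10000 - 1012 = 8988 non-missing rows.
print(round(mean_length(66660, 10000, 1012), 7))
```

The same denominator of 8988 also explains the distinct percentage: 1600 / 8988 is about 17.8%.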

Unique

Unique752 ?
Unique (%)8.4%

Sample

1st row전라북도 임실군
2nd row롯데쇼핑(주)
3rd row한국남동발전(주)
4th row서울특별시 은평구
5th row경상남도 진주시
ValueCountFrequency (%)
경상북도 533
 
4.2%
경기도 501
 
3.9%
서울특별시 443
 
3.5%
전라남도 356
 
2.8%
전라북도 352
 
2.7%
경상남도 334
 
2.6%
강원도 230
 
1.8%
충청남도 215
 
1.7%
충청북도 212
 
1.7%
부산광역시 209
 
1.6%
Other values (1625) 9428
73.6%

Most occurring characters

ValueCountFrequency (%)
3826
 
5.7%
3578
 
5.4%
3095
 
4.6%
2795
 
4.2%
) 1986
 
3.0%
( 1985
 
3.0%
1664
 
2.5%
1402
 
2.1%
1263
 
1.9%
1262
 
1.9%
Other values (484) 43804
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58037
87.1%
Space Separator 3826
 
5.7%
Close Punctuation 1986
 
3.0%
Open Punctuation 1985
 
3.0%
Uppercase Letter 624
 
0.9%
Other Punctuation 109
 
0.2%
Dash Punctuation 54
 
0.1%
Decimal Number 29
 
< 0.1%
Lowercase Letter 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3578
 
6.2%
3095
 
5.3%
2795
 
4.8%
1664
 
2.9%
1402
 
2.4%
1263
 
2.2%
1262
 
2.2%
1240
 
2.1%
1220
 
2.1%
1149
 
2.0%
Other values (443) 39369
67.8%
Uppercase Letter
ValueCountFrequency (%)
S 133
21.3%
L 90
14.4%
G 81
13.0%
K 80
12.8%
P 45
 
7.2%
C 41
 
6.6%
T 31
 
5.0%
J 19
 
3.0%
I 18
 
2.9%
N 18
 
2.9%
Other values (12) 68
10.9%
Decimal Number
ValueCountFrequency (%)
2 13
44.8%
5 6
20.7%
1 4
 
13.8%
3 3
 
10.3%
4 2
 
6.9%
9 1
 
3.4%
Lowercase Letter
ValueCountFrequency (%)
e 4
40.0%
l 2
20.0%
t 2
20.0%
a 1
 
10.0%
d 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 92
84.4%
& 13
 
11.9%
. 3
 
2.8%
/ 1
 
0.9%
Space Separator
ValueCountFrequency (%)
3826
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1986
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1985
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58036
87.1%
Common 7989
 
12.0%
Latin 634
 
1.0%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3578
 
6.2%
3095
 
5.3%
2795
 
4.8%
1664
 
2.9%
1402
 
2.4%
1263
 
2.2%
1262
 
2.2%
1240
 
2.1%
1220
 
2.1%
1149
 
2.0%
Other values (442) 39368
67.8%
Latin
ValueCountFrequency (%)
S 133
21.0%
L 90
14.2%
G 81
12.8%
K 80
12.6%
P 45
 
7.1%
C 41
 
6.5%
T 31
 
4.9%
J 19
 
3.0%
I 18
 
2.8%
N 18
 
2.8%
Other values (17) 78
12.3%
Common
ValueCountFrequency (%)
3826
47.9%
) 1986
24.9%
( 1985
24.8%
, 92
 
1.2%
- 54
 
0.7%
& 13
 
0.2%
2 13
 
0.2%
5 6
 
0.1%
1 4
 
0.1%
3 3
 
< 0.1%
Other values (4) 7
 
0.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58036
87.1%
ASCII 8623
 
12.9%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3826
44.4%
) 1986
23.0%
( 1985
23.0%
S 133
 
1.5%
, 92
 
1.1%
L 90
 
1.0%
G 81
 
0.9%
K 80
 
0.9%
- 54
 
0.6%
P 45
 
0.5%
Other values (31) 251
 
2.9%
Hangul
ValueCountFrequency (%)
3578
 
6.2%
3095
 
5.3%
2795
 
4.8%
1664
 
2.9%
1402
 
2.4%
1263
 
2.2%
1262
 
2.2%
1240
 
2.1%
1220
 
2.1%
1149
 
2.0%
Other values (442) 39368
67.8%
CJK
ValueCountFrequency (%)
1
100.0%

Distinct: 5274
Distinct (%): 53.3%
Missing: 98
Missing (%): 1.0%
Memory size: 156.2 KiB

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters118824
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3894 ?
Unique (%)39.3%

Sample

1st row407-83-02560
2nd row215-85-13462
3rd row226-85-22656
4th row110-82-13924
5th row613-83-01415
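
All five sample rows share a fixed 12-character layout of three digits, two digits, and five digits separated by hyphens, which matches the constant length statistics above. The sketch below checks that layout and the implied total character count; the pattern is inferred from the samples only, and the names are hypothetical:

```python
import re

# Layout inferred from the sample rows: NNN-NN-NNNNN (an assumption, not a documented rule).
ID_PATTERN = re.compile(r"[0-9]{3}-[0-9]{2}-[0-9]{5}")


def matches_id_format(value: str) -> bool:
    """True if the whole string follows the NNN-NN-NNNNN layout."""
    return ID_PATTERN.fullmatch(value) is not None


def expected_total_characters(n_rows: int, n_missing: int, length: int) -> int:
    """Total characters if every non-missing value has the same length."""
    return (n_rows - n_missing) * length


samples = ["407-83-02560", "215-85-13462", "226-85-22656", "110-82-13924", "613-83-01415"]
print(all(matches_id_format(s) for s in samples))
print(expected_total_characters(10000, 98, 12))
```

With 98 missing values, 9902 rows of 12 characters give 118824 characters, the total reported for this variable.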
ValueCountFrequency (%)
206-86-50913 198
 
2.0%
220-81-60348 124
 
1.3%
125-83-01960 81
 
0.8%
104-81-86269 65
 
0.7%
120-82-00052 56
 
0.6%
418-83-00034 54
 
0.5%
505-83-00022 52
 
0.5%
512-83-00068 52
 
0.5%
314-81-11803 49
 
0.5%
417-83-00216 46
 
0.5%
Other values (5264) 9125
92.2%

Most occurring characters

ValueCountFrequency (%)
- 19804
16.7%
0 19318
16.3%
1 14792
12.4%
8 14003
11.8%
3 12083
10.2%
2 11164
9.4%
6 6590
 
5.5%
5 6477
 
5.5%
4 6414
 
5.4%
7 4145
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99020
83.3%
Dash Punctuation 19804
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 19318
19.5%
1 14792
14.9%
8 14003
14.1%
3 12083
12.2%
2 11164
11.3%
6 6590
 
6.7%
5 6477
 
6.5%
4 6414
 
6.5%
7 4145
 
4.2%
9 4034
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 19804
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 118824
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 19804
16.7%
0 19318
16.3%
1 14792
12.4%
8 14003
11.8%
3 12083
10.2%
2 11164
9.4%
6 6590
 
5.5%
5 6477
 
5.5%
4 6414
 
5.4%
7 4145
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 118824
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 19804
16.7%
0 19318
16.3%
1 14792
12.4%
8 14003
11.8%
3 12083
10.2%
2 11164
9.4%
6 6590
 
5.5%
5 6477
 
5.5%
4 6414
 
5.4%
7 4145
 
3.5%

Distinct: 2295
Distinct (%): 23.0%
Missing: 1
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length19
Median length3
Mean length3.6694669
Min length2

Characters and Unicode

Total characters36691
Distinct characters393
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1358 ?
Unique (%)13.6%

Sample

1st row임실군수
2nd row강희태
3rd row영동화력발전처장
4th row김미경
5th row조규일
ValueCountFrequency (%)
강희석 195
 
1.8%
임일순 173
 
1.6%
강희태 97
 
0.9%
권영세 89
 
0.8%
평택시장 83
 
0.8%
대표이사 71
 
0.7%
현성철 69
 
0.6%
이정환 65
 
0.6%
김진숙 62
 
0.6%
이환주 59
 
0.6%
Other values (2394) 9658
90.9%

Most occurring characters

ValueCountFrequency (%)
1571
 
4.3%
1561
 
4.3%
1395
 
3.8%
1115
 
3.0%
1007
 
2.7%
790
 
2.2%
734
 
2.0%
724
 
2.0%
654
 
1.8%
644
 
1.8%
Other values (383) 26496
72.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35372
96.4%
Space Separator 641
 
1.7%
Other Punctuation 333
 
0.9%
Uppercase Letter 156
 
0.4%
Close Punctuation 70
 
0.2%
Open Punctuation 70
 
0.2%
Decimal Number 46
 
0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1571
 
4.4%
1561
 
4.4%
1395
 
3.9%
1115
 
3.2%
1007
 
2.8%
790
 
2.2%
734
 
2.1%
724
 
2.0%
654
 
1.8%
644
 
1.8%
Other values (354) 25177
71.2%
Uppercase Letter
ValueCountFrequency (%)
N 17
10.9%
G 16
 
10.3%
L 14
 
9.0%
I 13
 
8.3%
U 11
 
7.1%
K 10
 
6.4%
Y 9
 
5.8%
E 9
 
5.8%
A 9
 
5.8%
O 8
 
5.1%
Other values (9) 40
25.6%
Other Punctuation
ValueCountFrequency (%)
, 331
99.4%
/ 1
 
0.3%
. 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 16
34.8%
2 16
34.8%
3 14
30.4%
Space Separator
ValueCountFrequency (%)
641
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35361
96.4%
Common 1163
 
3.2%
Latin 156
 
0.4%
Han 11
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1571
 
4.4%
1561
 
4.4%
1395
 
3.9%
1115
 
3.2%
1007
 
2.8%
790
 
2.2%
734
 
2.1%
724
 
2.0%
654
 
1.8%
644
 
1.8%
Other values (353) 25166
71.2%
Latin
ValueCountFrequency (%)
N 17
10.9%
G 16
 
10.3%
L 14
 
9.0%
I 13
 
8.3%
U 11
 
7.1%
K 10
 
6.4%
Y 9
 
5.8%
E 9
 
5.8%
A 9
 
5.8%
O 8
 
5.1%
Other values (9) 40
25.6%
Common
ValueCountFrequency (%)
641
55.1%
, 331
28.5%
) 70
 
6.0%
( 70
 
6.0%
1 16
 
1.4%
2 16
 
1.4%
3 14
 
1.2%
- 3
 
0.3%
/ 1
 
0.1%
. 1
 
0.1%
Han
ValueCountFrequency (%)
11
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35361
96.4%
ASCII 1319
 
3.6%
CJK 11
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1571
 
4.4%
1561
 
4.4%
1395
 
3.9%
1115
 
3.2%
1007
 
2.8%
790
 
2.2%
734
 
2.1%
724
 
2.0%
654
 
1.8%
644
 
1.8%
Other values (353) 25166
71.2%
ASCII
ValueCountFrequency (%)
641
48.6%
, 331
25.1%
) 70
 
5.3%
( 70
 
5.3%
N 17
 
1.3%
G 16
 
1.2%
1 16
 
1.2%
2 16
 
1.2%
3 14
 
1.1%
L 14
 
1.1%
Other values (19) 114
 
8.6%
CJK
ValueCountFrequency (%)
11
100.0%

대표전화번호
Text

MISSING 

Distinct: 6214
Distinct (%): 74.1%
Missing: 1612
Missing (%): 16.1%
Memory size: 156.2 KiB

Length

Max length13
Median length12
Mean length11.959704
Min length8

Characters and Unicode

Total characters100318
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5271 ?
Unique (%)62.8%

Sample

1st row033-640-3329
2nd row054-534-3501
3rd row054-977-8773
4th row054-280-6354
5th row033-258-9333
ValueCountFrequency (%)
02-3459-1668 163
 
1.9%
02-380-9209 88
 
1.0%
031-8024-3732 54
 
0.6%
02-2145-8101 50
 
0.6%
054-779-6364 38
 
0.5%
02-380-0680 34
 
0.4%
02-879-6254 31
 
0.4%
02-6100-3689 30
 
0.4%
031-940-4453 30
 
0.4%
051-607-4391 27
 
0.3%
Other values (6204) 7843
93.5%

Most occurring characters

ValueCountFrequency (%)
0 17977
17.9%
- 16766
16.7%
2 10110
10.1%
3 9968
9.9%
5 8624
8.6%
1 8254
8.2%
4 7323
7.3%
6 7081
 
7.1%
8 5014
 
5.0%
7 4905
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 83552
83.3%
Dash Punctuation 16766
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 17977
21.5%
2 10110
12.1%
3 9968
11.9%
5 8624
10.3%
1 8254
9.9%
4 7323
8.8%
6 7081
 
8.5%
8 5014
 
6.0%
7 4905
 
5.9%
9 4296
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 16766
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100318
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 17977
17.9%
- 16766
16.7%
2 10110
10.1%
3 9968
9.9%
5 8624
8.6%
1 8254
8.2%
4 7323
7.3%
6 7081
 
7.1%
8 5014
 
5.0%
7 4905
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100318
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17977
17.9%
- 16766
16.7%
2 10110
10.1%
3 9968
9.9%
5 8624
8.6%
1 8254
8.2%
4 7323
7.3%
6 7081
 
7.1%
8 5014
 
5.0%
7 4905
 
4.9%

주소
Text

Distinct: 7437
Distinct (%): 74.4%
Missing: 5
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length56
Median length36
Mean length19.153677
Min length6

Characters and Unicode

Total characters191441
Distinct characters551
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5996 ?
Unique (%)60.0%

Sample

1st row전라북도 임실군 청웅면 청웅로 153
2nd row서울특별시 송파구 올림픽로 269
3rd row강원 강릉시 강동면 안인리
4th row서울시 은평구 가좌로7길 15(응암동 730-3)
5th row경상남도 진주시 사봉면 사군로 105
ValueCountFrequency (%)
서울특별시 1575
 
3.5%
경기도 1427
 
3.2%
경상북도 846
 
1.9%
경상남도 703
 
1.6%
전라북도 613
 
1.4%
전라남도 601
 
1.3%
부산광역시 514
 
1.1%
충청남도 491
 
1.1%
강원도 479
 
1.1%
충청북도 457
 
1.0%
Other values (8885) 37221
82.8%

Most occurring characters

ValueCountFrequency (%)
34964
 
18.3%
8711
 
4.6%
8171
 
4.3%
1 6310
 
3.3%
6165
 
3.2%
5695
 
3.0%
2 4176
 
2.2%
3 3433
 
1.8%
3403
 
1.8%
3262
 
1.7%
Other values (541) 107151
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 124127
64.8%
Space Separator 34964
 
18.3%
Decimal Number 30751
 
16.1%
Dash Punctuation 1253
 
0.7%
Open Punctuation 143
 
0.1%
Close Punctuation 143
 
0.1%
Uppercase Letter 36
 
< 0.1%
Other Punctuation 22
 
< 0.1%
Letter Number 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8711
 
7.0%
8171
 
6.6%
6165
 
5.0%
5695
 
4.6%
3403
 
2.7%
3262
 
2.6%
3131
 
2.5%
3107
 
2.5%
3084
 
2.5%
2845
 
2.3%
Other values (507) 76553
61.7%
Uppercase Letter
ValueCountFrequency (%)
L 6
16.7%
C 5
13.9%
G 5
13.9%
A 4
11.1%
E 3
8.3%
P 3
8.3%
K 2
 
5.6%
D 2
 
5.6%
T 1
 
2.8%
Y 1
 
2.8%
Other values (4) 4
11.1%
Decimal Number
ValueCountFrequency (%)
1 6310
20.5%
2 4176
13.6%
3 3433
11.2%
5 2935
9.5%
4 2512
 
8.2%
7 2501
 
8.1%
0 2488
 
8.1%
6 2413
 
7.8%
8 2062
 
6.7%
9 1921
 
6.2%
Other Punctuation
ValueCountFrequency (%)
· 9
40.9%
, 7
31.8%
. 5
22.7%
' 1
 
4.5%
Space Separator
ValueCountFrequency (%)
34964
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1253
100.0%
Open Punctuation
ValueCountFrequency (%)
( 143
100.0%
Close Punctuation
ValueCountFrequency (%)
) 143
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Lowercase Letter
ValueCountFrequency (%)
l 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 124127
64.8%
Common 67276
35.1%
Latin 38
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8711
 
7.0%
8171
 
6.6%
6165
 
5.0%
5695
 
4.6%
3403
 
2.7%
3262
 
2.6%
3131
 
2.5%
3107
 
2.5%
3084
 
2.5%
2845
 
2.3%
Other values (507) 76553
61.7%
Common
ValueCountFrequency (%)
34964
52.0%
1 6310
 
9.4%
2 4176
 
6.2%
3 3433
 
5.1%
5 2935
 
4.4%
4 2512
 
3.7%
7 2501
 
3.7%
0 2488
 
3.7%
6 2413
 
3.6%
8 2062
 
3.1%
Other values (8) 3482
 
5.2%
Latin
ValueCountFrequency (%)
L 6
15.8%
C 5
13.2%
G 5
13.2%
A 4
10.5%
E 3
7.9%
P 3
7.9%
K 2
 
5.3%
D 2
 
5.3%
T 1
 
2.6%
Y 1
 
2.6%
Other values (6) 6
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 124127
64.8%
ASCII 67304
35.2%
None 9
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34964
51.9%
1 6310
 
9.4%
2 4176
 
6.2%
3 3433
 
5.1%
5 2935
 
4.4%
4 2512
 
3.7%
7 2501
 
3.7%
0 2488
 
3.7%
6 2413
 
3.6%
8 2062
 
3.1%
Other values (22) 3510
 
5.2%
Hangul
ValueCountFrequency (%)
8711
 
7.0%
8171
 
6.6%
6165
 
5.0%
5695
 
4.6%
3403
 
2.7%
3262
 
2.6%
3131
 
2.5%
3107
 
2.5%
3084
 
2.5%
2845
 
2.3%
Other values (507) 76553
61.7%
None
ValueCountFrequency (%)
· 9
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

상세주소
Text

MISSING 

Distinct: 6287
Distinct (%): 75.0%
Missing: 1616
Missing (%): 16.2%
Memory size: 156.2 KiB

Length

Max length43
Median length36
Mean length8.7062261
Min length1

Characters and Unicode

Total characters72993
Distinct characters661
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5320 ?
Unique (%)63.5%

Sample

1st row롯데캐슬골드 4층 롯데시네마
2nd row200 영동화력발전처
3rd row대한적십자사 상주적십자병원 총무팀
4th row국립칠곡숲체원
5th row243
ValueCountFrequency (%)
이마트 279
 
2.1%
본사 158
 
1.2%
홈플러스 139
 
1.1%
주민센터 125
 
1.0%
성수점 88
 
0.7%
88
 
0.7%
2층 86
 
0.7%
3층 79
 
0.6%
4층 66
 
0.5%
64
 
0.5%
Other values (7050) 11808
91.0%

Most occurring characters

ValueCountFrequency (%)
4606
 
6.3%
1873
 
2.6%
1851
 
2.5%
1770
 
2.4%
1 1313
 
1.8%
1242
 
1.7%
1233
 
1.7%
) 1158
 
1.6%
( 1153
 
1.6%
1143
 
1.6%
Other values (651) 55651
76.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58473
80.1%
Decimal Number 5797
 
7.9%
Space Separator 4606
 
6.3%
Close Punctuation 1159
 
1.6%
Open Punctuation 1155
 
1.6%
Uppercase Letter 990
 
1.4%
Dash Punctuation 373
 
0.5%
Other Punctuation 266
 
0.4%
Lowercase Letter 149
 
0.2%
Math Symbol 19
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1873
 
3.2%
1851
 
3.2%
1770
 
3.0%
1242
 
2.1%
1233
 
2.1%
1143
 
2.0%
1135
 
1.9%
1120
 
1.9%
1067
 
1.8%
1056
 
1.8%
Other values (576) 44983
76.9%
Uppercase Letter
ValueCountFrequency (%)
S 143
14.4%
L 102
10.3%
K 100
10.1%
C 95
9.6%
G 86
 
8.7%
T 72
 
7.3%
I 51
 
5.2%
D 49
 
4.9%
E 35
 
3.5%
B 33
 
3.3%
Other values (15) 224
22.6%
Lowercase Letter
ValueCountFrequency (%)
a 19
12.8%
t 18
12.1%
n 16
10.7%
k 15
10.1%
e 10
 
6.7%
c 8
 
5.4%
s 8
 
5.4%
o 7
 
4.7%
i 6
 
4.0%
r 6
 
4.0%
Other values (14) 36
24.2%
Decimal Number
ValueCountFrequency (%)
1 1313
22.6%
2 907
15.6%
3 674
11.6%
4 516
 
8.9%
5 492
 
8.5%
0 475
 
8.2%
6 455
 
7.8%
7 380
 
6.6%
8 303
 
5.2%
9 282
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 154
57.9%
. 68
25.6%
· 19
 
7.1%
& 15
 
5.6%
/ 9
 
3.4%
1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 1158
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1153
99.8%
[ 2
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 18
94.7%
1
 
5.3%
Space Separator
ValueCountFrequency (%)
4606
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 373
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58467
80.1%
Common 13380
 
18.3%
Latin 1139
 
1.6%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1873
 
3.2%
1851
 
3.2%
1770
 
3.0%
1242
 
2.1%
1233
 
2.1%
1143
 
2.0%
1135
 
1.9%
1120
 
1.9%
1067
 
1.8%
1056
 
1.8%
Other values (575) 44977
76.9%
Latin
ValueCountFrequency (%)
S 143
 
12.6%
L 102
 
9.0%
K 100
 
8.8%
C 95
 
8.3%
G 86
 
7.6%
T 72
 
6.3%
I 51
 
4.5%
D 49
 
4.3%
E 35
 
3.1%
B 33
 
2.9%
Other values (39) 373
32.7%
Common
ValueCountFrequency (%)
4606
34.4%
1 1313
 
9.8%
) 1158
 
8.7%
( 1153
 
8.6%
2 907
 
6.8%
3 674
 
5.0%
4 516
 
3.9%
5 492
 
3.7%
0 475
 
3.6%
6 455
 
3.4%
Other values (15) 1631
 
12.2%
Han
ValueCountFrequency (%)
5
71.4%
2
 
28.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58461
80.1%
ASCII 14498
 
19.9%
None 21
 
< 0.1%
CJK 7
 
< 0.1%
Compat Jamo 5
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4606
31.8%
1 1313
 
9.1%
) 1158
 
8.0%
( 1153
 
8.0%
2 907
 
6.3%
3 674
 
4.6%
4 516
 
3.6%
5 492
 
3.4%
0 475
 
3.3%
6 455
 
3.1%
Other values (61) 2749
19.0%
Hangul
ValueCountFrequency (%)
1873
 
3.2%
1851
 
3.2%
1770
 
3.0%
1242
 
2.1%
1233
 
2.1%
1143
 
2.0%
1135
 
1.9%
1120
 
1.9%
1067
 
1.8%
1056
 
1.8%
Other values (571) 44971
76.9%
None
ValueCountFrequency (%)
· 19
90.5%
1
 
4.8%
1
 
4.8%
CJK
ValueCountFrequency (%)
5
71.4%
2
 
28.6%
Compat Jamo
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

업종
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

공공행정, 국방 및 사회보장 행정: 4930
제조업: 1486
도매 및 소매업: 566
보건업 및 사회복지 서비스업: 342
전기, 가스, 증기 및 수도사업: 335
Other values (15): 2341

Length

Max length: 24
Median length: 18
Mean length: 13.9956
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공공행정, 국방 및 사회보장 행정
2nd row: 도매 및 소매업
3rd row: 전기, 가스, 증기 및 수도사업
4th row: 예술, 스포츠 및 여가관련 서비스업
5th row: 공공행정, 국방 및 사회보장 행정

Common Values

Value | Count | Frequency (%)
공공행정, 국방 및 사회보장 행정 | 4930 | 49.3%
제조업 | 1486 | 14.9%
도매 및 소매업 | 566 | 5.7%
보건업 및 사회복지 서비스업 | 342 | 3.4%
전기, 가스, 증기 및 수도사업 | 335 | 3.4%
사업시설관리 및 사업지원 서비스업 | 322 | 3.2%
교육 서비스업 | 307 | 3.1%
금융 및 보험업 | 280 | 2.8%
예술, 스포츠 및 여가관련 서비스업 | 259 | 2.6%
전문, 과학 및 기술 서비스업 | 248 | 2.5%
Other values (10) | 925 | 9.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
7980
20.3%
공공행정 4930
12.6%
행정 4930
12.6%
국방 4930
12.6%
사회보장 4930
12.6%
서비스업 1489
 
3.8%
제조업 1486
 
3.8%
도매 566
 
1.4%
소매업 566
 
1.4%
보건업 342
 
0.9%
Other values (40) 7127
18.1%

세부업종
Distinct: 72
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 28
Median length: 18
Mean length: 14.7266
Min length: 2

Characters and Unicode

Total characters: 147266
Distinct characters: 164
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6
Unique (%): 0.1%

Sample

1st row: 공공행정, 국방 및 사회보장 행정
2nd row: 소매업; 자동차 제외
3rd row: 전기, 가스, 증기 및 공기조절 공급업
4th row: 창작, 예술 및 여가관련 서비스업
5th row: 공공행정, 국방 및 사회보장 행정
ValueCountFrequency (%)
6938
17.1%
공공행정 4930
12.1%
행정 4930
12.1%
국방 4930
12.1%
사회보장 4930
12.1%
제조업 1201
 
3.0%
서비스업 1117
 
2.7%
제외 858
 
2.1%
자동차 614
 
1.5%
소매업 526
 
1.3%
Other values (142) 9709
23.9%

Most occurring characters

ValueCountFrequency (%)
30683
20.8%
10385
 
7.1%
9914
 
6.7%
9860
 
6.7%
6938
 
4.7%
, 6335
 
4.3%
5462
 
3.7%
5359
 
3.6%
5246
 
3.6%
5174
 
3.5%
Other values (154) 51910
35.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 109175
74.1%
Space Separator 30683
 
20.8%
Other Punctuation 7255
 
4.9%
Decimal Number 153
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10385
 
9.5%
9914
 
9.1%
9860
 
9.0%
6938
 
6.4%
5462
 
5.0%
5359
 
4.9%
5246
 
4.8%
5174
 
4.7%
5015
 
4.6%
4976
 
4.6%
Other values (149) 40846
37.4%
Other Punctuation
ValueCountFrequency (%)
, 6335
87.3%
; 918
 
12.7%
· 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
30683
100.0%
Decimal Number
ValueCountFrequency (%)
1 153
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 109175
74.1%
Common 38091
 
25.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10385
 
9.5%
9914
 
9.1%
9860
 
9.0%
6938
 
6.4%
5462
 
5.0%
5359
 
4.9%
5246
 
4.8%
5174
 
4.7%
5015
 
4.6%
4976
 
4.6%
Other values (149) 40846
37.4%
Common
ValueCountFrequency (%)
30683
80.6%
, 6335
 
16.6%
; 918
 
2.4%
1 153
 
0.4%
· 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 109175
74.1%
ASCII 38089
 
25.9%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30683
80.6%
, 6335
 
16.6%
; 918
 
2.4%
1 153
 
0.4%
Hangul
ValueCountFrequency (%)
10385
 
9.5%
9914
 
9.1%
9860
 
9.0%
6938
 
6.4%
5462
 
5.0%
5359
 
4.9%
5246
 
4.8%
5174
 
4.7%
5015
 
4.6%
4976
 
4.6%
Other values (149) 40846
37.4%
None
ValueCountFrequency (%)
· 2
100.0%

주요업무
Text

MISSING 

Distinct: 2960
Distinct (%): 38.9%
Missing: 2389
Missing (%): 23.9%
Memory size: 156.2 KiB

Length

Max length: 50
Median length: 46
Mean length: 7.5464459
Min length: 1

Characters and Unicode

Total characters: 57436
Distinct characters: 617
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2224
Unique (%): 29.2%

Sample

1st row: -
2nd row: 의료
3rd row: 산림복지서비스
4th row: 컬러강판 제조업
5th row: 의료서비스업
ValueCountFrequency (%)
808
 
5.6%
636
 
4.4%
공공행정 603
 
4.2%
459
 
3.2%
운영 339
 
2.4%
제조 313
 
2.2%
판매 234
 
1.6%
관리 215
 
1.5%
업무 182
 
1.3%
평생교육 177
 
1.2%
Other values (3493) 10437
72.5%

Most occurring characters

ValueCountFrequency (%)
6801
 
11.8%
2339
 
4.1%
, 1557
 
2.7%
1482
 
2.6%
1457
 
2.5%
1299
 
2.3%
1293
 
2.3%
1108
 
1.9%
928
 
1.6%
903
 
1.6%
Other values (607) 38269
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47133
82.1%
Space Separator 6801
 
11.8%
Other Punctuation 1803
 
3.1%
Dash Punctuation 733
 
1.3%
Uppercase Letter 416
 
0.7%
Lowercase Letter 174
 
0.3%
Open Punctuation 131
 
0.2%
Close Punctuation 130
 
0.2%
Decimal Number 109
 
0.2%
Connector Punctuation 3
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2339
 
5.0%
1482
 
3.1%
1457
 
3.1%
1299
 
2.8%
1293
 
2.7%
1108
 
2.4%
928
 
2.0%
903
 
1.9%
859
 
1.8%
789
 
1.7%
Other values (537) 34676
73.6%
Lowercase Letter
ValueCountFrequency (%)
e 21
12.1%
l 16
 
9.2%
i 14
 
8.0%
o 14
 
8.0%
d 12
 
6.9%
a 12
 
6.9%
r 12
 
6.9%
t 11
 
6.3%
s 10
 
5.7%
c 8
 
4.6%
Other values (13) 44
25.3%
Uppercase Letter
ValueCountFrequency (%)
C 76
18.3%
P 49
11.8%
D 40
9.6%
L 31
 
7.5%
A 23
 
5.5%
S 23
 
5.5%
E 22
 
5.3%
H 20
 
4.8%
R 19
 
4.6%
B 18
 
4.3%
Other values (12) 95
22.8%
Decimal Number
ValueCountFrequency (%)
1 28
25.7%
8 25
22.9%
2 17
15.6%
4 9
 
8.3%
0 9
 
8.3%
6 7
 
6.4%
3 5
 
4.6%
7 3
 
2.8%
5 3
 
2.8%
9 3
 
2.8%
Other Punctuation
ValueCountFrequency (%)
, 1557
86.4%
. 125
 
6.9%
/ 74
 
4.1%
* 17
 
0.9%
· 16
 
0.9%
& 9
 
0.5%
: 4
 
0.2%
' 1
 
0.1%
Space Separator
ValueCountFrequency (%)
6801
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 733
100.0%
Open Punctuation
ValueCountFrequency (%)
( 131
100.0%
Close Punctuation
ValueCountFrequency (%)
) 130
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Math Symbol
ValueCountFrequency (%)
= 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47133
82.1%
Common 9713
 
16.9%
Latin 590
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2339
 
5.0%
1482
 
3.1%
1457
 
3.1%
1299
 
2.8%
1293
 
2.7%
1108
 
2.4%
928
 
2.0%
903
 
1.9%
859
 
1.8%
789
 
1.7%
Other values (537) 34676
73.6%
Latin
ValueCountFrequency (%)
C 76
 
12.9%
P 49
 
8.3%
D 40
 
6.8%
L 31
 
5.3%
A 23
 
3.9%
S 23
 
3.9%
E 22
 
3.7%
e 21
 
3.6%
H 20
 
3.4%
R 19
 
3.2%
Other values (35) 266
45.1%
Common
ValueCountFrequency (%)
6801
70.0%
, 1557
 
16.0%
- 733
 
7.5%
( 131
 
1.3%
) 130
 
1.3%
. 125
 
1.3%
/ 74
 
0.8%
1 28
 
0.3%
8 25
 
0.3%
* 17
 
0.2%
Other values (15) 92
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47122
82.0%
ASCII 10286
 
17.9%
None 16
 
< 0.1%
Compat Jamo 11
 
< 0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6801
66.1%
, 1557
 
15.1%
- 733
 
7.1%
( 131
 
1.3%
) 130
 
1.3%
. 125
 
1.2%
C 76
 
0.7%
/ 74
 
0.7%
P 49
 
0.5%
D 40
 
0.4%
Other values (58) 570
 
5.5%
Hangul
ValueCountFrequency (%)
2339
 
5.0%
1482
 
3.1%
1457
 
3.1%
1299
 
2.8%
1293
 
2.7%
1108
 
2.4%
928
 
2.0%
903
 
1.9%
859
 
1.8%
789
 
1.7%
Other values (533) 34665
73.6%
None
ValueCountFrequency (%)
· 16
100.0%
Compat Jamo
ValueCountFrequency (%)
8
72.7%
1
 
9.1%
1
 
9.1%
1
 
9.1%
CJK Compat
ValueCountFrequency (%)
1
100.0%

홈페이지
Text

MISSING 

Distinct: 1548
Distinct (%): 52.3%
Missing: 7039
Missing (%): 70.4%
Memory size: 156.2 KiB

Length

Max length: 98
Median length: 73
Mean length: 22.390746
Min length: 1

Characters and Unicode

Total characters: 66299
Distinct characters: 198
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
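
As a rough illustration of how such per-category counts can be derived, the sketch below classifies each character of one sample value by its Unicode general category using Python's standard `unicodedata` module. The helper name `category_counts` and the `CATEGORY_NAMES` lookup are illustrative assumptions, not part of the profiling tool; only the category codes and their long names come from the Unicode Standard.

```python
import unicodedata
from collections import Counter

# Long names for a few Unicode general-category codes (illustrative subset).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}

def category_counts(text):
    """Count the characters of `text` by Unicode general-category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample values listed for this variable.
counts = category_counts("http://www.dgs.go.kr")
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

For this value the letters fall under Lowercase Letter, while `.`, `/` and `:` all fall under Other Punctuation, which matches how those characters are grouped in the tables of this section. Hangul syllables such as `대` are classified as Other Letter.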

Unique

Unique: 1128
Unique (%): 38.1%

Sample

1st row: www.rch.or.kr
2nd row: www.poscocnc.com
3rd row: www.knuh.or.kr
4th row: http://seobu-market.gwangju.go.kr
5th row: http://www.dgs.go.kr
ValueCountFrequency (%)
http://corporate.homeplus.co.kr 173
 
5.7%
www.hf.go.kr 52
 
1.7%
www.snu.ac.kr 33
 
1.1%
http://www.bsnamgu.go.kr 26
 
0.9%
http://seogu.gwangju.kr 26
 
0.9%
http://yeosu.go.kr 23
 
0.8%
http://www.wonju.go.kr 23
 
0.8%
www.kwater.or.kr 20
 
0.7%
18
 
0.6%
http://www.goryeong.go.kr 17
 
0.6%
Other values (1548) 2601
86.4%

Most occurring characters

ValueCountFrequency (%)
. 8168
 
12.3%
w 7079
 
10.7%
o 4887
 
7.4%
/ 4536
 
6.8%
t 4355
 
6.6%
r 4083
 
6.2%
k 3201
 
4.8%
h 2697
 
4.1%
p 2647
 
4.0%
c 2594
 
3.9%
Other values (188) 22052
33.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 50478
76.1%
Other Punctuation 14417
 
21.7%
Decimal Number 608
 
0.9%
Other Letter 304
 
0.5%
Uppercase Letter 156
 
0.2%
Dash Punctuation 127
 
0.2%
Connector Punctuation 94
 
0.1%
Math Symbol 62
 
0.1%
Space Separator 51
 
0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
4.6%
11
 
3.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
Other values (115) 227
74.7%
Lowercase Letter
ValueCountFrequency (%)
w 7079
14.0%
o 4887
 
9.7%
t 4355
 
8.6%
r 4083
 
8.1%
k 3201
 
6.3%
h 2697
 
5.3%
p 2647
 
5.2%
c 2594
 
5.1%
n 2491
 
4.9%
g 2368
 
4.7%
Other values (16) 14076
27.9%
Uppercase Letter
ValueCountFrequency (%)
A 20
12.8%
I 17
10.9%
W 15
9.6%
K 12
 
7.7%
C 11
 
7.1%
S 11
 
7.1%
M 10
 
6.4%
R 9
 
5.8%
D 7
 
4.5%
H 7
 
4.5%
Other values (11) 37
23.7%
Other Punctuation
ValueCountFrequency (%)
. 8168
56.7%
/ 4536
31.5%
: 1632
 
11.3%
? 46
 
0.3%
& 16
 
0.1%
@ 8
 
0.1%
, 8
 
0.1%
' 1
 
< 0.1%
; 1
 
< 0.1%
# 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 204
33.6%
1 188
30.9%
2 75
 
12.3%
3 32
 
5.3%
9 30
 
4.9%
5 27
 
4.4%
4 21
 
3.5%
6 14
 
2.3%
7 9
 
1.5%
8 8
 
1.3%
Dash Punctuation
ValueCountFrequency (%)
- 127
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 94
100.0%
Math Symbol
ValueCountFrequency (%)
= 62
100.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 50634
76.4%
Common 15361
 
23.2%
Hangul 304
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
4.6%
11
 
3.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
Other values (115) 227
74.7%
Latin
ValueCountFrequency (%)
w 7079
14.0%
o 4887
 
9.7%
t 4355
 
8.6%
r 4083
 
8.1%
k 3201
 
6.3%
h 2697
 
5.3%
p 2647
 
5.2%
c 2594
 
5.1%
n 2491
 
4.9%
g 2368
 
4.7%
Other values (37) 14232
28.1%
Common
ValueCountFrequency (%)
. 8168
53.2%
/ 4536
29.5%
: 1632
 
10.6%
0 204
 
1.3%
1 188
 
1.2%
- 127
 
0.8%
_ 94
 
0.6%
2 75
 
0.5%
= 62
 
0.4%
51
 
0.3%
Other values (16) 224
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 65995
99.5%
Hangul 303
 
0.5%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 8168
 
12.4%
w 7079
 
10.7%
o 4887
 
7.4%
/ 4536
 
6.9%
t 4355
 
6.6%
r 4083
 
6.2%
k 3201
 
4.9%
h 2697
 
4.1%
p 2647
 
4.0%
c 2594
 
3.9%
Other values (63) 21748
33.0%
Hangul
ValueCountFrequency (%)
14
 
4.6%
11
 
3.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
Other values (114) 226
74.6%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

2016
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

대상: 6935
대상아님: 3065

Length

Max length: 4
Median length: 2
Mean length: 2.613
Min length: 2
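
The length statistics above can be reproduced from the value counts reported for this variable (6935 rows of `대상`, 3065 rows of `대상아님`). The sketch below is a minimal check that assumes length is measured in Unicode code points via Python's `len`; the variable names are illustrative.

```python
from statistics import mean, median

# Value counts as reported for the 2016 variable in this section.
value_counts = {"대상": 6935, "대상아님": 3065}

# Expand the counts into one string length per row.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(stats)
```

The mean works out to (2 × 6935 + 4 × 3065) / 10000 = 2.613, and because more than half of the rows hold the two-character value, the median is 2.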

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대상
2nd row: 대상아님
3rd row: 대상아님
4th row: 대상
5th row: 대상

Common Values

Value | Count | Frequency (%)
대상 | 6935 | 69.3%
대상아님 | 3065 | 30.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
대상 | 6935 | 69.3%
대상아님 | 3065 | 30.6%

2017
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

대상: 6999
대상아님: 3001

Length

Max length: 4
Median length: 2
Mean length: 2.6002
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대상
2nd row: 대상아님
3rd row: 대상아님
4th row: 대상
5th row: 대상

Common Values

Value | Count | Frequency (%)
대상 | 6999 | 70.0%
대상아님 | 3001 | 30.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
대상 | 6999 | 70.0%
대상아님 | 3001 | 30.0%

2018
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

대상: 6957
대상아님: 3043

Length

Max length: 4
Median length: 2
Mean length: 2.6086
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대상
2nd row: 대상아님
3rd row: 대상아님
4th row: 대상
5th row: 대상

Common Values

Value | Count | Frequency (%)
대상 | 6957 | 69.6%
대상아님 | 3043 | 30.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
대상 | 6957 | 69.6%
대상아님 | 3043 | 30.4%

2019
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

대상제외: 4475
대상: 2738
<NA>: 2734
대상(제외신청 거절): 52
대상제외신청중: 1

Length

Max length: 11
Median length: 4
Mean length: 3.4891
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 대상제외
2nd row: <NA>
3rd row: <NA>
4th row: 대상제외
5th row: 대상제외

Common Values

Value | Count | Frequency (%)
대상제외 | 4475 | 44.8%
대상 | 2738 | 27.4%
<NA> | 2734 | 27.3%
대상(제외신청 거절) | 52 | 0.5%
대상제외신청중 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
대상제외 | 4475 | 44.5%
대상 | 2738 | 27.2%
na | 2734 | 27.2%
대상(제외신청 | 52 | 0.5%
거절 | 52 | 0.5%
대상제외신청중 | 1 | < 0.1%

Interactions


Correlations

[Figure: correlation heatmap]
Variable | 번호 | 업체분야 | 업체특성 | 업체구분 | 업종 | 세부업종 | 2016 | 2017 | 2018 | 2019
번호 | 1.000 | 0.397 | 0.498 | 0.444 | 0.562 | 0.584 | 0.675 | 0.660 | 0.620 | 0.363
업체분야 | 0.397 | 1.000 | 0.773 | 0.323 | 0.908 | 0.953 | 0.222 | 0.188 | 0.197 | 0.476
업체특성 | 0.498 | 0.773 | 1.000 | 0.263 | 0.791 | 0.846 | 0.374 | 0.364 | 0.329 | 0.593
업체구분 | 0.444 | 0.323 | 0.263 | 1.000 | 0.295 | 0.335 | 0.389 | 0.389 | 0.352 | 0.882
업종 | 0.562 | 0.908 | 0.791 | 0.295 | 1.000 | 1.000 | 0.300 | 0.290 | 0.286 | 0.673
세부업종 | 0.584 | 0.953 | 0.846 | 0.335 | 1.000 | 1.000 | 0.337 | 0.313 | 0.314 | 0.705
2016 | 0.675 | 0.222 | 0.374 | 0.389 | 0.300 | 0.337 | 1.000 | 0.988 | 0.957 | 0.321
2017 | 0.660 | 0.188 | 0.364 | 0.389 | 0.290 | 0.313 | 0.988 | 1.000 | 0.985 | 0.322
2018 | 0.620 | 0.197 | 0.329 | 0.352 | 0.286 | 0.314 | 0.957 | 0.985 | 1.000 | 0.337
2019 | 0.363 | 0.476 | 0.593 | 0.882 | 0.673 | 0.705 | 0.321 | 0.322 | 0.337 | 1.000
[Figure: correlation heatmap]
Variable | 업체구분 | 업체분야 | 2017 | 업체특성 | 2016 | 2018 | 2019 | 업종
업체구분 | 1.000 | 0.232 | 0.254 | 0.251 | 0.254 | 0.229 | 0.689 | 0.233
업체분야 | 0.232 | 1.000 | 0.135 | 0.539 | 0.160 | 0.142 | 0.325 | 0.716
2017 | 0.254 | 0.135 | 1.000 | 0.349 | 0.901 | 0.891 | 0.215 | 0.229
업체특성 | 0.251 | 0.539 | 0.349 | 1.000 | 0.358 | 0.315 | 0.398 | 0.433
2016 | 0.254 | 0.160 | 0.901 | 0.358 | 1.000 | 0.813 | 0.214 | 0.237
2018 | 0.229 | 0.142 | 0.891 | 0.315 | 0.813 | 1.000 | 0.225 | 0.226
2019 | 0.689 | 0.325 | 0.215 | 0.398 | 0.214 | 0.225 | 1.000 | 0.380
업종 | 0.233 | 0.716 | 0.229 | 0.433 | 0.237 | 0.226 | 0.380 | 1.000
[Figure: correlation heatmap]
Variable | 번호 | 업체분야 | 업체특성 | 업체구분 | 업종 | 2016 | 2017 | 2018 | 2019
번호 | 1.000 | 0.222 | 0.238 | 0.341 | 0.211 | 0.525 | 0.513 | 0.480 | 0.225
업체분야 | 0.222 | 1.000 | 0.539 | 0.232 | 0.716 | 0.160 | 0.135 | 0.142 | 0.325
업체특성 | 0.238 | 0.539 | 1.000 | 0.251 | 0.433 | 0.358 | 0.349 | 0.315 | 0.398
업체구분 | 0.341 | 0.232 | 0.251 | 1.000 | 0.233 | 0.254 | 0.254 | 0.229 | 0.689
업종 | 0.211 | 0.716 | 0.433 | 0.233 | 1.000 | 0.237 | 0.229 | 0.226 | 0.380
2016 | 0.525 | 0.160 | 0.358 | 0.254 | 0.237 | 1.000 | 0.901 | 0.813 | 0.214
2017 | 0.513 | 0.135 | 0.349 | 0.254 | 0.229 | 0.901 | 1.000 | 0.891 | 0.215
2018 | 0.480 | 0.142 | 0.315 | 0.229 | 0.226 | 0.813 | 0.891 | 1.000 | 0.225
2019 | 0.225 | 0.325 | 0.398 | 0.689 | 0.380 | 0.214 | 0.215 | 0.225 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
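
One way to read "nullity correlation" is as the Pearson correlation between the missing-value indicators of two columns. The sketch below computes that for plain Python lists; it is an illustration under that assumption, not the profiling tool's own code, and the function name `nullity_correlation` and the toy columns are hypothetical.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Returns None when either column is fully present or fully missing,
    because the indicator then has zero variance.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)

# Hypothetical toy columns; None marks a missing cell.
homepage = ["www.rch.or.kr", None, None, "www.knuh.or.kr"]
phone = ["054-534-3501", None, None, "033-258-9333"]
detail = [None, "243", "200", None]
reg_no = ["511-82-00113", "506-81-05517", "226-85-22656", "221-82-08323"]

same_pattern = nullity_correlation(homepage, phone)   # missing in the same rows
opposite = nullity_correlation(homepage, detail)      # missing in complementary rows
undefined = nullity_correlation(homepage, reg_no)     # reg_no has no missing cells
print(same_pattern, opposite, undefined)
```

A value near 1 means two columns tend to be missing in the same rows, a value near -1 means one tends to be present exactly when the other is missing, and columns with no missing cells (or no present cells) have no defined nullity correlation.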

Sample

(index) | 번호 | 업체명 | 업체분야 | 업체특성 | 업체구분 | 사업장명 | 업체 대표명 | 사업자등록번호 | 대표자 | 대표전화번호 | 주소 | 상세주소 | 업종 | 세부업종 | 주요업무 | 홈페이지 | 2016 | 2017 | 2018 | 2019
11982 | 11986 | 임실군청 | 공공행정 | 지방자치단체 | 사업장 | 임실군 청웅면사무소 | 전라북도 임실군 | 407-83-02560 | 임실군수 | <NA> | 전라북도 임실군 청웅면 청웅로 153 | <NA> | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | <NA> | <NA> | 대상 | 대상 | 대상 | 대상제외
10389 | 10391 | 롯데쇼핑(주) 본점(본사) | 기타서비스 | 온실가스목표관리업체 | 사업장 | 롯데시네마 | 롯데쇼핑(주) | 215-85-13462 | 강희태 | <NA> | 서울특별시 송파구 올림픽로 269 | 롯데캐슬골드 4층 롯데시네마 | 도매 및 소매업 | 소매업; 자동차 제외 | <NA> | <NA> | 대상아님 | 대상아님 | 대상아님 | <NA>
16549 | 16552 | 한국남동발전(주) 영동화력발전처 | 제조 | 녹색기업 | 대표사업장 | 한국남동발전(주) 영동화력발전처 | 한국남동발전(주) | 226-85-22656 | 영동화력발전처장 | 033-640-3329 | 강원 강릉시 강동면 안인리 | 200 영동화력발전처 | 전기, 가스, 증기 및 수도사업 | 전기, 가스, 증기 및 공기조절 공급업 | - | <NA> | 대상아님 | 대상아님 | 대상아님 | <NA>
8277 | 8279 | 은평구청 | 공공행정 | 지방자치단체 | 사업장 | 구립응암정보도서관 | 서울특별시 은평구 | 110-82-13924 | 김미경 | <NA> | 서울시 은평구 가좌로7길 15(응암동 730-3) | <NA> | 예술, 스포츠 및 여가관련 서비스업 | 창작, 예술 및 여가관련 서비스업 | <NA> | <NA> | 대상 | 대상 | 대상 | 대상제외
8290 | 8292 | 진주시청 | 공공행정 | 지방자치단체 | 사업장 | 사봉면 | 경상남도 진주시 | 613-83-01415 | 조규일 | <NA> | 경상남도 진주시 사봉면 사군로 105 | <NA> | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | <NA> | <NA> | 대상 | 대상 | 대상 | 대상제외
11272 | 11274 | 대한적십자사 | 보건 | 공공기관 | 사업장 | 대한적십자사 상주적십자병원 | 대한적십자사 | 511-82-00113 | 박경서 | 054-534-3501 | 경상북도 상주시 상서문로 53 | 대한적십자사 상주적십자병원 총무팀 | 보건업 및 사회복지 서비스업 | 보건업 | 의료 | www.rch.or.kr | 대상 | 대상 | 대상 | 대상제외
1865 | 1866 | 한국산림복지진흥원(본사) | 기타서비스 | 공공기관 | 사업장 | 국립칠곡숲체원 | 한국산림복지진흥원 | 827-82-00089 | 이창재 | 054-977-8773 | 경상북도 칠곡군 석적읍 성곡리 산73-13 | 국립칠곡숲체원 | 보건업 및 사회복지 서비스업 | 사회복지 서비스업 | 산림복지서비스 | <NA> | 대상아님 | 대상 | 대상 | 대상제외
14129 | 14131 | 포스코강판(주) 도금공장 | 제조 | 배출권할당대상업체 | 사업장 | 포스코강판(주) 컬러공장 | 포스코강판(주) | 506-81-05517 | 대표이사 | 054-280-6354 | 경상북도 포항시 남구 대송로 243 (괴동동) | 243 | 제조업 | 1차 금속 제조업 | 컬러강판 제조업 | www.poscocnc.com | 대상 | 대상 | 대상 | 대상
14604 | 14606 | 강원대학교병원 | 보건 | 공공기관 | 대표사업장 | 강원대학교병원 | 강원대학교병원 | 221-82-08323 | 이승준 | 033-258-9333 | 강원도 춘천시 백령로 156 | 강원대학교병원 | 보건업 및 사회복지 서비스업 | 보건업 | 의료서비스업 | www.knuh.or.kr | 대상 | 대상 | 대상 | 대상
17578 | 17598 | 서부농수산물도매시장관리사무소 | 공공행정 | 지방공사 | 대표사업장 | 서부농수산물도매시장관리사무소 | <NA> | 410-83-06067 | 변주봉 | 062-613-5475 | 광주광역시 서구 매월2로 16 | 서부농수산물도매시장관리사무소 | 농업,임업 및 어업 | 농업 | 농수산물 유통거래 | http://seobu-market.gwangju.go.kr | 대상아님 | 대상아님 | 대상아님 | <NA>
(index) | 번호 | 업체명 | 업체분야 | 업체특성 | 업체구분 | 사업장명 | 업체 대표명 | 사업자등록번호 | 대표자 | 대표전화번호 | 주소 | 상세주소 | 업종 | 세부업종 | 주요업무 | 홈페이지 | 2016 | 2017 | 2018 | 2019
16763 | 16765 | (주)한화구미사업장 | 제조 | 온실가스목표관리업체 | 대표사업장 | (주)한화구미사업장 | <NA> | 513-85-12203 | 심경섭 | 054-467-8573 | 경상북도 구미시 산호대로 264-36 | <NA> | 제조업 | 식료품 제조업 | 한화구미사업장 | <NA> | 대상아님 | 대상아님 | 대상아님 | <NA>
6126 | 6127 | 청양군청 | 공공행정 | 지방자치단체 | 사업장 | 공공시설사업소(칠갑산휴양림) | 충청남도 청양군청 | 307-83-01257 | 청양군수 | <NA> | 충청남도 청양군 대치면 칠갑산로 668-103 | <NA> | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | <NA> | <NA> | 대상 | 대상 | 대상 | 대상제외
9431 | 9433 | 한국보건복지인력개발원 | 교육서비스 | 공공기관 | 대표사업장 | 한국보건복지인력개발원 | 한국보건복지인력개발원 | 110-82-10761 | 허선 | 043-710-9135 | 충청북도 청주시 흥덕구 오송읍 오송생명2로 187 | 오송생명과학단지 한국보건복지인력개발원 | 교육 서비스업 | 교육 서비스업 | 교육서비스 | http://www.kohi.or.kr | 대상 | 대상 | 대상 | 대상
13517 | 13519 | 광주광역시청 | 공공행정 | 지방자치단체 | 사업장 | 시립도서관 | 광주광역시 | 409-83-02718 | 이용섭 | 062-613-7712 | 광주광역시 북구 면앙로 130 | 광주광역시시립도서관 | 부동산업 및 임대업 | 임대업;부동산 제외 | 시립도서관 관리업무 | <NA> | 대상 | 대상 | 대상 | 대상제외
1834 | 1835 | 보성군청 | 공공행정 | 지방자치단체 | 사업장 | 노동면사무소 | 전라남도 보성군 | 413-83-00077 | 보성군수 | <NA> | 전라남도 보성군 노동면 광곡길 28 | <NA> | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | <NA> | <NA> | 대상 | 대상 | 대상 | 대상제외
3315 | 3316 | 제주소방서 | 공공행정 | 지방자치단체 | 대표사업장 | 제주소방서 | 제주소방서 | 616-83-00055 | 윤두진 | 064-729-0142 | 제주특별자치도 제주시 중앙로 342 | 제주소방서 | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | 화재,구조,구급,예방, 사회안전 | <NA> | 대상아님 | 대상아님 | 대상아님 | <NA>
10403 | 10405 | (주)이마트(본사+성수점) | 기타서비스 | 배출권할당대상업체 | 대표사업장 | (주)이마트(본사+성수점) | 강희석 | 206-86-50913 | 강희석 | 02-380-9209 | 서울특별시 성동구 뚝섬로 377 | 이마트 본사 및 이마트 성수점 | 도매 및 소매업 | 소매업; 자동차 제외 | 본사 | <NA> | 대상 | 대상 | 대상 | 대상
5628 | 5629 | 파주시청 | 공공행정 | 지방자치단체 | 대표사업장 | 파주시청 | 경기도 파주시 | 128-83-00937 | 최종환 | 031-940-4453 | 경기도 파주시 시청로 50 | 파주시청 | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | 공공기관 | http://www.paju.go.kr/main/main.tdf?a=user.index.IndexApp&c=1001 | 대상 | 대상 | 대상 | 대상
14900 | 14902 | 한국철도공사 | 기타산업 | 온실가스목표관리업체 | 사업장 | 한국철도공사_부산경남본부 | 손병석 | 314-82-10024 | 손병석 | 051-440-2828 | 부산광역시 동구 중앙대로 206 | 한국철도공사 부산경남본부 | 운수업 | 창고 및 운송관련 서비스업 | 운송사업 | korail.com | 대상 | 대상 | 대상 | 대상
17168 | 17164 | 남면사무소 | 공공행정 | 지방자치단체 | 대표사업장 | 남면사무소 | <NA> | 310-83-01379 | 박종관 | 041-670-5134 | 충청남도 태안군 남면 달산포로 311 | <NA> | 공공행정, 국방 및 사회보장 행정 | 공공행정, 국방 및 사회보장 행정 | - | <NA> | 대상아님 | 대상아님 | 대상아님 | <NA>