Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells4608
Missing cells (%)3.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory113.0 B

Variable types

Numeric1
Categorical4
Text8

Dataset

Description환경정보공개시스템 업체 사업장 정보(20.10.22 기준 / 사업장명, 대표명, 전화번호, 주소, 업종, 주요 업무 등)
Author한국환경산업기술원
URLhttps://www.data.go.kr/data/15071996/fileData.do

Alerts

업체유형 is highly overall correlated with 업체특성 and 2 other fieldsHigh correlation
세부업종 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
업종 is highly overall correlated with 업체유형 and 1 other fieldsHigh correlation
업체특성 is highly overall correlated with 업체유형 and 1 other fieldsHigh correlation
번호 is highly overall correlated with 세부업종High correlation
상세주소 has 1409 (14.1%) missing valuesMissing
주요업무 has 3120 (31.2%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:03:31.796354
Analysis finished2023-12-12 14:03:35.269337
Duration3.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6459.7411
Minimum1
Maximum12904
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T23:03:35.356067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile628.95
Q13250.75
median6455.5
Q39694.25
95-th percentile12275.05
Maximum12904
Range12903
Interquartile range (IQR)6443.5

Descriptive statistics

Standard deviation3726.7191
Coefficient of variation (CV)0.57691463
Kurtosis-1.1992879
Mean6459.7411
Median Absolute Deviation (MAD)3225
Skewness-0.0041569003
Sum64597411
Variance13888436
MonotonicityNot monotonic
2023-12-12T23:03:35.572255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5523 1
 
< 0.1%
5343 1
 
< 0.1%
5014 1
 
< 0.1%
5508 1
 
< 0.1%
11630 1
 
< 0.1%
10268 1
 
< 0.1%
12839 1
 
< 0.1%
2330 1
 
< 0.1%
4501 1
 
< 0.1%
322 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
7 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
ValueCountFrequency (%)
12904 1
< 0.1%
12902 1
< 0.1%
12901 1
< 0.1%
12900 1
< 0.1%
12899 1
< 0.1%
12898 1
< 0.1%
12897 1
< 0.1%
12895 1
< 0.1%
12894 1
< 0.1%
12893 1
< 0.1%

업체유형
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공공행정
6086 
제조
1728 
기타서비스
1457 
기타산업
 
293
교육서비스
 
250
Other values (2)
 
186

Length

Max length5
Median length4
Mean length3.8075
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공행정
2nd row공공행정
3rd row공공행정
4th row제조
5th row기타서비스

Common Values

ValueCountFrequency (%)
공공행정 6086
60.9%
제조 1728
 
17.3%
기타서비스 1457
 
14.6%
기타산업 293
 
2.9%
교육서비스 250
 
2.5%
<NA> 98
 
1.0%
보건 88
 
0.9%

Length

2023-12-12T23:03:35.727989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:03:35.841064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공행정 6086
60.9%
제조 1728
 
17.3%
기타서비스 1457
 
14.6%
기타산업 293
 
2.9%
교육서비스 250
 
2.5%
na 98
 
1.0%
보건 88
 
0.9%

업체특성
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
지방자치단체
4052 
배출권할당대상업체
1990 
공공기관
1324 
온실가스목표관리업체
886 
지방공단
432 
Other values (7)
1316 

Length

Max length10
Median length9
Mean length6.435
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공기관
2nd row중앙행정기관
3rd row공공기관
4th row배출권할당대상업체
5th row배출권할당대상업체

Common Values

ValueCountFrequency (%)
지방자치단체 4052
40.5%
배출권할당대상업체 1990
19.9%
공공기관 1324
 
13.2%
온실가스목표관리업체 886
 
8.9%
지방공단 432
 
4.3%
중앙행정기관 419
 
4.2%
<NA> 332
 
3.3%
녹색기업 216
 
2.2%
지방공사 205
 
2.1%
국공립대학 130
 
1.3%
Other values (2) 14
 
0.1%

Length

2023-12-12T23:03:35.983456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지방자치단체 4052
40.5%
배출권할당대상업체 1990
19.9%
공공기관 1324
 
13.2%
온실가스목표관리업체 886
 
8.9%
지방공단 432
 
4.3%
중앙행정기관 419
 
4.2%
na 332
 
3.3%
녹색기업 216
 
2.2%
지방공사 205
 
2.1%
국공립대학 130
 
1.3%
Other values (2) 14
 
0.1%
Distinct2054
Distinct (%)20.5%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T23:03:36.489285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length7.620262
Min length2

Characters and Unicode

Total characters76195
Distinct characters535
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1179 ?
Unique (%)11.8%

Sample

1st row한국광해관리공단
2nd row법무부
3rd row도로교통공단
4th row대한유화(주)
5th row롯데쇼핑(주) 본점(본사)
ValueCountFrequency (%)
본사 381
 
3.0%
부산광역시 142
 
1.1%
주)이마트(본사+성수점 137
 
1.1%
주식회사 121
 
1.0%
서울특별시 121
 
1.0%
본부 108
 
0.9%
롯데쇼핑(주 105
 
0.8%
본점(본사 105
 
0.8%
대구광역시 102
 
0.8%
삼성생명본사(서초타워 100
 
0.8%
Other values (2230) 11154
88.7%
2023-12-12T23:03:36.975824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4393
 
5.8%
3570
 
4.7%
3052
 
4.0%
) 2792
 
3.7%
( 2774
 
3.6%
2578
 
3.4%
2213
 
2.9%
2039
 
2.7%
1770
 
2.3%
1483
 
1.9%
Other values (525) 49531
65.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66759
87.6%
Close Punctuation 2792
 
3.7%
Open Punctuation 2774
 
3.6%
Space Separator 2578
 
3.4%
Uppercase Letter 769
 
1.0%
Math Symbol 137
 
0.2%
Lowercase Letter 135
 
0.2%
Decimal Number 116
 
0.2%
Connector Punctuation 66
 
0.1%
Other Punctuation 48
 
0.1%
Other values (2) 21
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4393
 
6.6%
3570
 
5.3%
3052
 
4.6%
2213
 
3.3%
2039
 
3.1%
1770
 
2.7%
1483
 
2.2%
1452
 
2.2%
1407
 
2.1%
1286
 
1.9%
Other values (465) 44094
66.0%
Uppercase Letter
ValueCountFrequency (%)
S 163
21.2%
K 103
13.4%
C 99
12.9%
L 63
 
8.2%
G 44
 
5.7%
D 38
 
4.9%
J 33
 
4.3%
I 30
 
3.9%
N 29
 
3.8%
P 24
 
3.1%
Other values (13) 143
18.6%
Lowercase Letter
ValueCountFrequency (%)
k 28
20.7%
t 27
20.0%
s 13
9.6%
e 12
8.9%
a 11
 
8.1%
p 8
 
5.9%
m 6
 
4.4%
b 6
 
4.4%
u 5
 
3.7%
r 5
 
3.7%
Other values (7) 14
10.4%
Decimal Number
ValueCountFrequency (%)
1 78
67.2%
2 24
 
20.7%
3 8
 
6.9%
4 3
 
2.6%
7 1
 
0.9%
6 1
 
0.9%
5 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
& 18
37.5%
, 15
31.2%
. 11
22.9%
/ 4
 
8.3%
Other Symbol
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 2792
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2774
100.0%
Space Separator
ValueCountFrequency (%)
2578
100.0%
Math Symbol
ValueCountFrequency (%)
+ 137
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 66
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66760
87.6%
Common 8531
 
11.2%
Latin 904
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4393
 
6.6%
3570
 
5.3%
3052
 
4.6%
2213
 
3.3%
2039
 
3.1%
1770
 
2.7%
1483
 
2.2%
1452
 
2.2%
1407
 
2.1%
1286
 
1.9%
Other values (466) 44095
66.1%
Latin
ValueCountFrequency (%)
S 163
18.0%
K 103
 
11.4%
C 99
 
11.0%
L 63
 
7.0%
G 44
 
4.9%
D 38
 
4.2%
J 33
 
3.7%
I 30
 
3.3%
N 29
 
3.2%
k 28
 
3.1%
Other values (30) 274
30.3%
Common
ValueCountFrequency (%)
) 2792
32.7%
( 2774
32.5%
2578
30.2%
+ 137
 
1.6%
1 78
 
0.9%
_ 66
 
0.8%
2 24
 
0.3%
& 18
 
0.2%
- 16
 
0.2%
, 15
 
0.2%
Other values (9) 33
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66759
87.6%
ASCII 9431
 
12.4%
Geometric Shapes 4
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4393
 
6.6%
3570
 
5.3%
3052
 
4.6%
2213
 
3.3%
2039
 
3.1%
1770
 
2.7%
1483
 
2.2%
1452
 
2.2%
1407
 
2.1%
1286
 
1.9%
Other values (465) 44094
66.0%
ASCII
ValueCountFrequency (%)
) 2792
29.6%
( 2774
29.4%
2578
27.3%
S 163
 
1.7%
+ 137
 
1.5%
K 103
 
1.1%
C 99
 
1.0%
1 78
 
0.8%
_ 66
 
0.7%
L 63
 
0.7%
Other values (47) 578
 
6.1%
Geometric Shapes
ValueCountFrequency (%)
2
50.0%
2
50.0%
None
ValueCountFrequency (%)
1
100.0%
Distinct9534
Distinct (%)95.4%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T23:03:37.373819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length9.8257652
Min length2

Characters and Unicode

Total characters98238
Distinct characters678
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9291 ?
Unique (%)92.9%

Sample

1st row한국광해관리공단 충청지사
2nd row청주출입국·외국인사무소
3rd row강원교통방송
4th row대한유화(주)
5th row롯데쇼핑(주)안양점
ValueCountFrequency (%)
주)이마트 135
 
0.9%
주민센터 129
 
0.9%
본사 89
 
0.6%
농업기술센터 84
 
0.6%
보건소 83
 
0.6%
주식회사 75
 
0.5%
홈플러스 75
 
0.5%
한국산업은행 59
 
0.4%
롯데마트 55
 
0.4%
행정복지센터 55
 
0.4%
Other values (9092) 13595
94.2%
2023-12-12T23:03:37.947000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4443
 
4.5%
3673
 
3.7%
3159
 
3.2%
2247
 
2.3%
) 2187
 
2.2%
( 2182
 
2.2%
2137
 
2.2%
2053
 
2.1%
1805
 
1.8%
1794
 
1.8%
Other values (668) 72558
73.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 86164
87.7%
Space Separator 4443
 
4.5%
Close Punctuation 2189
 
2.2%
Open Punctuation 2184
 
2.2%
Decimal Number 1195
 
1.2%
Uppercase Letter 1092
 
1.1%
Connector Punctuation 596
 
0.6%
Lowercase Letter 135
 
0.1%
Dash Punctuation 134
 
0.1%
Other Punctuation 95
 
0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3673
 
4.3%
3159
 
3.7%
2247
 
2.6%
2137
 
2.5%
2053
 
2.4%
1805
 
2.1%
1794
 
2.1%
1753
 
2.0%
1650
 
1.9%
1625
 
1.9%
Other values (599) 64268
74.6%
Uppercase Letter
ValueCountFrequency (%)
S 166
15.2%
C 138
12.6%
K 112
10.3%
L 83
 
7.6%
T 68
 
6.2%
D 54
 
4.9%
G 54
 
4.9%
I 53
 
4.9%
N 48
 
4.4%
P 43
 
3.9%
Other values (15) 273
25.0%
Lowercase Letter
ValueCountFrequency (%)
k 25
18.5%
t 25
18.5%
a 12
8.9%
s 9
 
6.7%
o 8
 
5.9%
e 8
 
5.9%
u 7
 
5.2%
g 6
 
4.4%
p 6
 
4.4%
b 5
 
3.7%
Other values (8) 24
17.8%
Decimal Number
ValueCountFrequency (%)
2 402
33.6%
1 373
31.2%
3 177
14.8%
4 88
 
7.4%
0 56
 
4.7%
5 43
 
3.6%
6 28
 
2.3%
9 11
 
0.9%
7 11
 
0.9%
8 6
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 33
34.7%
& 27
28.4%
, 19
20.0%
· 11
 
11.6%
/ 5
 
5.3%
Other Symbol
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
2
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 2187
99.9%
] 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2182
99.9%
[ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
4443
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 596
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 134
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 86170
87.7%
Common 10841
 
11.0%
Latin 1227
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3673
 
4.3%
3159
 
3.7%
2247
 
2.6%
2137
 
2.5%
2053
 
2.4%
1805
 
2.1%
1794
 
2.1%
1753
 
2.0%
1650
 
1.9%
1625
 
1.9%
Other values (600) 64274
74.6%
Latin
ValueCountFrequency (%)
S 166
13.5%
C 138
 
11.2%
K 112
 
9.1%
L 83
 
6.8%
T 68
 
5.5%
D 54
 
4.4%
G 54
 
4.4%
I 53
 
4.3%
N 48
 
3.9%
P 43
 
3.5%
Other values (33) 408
33.3%
Common
ValueCountFrequency (%)
4443
41.0%
) 2187
20.2%
( 2182
20.1%
_ 596
 
5.5%
2 402
 
3.7%
1 373
 
3.4%
3 177
 
1.6%
- 134
 
1.2%
4 88
 
0.8%
0 56
 
0.5%
Other values (15) 203
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 86160
87.7%
ASCII 12053
 
12.3%
None 17
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Geometric Shapes 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4443
36.9%
) 2187
18.1%
( 2182
18.1%
_ 596
 
4.9%
2 402
 
3.3%
1 373
 
3.1%
3 177
 
1.5%
S 166
 
1.4%
C 138
 
1.1%
- 134
 
1.1%
Other values (55) 1255
 
10.4%
Hangul
ValueCountFrequency (%)
3673
 
4.3%
3159
 
3.7%
2247
 
2.6%
2137
 
2.5%
2053
 
2.4%
1805
 
2.1%
1794
 
2.1%
1753
 
2.0%
1650
 
1.9%
1625
 
1.9%
Other values (598) 64264
74.6%
None
ValueCountFrequency (%)
· 11
64.7%
6
35.3%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
50.0%
2
50.0%
Distinct5407
Distinct (%)54.2%
Missing20
Missing (%)0.2%
Memory size156.2 KiB
2023-12-12T23:03:38.441782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length3
Mean length3.6468938
Min length1

Characters and Unicode

Total characters36396
Distinct characters476
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4363 ?
Unique (%)43.7%

Sample

1st row남광수
2nd row대표자
3rd row윤종기
4th row정영태
5th row이원준
ValueCountFrequency (%)
동장 141
 
1.3%
강희석 113
 
1.1%
임일순 96
 
0.9%
김창수 93
 
0.9%
소장 91
 
0.9%
대표이사 76
 
0.7%
면장 70
 
0.7%
성낙인 58
 
0.6%
이원준 50
 
0.5%
이상철 46
 
0.4%
Other values (5482) 9648
92.0%
2023-12-12T23:03:39.085119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2329
 
6.4%
1697
 
4.7%
1360
 
3.7%
918
 
2.5%
694
 
1.9%
683
 
1.9%
637
 
1.8%
622
 
1.7%
556
 
1.5%
515
 
1.4%
Other values (466) 26385
72.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34945
96.0%
Space Separator 509
 
1.4%
Other Punctuation 353
 
1.0%
Decimal Number 274
 
0.8%
Uppercase Letter 143
 
0.4%
Close Punctuation 79
 
0.2%
Open Punctuation 79
 
0.2%
Lowercase Letter 10
 
< 0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2329
 
6.7%
1697
 
4.9%
1360
 
3.9%
918
 
2.6%
694
 
2.0%
683
 
2.0%
637
 
1.8%
622
 
1.8%
556
 
1.6%
515
 
1.5%
Other values (422) 24934
71.4%
Uppercase Letter
ValueCountFrequency (%)
A 14
 
9.8%
N 12
 
8.4%
I 12
 
8.4%
O 11
 
7.7%
L 10
 
7.0%
C 10
 
7.0%
E 10
 
7.0%
G 9
 
6.3%
R 9
 
6.3%
S 8
 
5.6%
Other values (11) 38
26.6%
Decimal Number
ValueCountFrequency (%)
2 99
36.1%
1 91
33.2%
3 41
15.0%
4 17
 
6.2%
5 8
 
2.9%
6 7
 
2.6%
0 5
 
1.8%
9 3
 
1.1%
7 2
 
0.7%
8 1
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
h 2
20.0%
d 2
20.0%
z 2
20.0%
t 1
10.0%
g 1
10.0%
w 1
10.0%
j 1
10.0%
Other Punctuation
ValueCountFrequency (%)
, 347
98.3%
. 6
 
1.7%
Space Separator
ValueCountFrequency (%)
509
100.0%
Close Punctuation
ValueCountFrequency (%)
) 79
100.0%
Open Punctuation
ValueCountFrequency (%)
( 79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34931
96.0%
Common 1298
 
3.6%
Latin 153
 
0.4%
Han 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2329
 
6.7%
1697
 
4.9%
1360
 
3.9%
918
 
2.6%
694
 
2.0%
683
 
2.0%
637
 
1.8%
622
 
1.8%
556
 
1.6%
515
 
1.5%
Other values (421) 24920
71.3%
Latin
ValueCountFrequency (%)
A 14
 
9.2%
N 12
 
7.8%
I 12
 
7.8%
O 11
 
7.2%
L 10
 
6.5%
C 10
 
6.5%
E 10
 
6.5%
G 9
 
5.9%
R 9
 
5.9%
S 8
 
5.2%
Other values (18) 48
31.4%
Common
ValueCountFrequency (%)
509
39.2%
, 347
26.7%
2 99
 
7.6%
1 91
 
7.0%
) 79
 
6.1%
( 79
 
6.1%
3 41
 
3.2%
4 17
 
1.3%
5 8
 
0.6%
6 7
 
0.5%
Other values (6) 21
 
1.6%
Han
ValueCountFrequency (%)
14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34931
96.0%
ASCII 1451
 
4.0%
CJK 14
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2329
 
6.7%
1697
 
4.9%
1360
 
3.9%
918
 
2.6%
694
 
2.0%
683
 
2.0%
637
 
1.8%
622
 
1.8%
556
 
1.6%
515
 
1.5%
Other values (421) 24920
71.3%
ASCII
ValueCountFrequency (%)
509
35.1%
, 347
23.9%
2 99
 
6.8%
1 91
 
6.3%
) 79
 
5.4%
( 79
 
5.4%
3 41
 
2.8%
4 17
 
1.2%
A 14
 
1.0%
N 12
 
0.8%
Other values (34) 163
 
11.2%
CJK
ValueCountFrequency (%)
14
100.0%
Distinct6154
Distinct (%)61.9%
Missing55
Missing (%)0.5%
Memory size156.2 KiB
2023-12-12T23:03:39.320939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters119340
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5351 ?
Unique (%)53.8%

Sample

1st row305-82-14803
2nd row315-83-02636
3rd row203-82-32086
4th row104-81-06794
5th row123-85-27798
ValueCountFrequency (%)
206-86-50913 137
 
1.4%
220-81-60348 77
 
0.8%
120-82-00052 62
 
0.6%
220-81-39938 47
 
0.5%
418-83-00034 42
 
0.4%
119-82-08433 40
 
0.4%
125-83-01960 37
 
0.4%
203-82-32086 37
 
0.4%
128-83-00937 35
 
0.4%
403-83-00013 34
 
0.3%
Other values (6144) 9397
94.5%
2023-12-12T23:03:39.689229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 19890
16.7%
0 18959
15.9%
1 15105
12.7%
8 14102
11.8%
3 12054
10.1%
2 11326
9.5%
6 6588
 
5.5%
5 6579
 
5.5%
4 6433
 
5.4%
7 4308
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99450
83.3%
Dash Punctuation 19890
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 18959
19.1%
1 15105
15.2%
8 14102
14.2%
3 12054
12.1%
2 11326
11.4%
6 6588
 
6.6%
5 6579
 
6.6%
4 6433
 
6.5%
7 4308
 
4.3%
9 3996
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 19890
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 119340
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 19890
16.7%
0 18959
15.9%
1 15105
12.7%
8 14102
11.8%
3 12054
10.1%
2 11326
9.5%
6 6588
 
5.5%
5 6579
 
5.5%
4 6433
 
5.4%
7 4308
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119340
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 19890
16.7%
0 18959
15.9%
1 15105
12.7%
8 14102
11.8%
3 12054
10.1%
2 11326
9.5%
6 6588
 
5.5%
5 6579
 
5.5%
4 6433
 
5.4%
7 4308
 
3.6%
Distinct6137
Distinct (%)61.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:03:39.963212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.6179
Min length10

Characters and Unicode

Total characters116179
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5624 ?
Unique (%)56.2%

Sample

1st row042-627-6209
2nd row043-236-4907
3rd row00-000-000
4th row02-2122-1459
5th row02-2118-2280
ValueCountFrequency (%)
00-000-000 1738
 
17.4%
02-3709-5221 99
 
1.0%
02-3459-8160 95
 
0.9%
02-2145-8587 45
 
0.4%
063-281-2613 36
 
0.4%
054-840-6189 31
 
0.3%
054-760-7415 31
 
0.3%
061-339-2825 30
 
0.3%
02-2118-2280 29
 
0.3%
02-509-2239 27
 
0.3%
Other values (6127) 7839
78.4%
2023-12-12T23:03:40.384741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 29651
25.5%
- 20000
17.2%
3 9488
 
8.2%
2 9302
 
8.0%
5 8648
 
7.4%
1 7490
 
6.4%
6 7104
 
6.1%
4 7092
 
6.1%
9 6576
 
5.7%
8 5531
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 96179
82.8%
Dash Punctuation 20000
 
17.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 29651
30.8%
3 9488
 
9.9%
2 9302
 
9.7%
5 8648
 
9.0%
1 7490
 
7.8%
6 7104
 
7.4%
4 7092
 
7.4%
9 6576
 
6.8%
8 5531
 
5.8%
7 5297
 
5.5%
Dash Punctuation
ValueCountFrequency (%)
- 20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 116179
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 29651
25.5%
- 20000
17.2%
3 9488
 
8.2%
2 9302
 
8.0%
5 8648
 
7.4%
1 7490
 
6.4%
6 7104
 
6.1%
4 7092
 
6.1%
9 6576
 
5.7%
8 5531
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 116179
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 29651
25.5%
- 20000
17.2%
3 9488
 
8.2%
2 9302
 
8.0%
5 8648
 
7.4%
1 7490
 
6.4%
6 7104
 
6.1%
4 7092
 
6.1%
9 6576
 
5.7%
8 5531
 
4.8%

주소
Text

Distinct8830
Distinct (%)88.3%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T23:03:40.817606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length39
Mean length19.270427
Min length3

Characters and Unicode

Total characters192685
Distinct characters571
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8123 ?
Unique (%)81.2%

Sample

1st row대전광역시 대덕구 한밭대로 1027
2nd row충청북도 청주시 흥덕구 비하로12번길 52
3rd row강원도 원주시 동부순환로 183
4th row서울특별시 종로구 자하문로 77
5th row경기도 안양시 만안구 만안로 244
ValueCountFrequency (%)
경기도 1488
 
3.3%
서울특별시 1446
 
3.2%
경상남도 734
 
1.6%
경상북도 719
 
1.6%
전라남도 571
 
1.3%
전라북도 545
 
1.2%
부산광역시 540
 
1.2%
충청남도 529
 
1.2%
강원도 501
 
1.1%
충청북도 466
 
1.0%
Other values (10065) 37489
83.3%
2023-12-12T23:03:41.427303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35068
 
18.2%
8704
 
4.5%
8081
 
4.2%
1 6322
 
3.3%
6139
 
3.2%
5843
 
3.0%
2 4342
 
2.3%
3 3360
 
1.7%
3358
 
1.7%
3323
 
1.7%
Other values (561) 108145
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 124820
64.8%
Space Separator 35068
 
18.2%
Decimal Number 31064
 
16.1%
Dash Punctuation 1334
 
0.7%
Open Punctuation 165
 
0.1%
Close Punctuation 165
 
0.1%
Uppercase Letter 48
 
< 0.1%
Other Punctuation 20
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8704
 
7.0%
8081
 
6.5%
6139
 
4.9%
5843
 
4.7%
3358
 
2.7%
3323
 
2.7%
3287
 
2.6%
2974
 
2.4%
2959
 
2.4%
2916
 
2.3%
Other values (526) 77236
61.9%
Uppercase Letter
ValueCountFrequency (%)
L 9
18.8%
G 7
14.6%
C 7
14.6%
A 4
8.3%
D 3
 
6.2%
P 3
 
6.2%
E 3
 
6.2%
J 2
 
4.2%
K 2
 
4.2%
B 2
 
4.2%
Other values (6) 6
12.5%
Decimal Number
ValueCountFrequency (%)
1 6322
20.4%
2 4342
14.0%
3 3360
10.8%
5 2903
9.3%
4 2690
8.7%
0 2567
8.3%
6 2443
 
7.9%
7 2376
 
7.6%
8 2090
 
6.7%
9 1971
 
6.3%
Other Punctuation
ValueCountFrequency (%)
, 8
40.0%
· 6
30.0%
. 5
25.0%
' 1
 
5.0%
Space Separator
ValueCountFrequency (%)
35068
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1334
100.0%
Open Punctuation
ValueCountFrequency (%)
( 165
100.0%
Close Punctuation
ValueCountFrequency (%)
) 165
100.0%
Lowercase Letter
ValueCountFrequency (%)
l 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 124820
64.8%
Common 67816
35.2%
Latin 49
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8704
 
7.0%
8081
 
6.5%
6139
 
4.9%
5843
 
4.7%
3358
 
2.7%
3323
 
2.7%
3287
 
2.6%
2974
 
2.4%
2959
 
2.4%
2916
 
2.3%
Other values (526) 77236
61.9%
Common
ValueCountFrequency (%)
35068
51.7%
1 6322
 
9.3%
2 4342
 
6.4%
3 3360
 
5.0%
5 2903
 
4.3%
4 2690
 
4.0%
0 2567
 
3.8%
6 2443
 
3.6%
7 2376
 
3.5%
8 2090
 
3.1%
Other values (8) 3655
 
5.4%
Latin
ValueCountFrequency (%)
L 9
18.4%
G 7
14.3%
C 7
14.3%
A 4
8.2%
D 3
 
6.1%
P 3
 
6.1%
E 3
 
6.1%
J 2
 
4.1%
K 2
 
4.1%
B 2
 
4.1%
Other values (7) 7
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 124820
64.8%
ASCII 67859
35.2%
None 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35068
51.7%
1 6322
 
9.3%
2 4342
 
6.4%
3 3360
 
5.0%
5 2903
 
4.3%
4 2690
 
4.0%
0 2567
 
3.8%
6 2443
 
3.6%
7 2376
 
3.5%
8 2090
 
3.1%
Other values (24) 3698
 
5.4%
Hangul
ValueCountFrequency (%)
8704
 
7.0%
8081
 
6.5%
6139
 
4.9%
5843
 
4.7%
3358
 
2.7%
3323
 
2.7%
3287
 
2.6%
2974
 
2.4%
2959
 
2.4%
2916
 
2.3%
Other values (526) 77236
61.9%
None
ValueCountFrequency (%)
· 6
100.0%

상세주소
Text

MISSING 

Distinct7513
Distinct (%)87.5%
Missing1409
Missing (%)14.1%
Memory size156.2 KiB
2023-12-12T23:03:41.729990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length41
Mean length8.7424048
Min length1

Characters and Unicode

Total characters75106
Distinct characters672
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7033 ?
Unique (%)81.9%

Sample

1st row우성빌딩 6층 한국광해관리공단 충청지사
2nd row법무부 청주출입국·외국인사무소
3rd row원주시 동부순환로 183 한국교통방송 강원본부
4th row유남빌딩 대한유화공업(주)
5th row독산도서관
ValueCountFrequency (%)
주민센터 141
 
1.1%
이마트 127
 
1.0%
2층 92
 
0.7%
홈플러스 77
 
0.6%
3층 76
 
0.6%
행정복지센터 66
 
0.5%
4층 64
 
0.5%
56
 
0.4%
5층 54
 
0.4%
서울대학교 51
 
0.4%
Other values (8323) 12253
93.8%
2023-12-12T23:03:42.203460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4480
 
6.0%
1962
 
2.6%
1923
 
2.6%
1850
 
2.5%
1 1390
 
1.9%
1333
 
1.8%
1322
 
1.8%
1285
 
1.7%
1267
 
1.7%
) 1265
 
1.7%
Other values (662) 57029
75.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60071
80.0%
Decimal Number 6043
 
8.0%
Space Separator 4480
 
6.0%
Close Punctuation 1265
 
1.7%
Open Punctuation 1262
 
1.7%
Uppercase Letter 1153
 
1.5%
Dash Punctuation 387
 
0.5%
Other Punctuation 267
 
0.4%
Lowercase Letter 151
 
0.2%
Math Symbol 21
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1962
 
3.3%
1923
 
3.2%
1850
 
3.1%
1333
 
2.2%
1322
 
2.2%
1285
 
2.1%
1267
 
2.1%
1231
 
2.0%
1062
 
1.8%
1006
 
1.7%
Other values (587) 45830
76.3%
Uppercase Letter
ValueCountFrequency (%)
S 152
13.2%
L 135
11.7%
G 114
9.9%
C 108
 
9.4%
K 102
 
8.8%
T 73
 
6.3%
D 55
 
4.8%
I 50
 
4.3%
E 49
 
4.2%
B 43
 
3.7%
Other values (15) 272
23.6%
Lowercase Letter
ValueCountFrequency (%)
a 24
15.9%
t 18
11.9%
s 13
 
8.6%
k 13
 
8.6%
n 12
 
7.9%
e 10
 
6.6%
r 7
 
4.6%
i 7
 
4.6%
m 5
 
3.3%
o 5
 
3.3%
Other values (14) 37
24.5%
Decimal Number
ValueCountFrequency (%)
1 1390
23.0%
2 940
15.6%
3 698
11.6%
4 533
 
8.8%
0 514
 
8.5%
5 511
 
8.5%
6 462
 
7.6%
7 361
 
6.0%
8 344
 
5.7%
9 290
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 166
62.2%
. 67
25.1%
& 17
 
6.4%
/ 9
 
3.4%
· 7
 
2.6%
1
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 18
85.7%
+ 2
 
9.5%
1
 
4.8%
Open Punctuation
ValueCountFrequency (%)
( 1261
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
4480
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1265
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 387
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60069
80.0%
Common 13728
 
18.3%
Latin 1304
 
1.7%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1962
 
3.3%
1923
 
3.2%
1850
 
3.1%
1333
 
2.2%
1322
 
2.2%
1285
 
2.1%
1267
 
2.1%
1231
 
2.0%
1062
 
1.8%
1006
 
1.7%
Other values (586) 45828
76.3%
Latin
ValueCountFrequency (%)
S 152
 
11.7%
L 135
 
10.4%
G 114
 
8.7%
C 108
 
8.3%
K 102
 
7.8%
T 73
 
5.6%
D 55
 
4.2%
I 50
 
3.8%
E 49
 
3.8%
B 43
 
3.3%
Other values (39) 423
32.4%
Common
ValueCountFrequency (%)
4480
32.6%
1 1390
 
10.1%
) 1265
 
9.2%
( 1261
 
9.2%
2 940
 
6.8%
3 698
 
5.1%
4 533
 
3.9%
0 514
 
3.7%
5 511
 
3.7%
6 462
 
3.4%
Other values (15) 1674
 
12.2%
Han
ValueCountFrequency (%)
4
80.0%
1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60064
80.0%
ASCII 15023
 
20.0%
None 11
 
< 0.1%
CJK 5
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4480
29.8%
1 1390
 
9.3%
) 1265
 
8.4%
( 1261
 
8.4%
2 940
 
6.3%
3 698
 
4.6%
4 533
 
3.5%
0 514
 
3.4%
5 511
 
3.4%
6 462
 
3.1%
Other values (61) 2969
19.8%
Hangul
ValueCountFrequency (%)
1962
 
3.3%
1923
 
3.2%
1850
 
3.1%
1333
 
2.2%
1322
 
2.2%
1285
 
2.1%
1267
 
2.1%
1231
 
2.0%
1062
 
1.8%
1006
 
1.7%
Other values (584) 45823
76.3%
None
ValueCountFrequency (%)
· 7
63.6%
3
27.3%
1
 
9.1%
CJK
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

업종
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공공행정, 국방 및 사회보장 행정
4651 
제조업
1657 
도매 및 소매업
 
456
전기, 가스, 증기 및 수도사업
 
355
보건업 및 사회복지 서비스업
 
355
Other values (15)
2526 

Length

Max length24
Median length23
Mean length13.8059
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공행정, 국방 및 사회보장 행정
2nd row공공행정, 국방 및 사회보장 행정
3rd row출판, 영상, 방송통신 및 정보서비스업
4th row제조업
5th row도매 및 소매업

Common Values

ValueCountFrequency (%)
공공행정, 국방 및 사회보장 행정 4651
46.5%
제조업 1657
 
16.6%
도매 및 소매업 456
 
4.6%
전기, 가스, 증기 및 수도사업 355
 
3.5%
보건업 및 사회복지 서비스업 355
 
3.5%
교육 서비스업 349
 
3.5%
예술, 스포츠 및 여가관련 서비스업 334
 
3.3%
사업시설관리 및 사업지원 서비스업 313
 
3.1%
금융 및 보험업 296
 
3.0%
하수·폐기물 처리, 원료재생 및 환경복원업 284
 
2.8%
Other values (10) 950
 
9.5%

Length

2023-12-12T23:03:42.340669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
7741
20.0%
공공행정 4651
12.0%
사회보장 4651
12.0%
행정 4651
12.0%
국방 4651
12.0%
제조업 1657
 
4.3%
서비스업 1632
 
4.2%
도매 456
 
1.2%
소매업 456
 
1.2%
수도사업 355
 
0.9%
Other values (40) 7733
20.0%

세부업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5349 
공공행정, 국방 및 사회보장 행정
4651 

Length

Max length18
Median length4
Mean length10.5114
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공행정, 국방 및 사회보장 행정
2nd row공공행정, 국방 및 사회보장 행정
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5349
53.5%
공공행정, 국방 및 사회보장 행정 4651
46.5%

Length

2023-12-12T23:03:42.469766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:03:42.556302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5349
18.7%
공공행정 4651
16.3%
국방 4651
16.3%
4651
16.3%
사회보장 4651
16.3%
행정 4651
16.3%

주요업무
Text

MISSING 

Distinct3294
Distinct (%)47.9%
Missing3120
Missing (%)31.2%
Memory size156.2 KiB
2023-12-12T23:03:42.748155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length46
Mean length8.3023256
Min length1

Characters and Unicode

Total characters57120
Distinct characters640
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2670 ?
Unique (%)38.8%

Sample

1st row광해방지사업, 석탄지역진흥
2nd row외국인 체류관리, 출입국심사
3rd row교통방송
4th row석유화학 제품 제조 및 판매
5th row시설관리, 주차사업, 체육사업
ValueCountFrequency (%)
662
 
4.9%
공공행정 602
 
4.4%
405
 
3.0%
제조 340
 
2.5%
운영 291
 
2.1%
관리 246
 
1.8%
업무 196
 
1.4%
생산 190
 
1.4%
판매 176
 
1.3%
제조업 110
 
0.8%
Other values (3865) 10400
76.4%
2023-12-12T23:03:43.111452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6751
 
11.8%
2362
 
4.1%
1559
 
2.7%
, 1478
 
2.6%
1444
 
2.5%
1363
 
2.4%
1292
 
2.3%
1211
 
2.1%
968
 
1.7%
944
 
1.7%
Other values (630) 37748
66.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47620
83.4%
Space Separator 6751
 
11.8%
Other Punctuation 1730
 
3.0%
Uppercase Letter 407
 
0.7%
Lowercase Letter 213
 
0.4%
Close Punctuation 138
 
0.2%
Open Punctuation 138
 
0.2%
Decimal Number 107
 
0.2%
Dash Punctuation 12
 
< 0.1%
Math Symbol 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2362
 
5.0%
1559
 
3.3%
1444
 
3.0%
1363
 
2.9%
1292
 
2.7%
1211
 
2.5%
968
 
2.0%
944
 
2.0%
919
 
1.9%
857
 
1.8%
Other values (555) 34701
72.9%
Uppercase Letter
ValueCountFrequency (%)
C 62
15.2%
P 46
11.3%
D 30
 
7.4%
T 29
 
7.1%
S 26
 
6.4%
I 26
 
6.4%
E 23
 
5.7%
L 22
 
5.4%
H 21
 
5.2%
A 21
 
5.2%
Other values (15) 101
24.8%
Lowercase Letter
ValueCountFrequency (%)
e 24
11.3%
a 19
 
8.9%
l 18
 
8.5%
i 18
 
8.5%
o 15
 
7.0%
r 14
 
6.6%
s 14
 
6.6%
t 13
 
6.1%
u 11
 
5.2%
c 10
 
4.7%
Other values (15) 57
26.8%
Decimal Number
ValueCountFrequency (%)
1 29
27.1%
8 28
26.2%
2 18
16.8%
0 9
 
8.4%
4 6
 
5.6%
3 5
 
4.7%
6 5
 
4.7%
9 3
 
2.8%
7 3
 
2.8%
5 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 1478
85.4%
. 129
 
7.5%
/ 76
 
4.4%
* 20
 
1.2%
· 12
 
0.7%
& 8
 
0.5%
' 4
 
0.2%
: 3
 
0.2%
Space Separator
ValueCountFrequency (%)
6751
100.0%
Close Punctuation
ValueCountFrequency (%)
) 138
100.0%
Open Punctuation
ValueCountFrequency (%)
( 138
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47619
83.4%
Common 8880
 
15.5%
Latin 620
 
1.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2362
 
5.0%
1559
 
3.3%
1444
 
3.0%
1363
 
2.9%
1292
 
2.7%
1211
 
2.5%
968
 
2.0%
944
 
2.0%
919
 
1.9%
857
 
1.8%
Other values (554) 34700
72.9%
Latin
ValueCountFrequency (%)
C 62
 
10.0%
P 46
 
7.4%
D 30
 
4.8%
T 29
 
4.7%
S 26
 
4.2%
I 26
 
4.2%
e 24
 
3.9%
E 23
 
3.7%
L 22
 
3.5%
H 21
 
3.4%
Other values (40) 311
50.2%
Common
ValueCountFrequency (%)
6751
76.0%
, 1478
 
16.6%
) 138
 
1.6%
( 138
 
1.6%
. 129
 
1.5%
/ 76
 
0.9%
1 29
 
0.3%
8 28
 
0.3%
* 20
 
0.2%
2 18
 
0.2%
Other values (15) 75
 
0.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47613
83.4%
ASCII 9487
 
16.6%
None 12
 
< 0.1%
Compat Jamo 6
 
< 0.1%
CJK 1
 
< 0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6751
71.2%
, 1478
 
15.6%
) 138
 
1.5%
( 138
 
1.5%
. 129
 
1.4%
/ 76
 
0.8%
C 62
 
0.7%
P 46
 
0.5%
D 30
 
0.3%
T 29
 
0.3%
Other values (63) 610
 
6.4%
Hangul
ValueCountFrequency (%)
2362
 
5.0%
1559
 
3.3%
1444
 
3.0%
1363
 
2.9%
1292
 
2.7%
1211
 
2.5%
968
 
2.0%
944
 
2.0%
919
 
1.9%
857
 
1.8%
Other values (552) 34694
72.9%
None
ValueCountFrequency (%)
· 12
100.0%
Compat Jamo
ValueCountFrequency (%)
3
50.0%
3
50.0%
CJK
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T23:03:34.584938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:03:43.188965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업체유형업체특성업종
번호1.0000.4700.5960.634
업체유형0.4701.0000.7810.913
업체특성0.5960.7811.0000.792
업종0.6340.9130.7921.000
2023-12-12T23:03:43.266705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체유형세부업종업종업체특성
업체유형1.0001.0000.7260.550
세부업종1.0001.0001.0001.000
업종0.7261.0001.0000.435
업체특성0.5501.0000.4351.000
2023-12-12T23:03:43.350779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업체유형업체특성업종세부업종
번호1.0000.2700.3030.2521.000
업체유형0.2701.0000.5500.7261.000
업체특성0.3030.5501.0000.4351.000
업종0.2520.7260.4351.0001.000
세부업종1.0001.0001.0001.0001.000

Missing values

2023-12-12T23:03:34.738503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:03:34.933867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:03:35.121488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호업체유형업체특성대표사업장명사업장명업체 대표명사업자등록번호대표팩스번호주소상세주소업종세부업종주요업무
55225523공공행정공공기관한국광해관리공단한국광해관리공단 충청지사남광수305-82-14803042-627-6209대전광역시 대덕구 한밭대로 1027우성빌딩 6층 한국광해관리공단 충청지사공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정광해방지사업, 석탄지역진흥
42294230공공행정중앙행정기관법무부청주출입국·외국인사무소대표자315-83-02636043-236-4907충청북도 청주시 흥덕구 비하로12번길 52법무부 청주출입국·외국인사무소공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정외국인 체류관리, 출입국심사
1184211843공공행정공공기관도로교통공단강원교통방송윤종기203-82-3208600-000-000강원도 원주시 동부순환로 183원주시 동부순환로 183 한국교통방송 강원본부출판, 영상, 방송통신 및 정보서비스업<NA>교통방송
1122411225제조배출권할당대상업체대한유화(주)대한유화(주)정영태104-81-0679402-2122-1459서울특별시 종로구 자하문로 77유남빌딩 대한유화공업(주)제조업<NA>석유화학 제품 제조 및 판매
1172011721기타서비스배출권할당대상업체롯데쇼핑(주) 본점(본사)롯데쇼핑(주)안양점이원준123-85-2779802-2118-2280경기도 안양시 만안구 만안로 244<NA>도매 및 소매업<NA><NA>
14361437공공행정지방공단금천구시설관리공단(공단본부)독산도서관문길수119-82-0390902-863-9548서울특별시 금천구 독산로54길 114독산도서관사업시설관리 및 사업지원 서비스업<NA>시설관리, 주차사업, 체육사업
81578158교육서비스국공립대학공주교육대학교공주교육대학교안병근307-83-00490041-854-1578충청남도 공주시 봉황동 웅진로 27 공주교육대학교봉황동 376교육 서비스업<NA>교육대학교
99499950공공행정지방자치단체부여군청양화면사무소황인덕308-83-01153041-830-2649충청남도 부여군 양화면 입포로 535공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정공공행정서비스
83878388공공행정공공기관근로복지공단대구병원김봉옥504-82-15603053-715-7722대구광역시 북구 학정동 학정로 515대구병원공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정산재병원
53035304공공행정지방자치단체화성시청진안동주민센터김명숙124-83-06318031-369-4931경기도 화성시 진안동 병점4로 34진안동주민센터공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정공공행정
번호업체유형업체특성대표사업장명사업장명업체 대표명사업자등록번호대표팩스번호주소상세주소업종세부업종주요업무
63736374공공행정지방자치단체평택시청평택시청(중앙동)공재광125-83-01960031-8024-3859경기도 평택시 중앙로 275평택시청공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정서비스
34183419공공행정지방자치단체홍천군청내면홍천군수223-83-00023033-430-2609강원도 홍천군 내면 창촌로 59내면사무소공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정공공행정
51915192공공행정지방자치단체서대문구남가좌1동주민센터문석진111-83-0074902-330-8623서울특별시 서대문구 수색로2길 48<NA>공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정<NA>
1011710118공공행정공공기관한전KDN(주)한전KDN(주) 대전충남지역본부임수경116-81-32242042-330-7519대전광역시 대덕구 동서대로 1784비래동(555-1)출판, 영상, 방송통신 및 정보서비스업<NA>전력IT유지보수 및 공사
24462447제조온실가스목표관리업체영흥철강 창원공장영흥철강 창원공장한재열609-81-02065055-282-2676경상남도 창원시 성산구 공단로 193영흥철강(주)제조업<NA>영흥철강(주)
1016610167공공행정지방자치단체부산광역시 수영구수영구의회이정희617-83-01834051-610-4099부산광역시 수영구 남천동로 100수영구의회공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정구조례의 제정 및 개정폐지, 예산의 심의확저으 결산 승인, 행정사무감사.조사등
36623663기타서비스공공기관한국승강기안전관리원한국승강기안전관리원전북지원공창석418-82-0339202-3497-7420전라북도 전주시 덕진구 백제대로 566(금암동, 전북은행사옥 14층)공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정<NA>
22682269기타서비스공공기관한국산업은행한국산업은행 서소문지점서소문지점장104-82-1222102-398-9599서울 중구 세종대로 9길 41올리브타워 1층금융 및 보험업<NA><NA>
77877788공공행정지방자치단체해운대구반여2동주민센터반여2동장618-83-00314051-749-4389부산광역시 해운대구 재반로211번길 9<NA>공공행정, 국방 및 사회보장 행정공공행정, 국방 및 사회보장 행정<NA>
20682069기타산업온실가스목표관리업체(주)금남고속(주)금남고속조성일305-81-00637042-585-7774대전광역시 대덕구 읍내동 171(주)금남고속운수업<NA>시외여객운송