Overview

Dataset statistics

Number of variables6
Number of observations2677
Missing cells826
Missing cells (%)5.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory125.6 KiB
Average record size in memory48.0 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description인허가번호,민원구분,노동조합단체명,사업장명,소속단체명,노동조합주소
Author양천구
URLhttps://data.seoul.go.kr/dataList/OA-10862/S/1/datasetView.do

Alerts

인허가번호 is highly overall correlated with 민원구분High correlation
민원구분 is highly overall correlated with 인허가번호High correlation
소속단체명 has 627 (23.4%) missing valuesMissing
노동조합주소 has 193 (7.2%) missing valuesMissing

Reproduction

Analysis started2024-04-20 19:32:43.852467
Analysis finished2024-04-20 19:33:00.836908
Duration16.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인허가번호
Real number (ℝ)

HIGH CORRELATION 

Distinct771
Distinct (%)28.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8462411 × 1018
Minimum2.011611 × 1017
Maximum2.023611 × 1019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.7 KiB
2024-04-21T04:33:00.923051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.011611 × 1017
5-th percentile1.999301 × 1018
Q12.007317 × 1018
median2.010313 × 1018
Q32.017317 × 1018
95-th percentile2.024317 × 1018
Maximum2.023611 × 1019
Range2.0034949 × 1019
Interquartile range (IQR)1.0000007 × 1016

Descriptive statistics

Standard deviation3.8090945 × 1018
Coefficient of variation (CV)1.3382895
Kurtosis16.657229
Mean2.8462411 × 1018
Median Absolute Deviation (MAD)3.9899995 × 1015
Skewness4.3165261
Sum7.6193874 × 1021
Variance1.4509201 × 1037
MonotonicityNot monotonic
2024-04-21T04:33:01.081413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.0083220000261e+18 219
 
8.2%
2.0093180120261e+18 63
 
2.4%
2.0073210000261e+18 47
 
1.8%
2.0073170107261e+18 43
 
1.6%
2.0073000126261e+18 41
 
1.5%
2.0073160117261e+18 34
 
1.3%
2.0083010000261e+18 33
 
1.2%
2.0073090103261e+18 29
 
1.1%
2.0073010000261e+18 28
 
1.0%
2.0073020095261e+18 23
 
0.9%
Other values (761) 2117
79.1%
ValueCountFrequency (%)
2.01161100002e+17 1
< 0.1%
2.01261100002e+17 1
< 0.1%
2.01361100002e+17 1
< 0.1%
2.01461100002e+17 1
< 0.1%
2.01561100002e+17 1
< 0.1%
1.9613070118261e+18 1
< 0.1%
1.9633010100261005e+18 1
< 0.1%
1.9633200099261e+18 1
< 0.1%
1.9653070118261e+18 1
< 0.1%
1.9663070118261e+18 1
< 0.1%
ValueCountFrequency (%)
2.02361100001019e+19 2
 
0.1%
2.02261100001019e+19 8
0.3%
2.02161100001019e+19 5
0.2%
2.02061100001019e+19 9
0.3%
2.01961100001019e+19 10
0.4%
2.01861100001019e+19 6
0.2%
2.01761100001019e+19 11
0.4%
2.01661100001019e+19 6
0.2%
2.01561100001019e+19 8
0.3%
2.01461100001019e+19 7
0.3%

민원구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
노동조합해산신고
1296 
노동조합설립신고
1003 
노동조합변경신고
372 
노동조합신고
 
6

Length

Max length8
Median length8
Mean length7.9955174
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노동조합변경신고
2nd row노동조합변경신고
3rd row노동조합변경신고
4th row노동조합설립신고
5th row노동조합설립신고

Common Values

ValueCountFrequency (%)
노동조합해산신고 1296
48.4%
노동조합설립신고 1003
37.5%
노동조합변경신고 372
 
13.9%
노동조합신고 6
 
0.2%

Length

2024-04-21T04:33:01.218628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T04:33:01.309604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노동조합해산신고 1296
48.4%
노동조합설립신고 1003
37.5%
노동조합변경신고 372
 
13.9%
노동조합신고 6
 
0.2%
Distinct2532
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
2024-04-21T04:33:01.476793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length30
Mean length11.786328
Min length2

Characters and Unicode

Total characters31552
Distinct characters619
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2398 ?
Unique (%)89.6%

Sample

1st row삼성이앤에이 노동조합 &U
2nd row안정호
3rd row도원에프앤지㈜노동조합
4th row클래시스생산본부노동조합
5th row교권수호 한국체육대학교 교수 노동조합
ValueCountFrequency (%)
노동조합 993
 
22.6%
영업 186
 
4.2%
해산 56
 
1.3%
주식회사 17
 
0.4%
지부 12
 
0.3%
민주노동조합 11
 
0.3%
서울특별시 10
 
0.2%
공무직 9
 
0.2%
서울지점 9
 
0.2%
8
 
0.2%
Other values (2714) 3081
70.2%
2024-04-21T04:33:01.767781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2219
 
7.0%
2146
 
6.8%
2117
 
6.7%
2098
 
6.6%
1716
 
5.4%
820
 
2.6%
) 771
 
2.4%
( 760
 
2.4%
453
 
1.4%
432
 
1.4%
Other values (609) 18020
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27332
86.6%
Space Separator 1716
 
5.4%
Close Punctuation 1013
 
3.2%
Open Punctuation 1002
 
3.2%
Uppercase Letter 395
 
1.3%
Decimal Number 41
 
0.1%
Lowercase Letter 30
 
0.1%
Other Punctuation 13
 
< 0.1%
Dash Punctuation 8
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2219
 
8.1%
2146
 
7.9%
2117
 
7.7%
2098
 
7.7%
820
 
3.0%
453
 
1.7%
432
 
1.6%
427
 
1.6%
410
 
1.5%
406
 
1.5%
Other values (551) 15804
57.8%
Uppercase Letter
ValueCountFrequency (%)
S 57
14.4%
K 50
12.7%
B 35
8.9%
C 33
 
8.4%
T 31
 
7.8%
G 25
 
6.3%
M 24
 
6.1%
N 22
 
5.6%
I 21
 
5.3%
A 14
 
3.5%
Other values (14) 83
21.0%
Lowercase Letter
ValueCountFrequency (%)
s 4
13.3%
c 3
10.0%
i 3
10.0%
h 3
10.0%
e 3
10.0%
k 2
 
6.7%
p 2
 
6.7%
r 2
 
6.7%
b 2
 
6.7%
a 1
 
3.3%
Other values (5) 5
16.7%
Decimal Number
ValueCountFrequency (%)
1 14
34.1%
2 7
17.1%
3 6
14.6%
9 5
 
12.2%
6 4
 
9.8%
4 3
 
7.3%
0 1
 
2.4%
5 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
. 4
30.8%
& 4
30.8%
, 4
30.8%
? 1
 
7.7%
Close Punctuation
ValueCountFrequency (%)
) 771
76.1%
] 242
 
23.9%
Open Punctuation
ValueCountFrequency (%)
( 760
75.8%
[ 242
 
24.2%
Space Separator
ValueCountFrequency (%)
1716
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27333
86.6%
Common 3793
 
12.0%
Latin 425
 
1.3%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2219
 
8.1%
2146
 
7.9%
2117
 
7.7%
2098
 
7.7%
820
 
3.0%
453
 
1.7%
432
 
1.6%
427
 
1.6%
410
 
1.5%
406
 
1.5%
Other values (551) 15805
57.8%
Latin
ValueCountFrequency (%)
S 57
13.4%
K 50
11.8%
B 35
 
8.2%
C 33
 
7.8%
T 31
 
7.3%
G 25
 
5.9%
M 24
 
5.6%
N 22
 
5.2%
I 21
 
4.9%
A 14
 
3.3%
Other values (29) 113
26.6%
Common
ValueCountFrequency (%)
1716
45.2%
) 771
20.3%
( 760
20.0%
[ 242
 
6.4%
] 242
 
6.4%
1 14
 
0.4%
- 8
 
0.2%
2 7
 
0.2%
3 6
 
0.2%
9 5
 
0.1%
Other values (8) 22
 
0.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27331
86.6%
ASCII 4218
 
13.4%
None 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2219
 
8.1%
2146
 
7.9%
2117
 
7.7%
2098
 
7.7%
820
 
3.0%
453
 
1.7%
432
 
1.6%
427
 
1.6%
410
 
1.5%
406
 
1.5%
Other values (550) 15803
57.8%
ASCII
ValueCountFrequency (%)
1716
40.7%
) 771
18.3%
( 760
18.0%
[ 242
 
5.7%
] 242
 
5.7%
S 57
 
1.4%
K 50
 
1.2%
B 35
 
0.8%
C 33
 
0.8%
T 31
 
0.7%
Other values (47) 281
 
6.7%
None
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct2302
Distinct (%)86.2%
Missing6
Missing (%)0.2%
Memory size21.0 KiB
2024-04-21T04:33:01.969661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length81
Median length40
Mean length7.5189068
Min length1

Characters and Unicode

Total characters20083
Distinct characters614
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2024 ?
Unique (%)75.8%

Sample

1st row삼성이앤에이
2nd row한국종합기술
3rd row도원에프앤지
4th row문정공장
5th row한국체육대학교
ValueCountFrequency (%)
주식회사 30
 
1.0%
17
 
0.6%
선진상운(주 8
 
0.3%
서울대학교 6
 
0.2%
sh공사 6
 
0.2%
노동조합 6
 
0.2%
재단법인 6
 
0.2%
서울신용보증재단 5
 
0.2%
남양상운 5
 
0.2%
5
 
0.2%
Other values (2400) 2789
96.7%
2024-04-21T04:33:02.348798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1092
 
5.4%
) 1040
 
5.2%
( 1023
 
5.1%
433
 
2.2%
393
 
2.0%
368
 
1.8%
344
 
1.7%
284
 
1.4%
283
 
1.4%
281
 
1.4%
Other values (604) 14542
72.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17296
86.1%
Close Punctuation 1040
 
5.2%
Open Punctuation 1023
 
5.1%
Uppercase Letter 352
 
1.8%
Space Separator 213
 
1.1%
Other Punctuation 58
 
0.3%
Decimal Number 50
 
0.2%
Lowercase Letter 27
 
0.1%
Dash Punctuation 20
 
0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1092
 
6.3%
433
 
2.5%
393
 
2.3%
368
 
2.1%
344
 
2.0%
284
 
1.6%
283
 
1.6%
281
 
1.6%
273
 
1.6%
266
 
1.5%
Other values (547) 13279
76.8%
Uppercase Letter
ValueCountFrequency (%)
S 59
16.8%
K 46
13.1%
B 31
8.8%
T 30
8.5%
C 24
 
6.8%
G 24
 
6.8%
I 21
 
6.0%
M 16
 
4.5%
N 16
 
4.5%
H 14
 
4.0%
Other values (14) 71
20.2%
Lowercase Letter
ValueCountFrequency (%)
h 4
14.8%
k 4
14.8%
e 3
11.1%
s 3
11.1%
i 2
7.4%
b 2
7.4%
t 2
7.4%
c 2
7.4%
y 1
 
3.7%
p 1
 
3.7%
Other values (3) 3
11.1%
Decimal Number
ValueCountFrequency (%)
1 15
30.0%
2 9
18.0%
3 7
14.0%
6 5
 
10.0%
4 4
 
8.0%
5 3
 
6.0%
9 3
 
6.0%
8 2
 
4.0%
0 1
 
2.0%
7 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 46
79.3%
. 5
 
8.6%
& 4
 
6.9%
/ 2
 
3.4%
1
 
1.7%
Close Punctuation
ValueCountFrequency (%)
) 1040
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1023
100.0%
Space Separator
ValueCountFrequency (%)
213
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17300
86.1%
Common 2404
 
12.0%
Latin 379
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1092
 
6.3%
433
 
2.5%
393
 
2.3%
368
 
2.1%
344
 
2.0%
284
 
1.6%
283
 
1.6%
281
 
1.6%
273
 
1.6%
266
 
1.5%
Other values (548) 13283
76.8%
Latin
ValueCountFrequency (%)
S 59
15.6%
K 46
12.1%
B 31
 
8.2%
T 30
 
7.9%
C 24
 
6.3%
G 24
 
6.3%
I 21
 
5.5%
M 16
 
4.2%
N 16
 
4.2%
H 14
 
3.7%
Other values (27) 98
25.9%
Common
ValueCountFrequency (%)
) 1040
43.3%
( 1023
42.6%
213
 
8.9%
, 46
 
1.9%
- 20
 
0.8%
1 15
 
0.6%
2 9
 
0.4%
3 7
 
0.3%
6 5
 
0.2%
. 5
 
0.2%
Other values (9) 21
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17296
86.1%
ASCII 2782
 
13.9%
None 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1092
 
6.3%
433
 
2.5%
393
 
2.3%
368
 
2.1%
344
 
2.0%
284
 
1.6%
283
 
1.6%
281
 
1.6%
273
 
1.6%
266
 
1.5%
Other values (547) 13279
76.8%
ASCII
ValueCountFrequency (%)
) 1040
37.4%
( 1023
36.8%
213
 
7.7%
S 59
 
2.1%
, 46
 
1.7%
K 46
 
1.7%
B 31
 
1.1%
T 30
 
1.1%
C 24
 
0.9%
G 24
 
0.9%
Other values (45) 246
 
8.8%
None
ValueCountFrequency (%)
4
80.0%
1
 
20.0%

소속단체명
Text

MISSING 

Distinct367
Distinct (%)17.9%
Missing627
Missing (%)23.4%
Memory size21.0 KiB
2024-04-21T04:33:02.568334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length26
Mean length6.7239024
Min length1

Characters and Unicode

Total characters13784
Distinct characters206
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique228 ?
Unique (%)11.1%

Sample

1st row전국건설산업노동조합연맹
2nd row상급단체미가입
3rd row없음
4th row에이치피엘(주)
5th row상급단체 미가입
ValueCountFrequency (%)
없음 403
 
17.6%
미가입 272
 
11.9%
전국택시노동조합연맹 123
 
5.4%
상급단체미가입 118
 
5.2%
전국연합노동조합연맹 72
 
3.2%
한국노총 71
 
3.1%
상급단체 59
 
2.6%
택시노련 52
 
2.3%
민주노총 45
 
2.0%
전국자동차노동조합연맹 36
 
1.6%
Other values (349) 1033
45.2%
2024-04-21T04:33:02.973259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1208
 
8.8%
755
 
5.5%
745
 
5.4%
744
 
5.4%
693
 
5.0%
653
 
4.7%
642
 
4.7%
633
 
4.6%
444
 
3.2%
443
 
3.2%
Other values (196) 6824
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13363
96.9%
Space Separator 234
 
1.7%
Close Punctuation 83
 
0.6%
Open Punctuation 42
 
0.3%
Uppercase Letter 37
 
0.3%
Other Punctuation 20
 
0.1%
Dash Punctuation 4
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1208
 
9.0%
755
 
5.6%
745
 
5.6%
744
 
5.6%
693
 
5.2%
653
 
4.9%
642
 
4.8%
633
 
4.7%
444
 
3.3%
443
 
3.3%
Other values (176) 6403
47.9%
Uppercase Letter
ValueCountFrequency (%)
T 13
35.1%
I 13
35.1%
K 2
 
5.4%
S 2
 
5.4%
B 1
 
2.7%
G 1
 
2.7%
N 1
 
2.7%
E 1
 
2.7%
C 1
 
2.7%
P 1
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 7
35.0%
, 6
30.0%
? 5
25.0%
/ 2
 
10.0%
Space Separator
ValueCountFrequency (%)
234
100.0%
Close Punctuation
ValueCountFrequency (%)
) 83
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Decimal Number
ValueCountFrequency (%)
0 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13361
96.9%
Common 384
 
2.8%
Latin 37
 
0.3%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1208
 
9.0%
755
 
5.7%
745
 
5.6%
744
 
5.6%
693
 
5.2%
653
 
4.9%
642
 
4.8%
633
 
4.7%
444
 
3.3%
443
 
3.3%
Other values (174) 6401
47.9%
Latin
ValueCountFrequency (%)
T 13
35.1%
I 13
35.1%
K 2
 
5.4%
S 2
 
5.4%
B 1
 
2.7%
G 1
 
2.7%
N 1
 
2.7%
E 1
 
2.7%
C 1
 
2.7%
P 1
 
2.7%
Common
ValueCountFrequency (%)
234
60.9%
) 83
 
21.6%
( 42
 
10.9%
. 7
 
1.8%
, 6
 
1.6%
? 5
 
1.3%
- 4
 
1.0%
/ 2
 
0.5%
0 1
 
0.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13359
96.9%
ASCII 421
 
3.1%
Compat Jamo 2
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1208
 
9.0%
755
 
5.7%
745
 
5.6%
744
 
5.6%
693
 
5.2%
653
 
4.9%
642
 
4.8%
633
 
4.7%
444
 
3.3%
443
 
3.3%
Other values (173) 6399
47.9%
ASCII
ValueCountFrequency (%)
234
55.6%
) 83
 
19.7%
( 42
 
10.0%
T 13
 
3.1%
I 13
 
3.1%
. 7
 
1.7%
, 6
 
1.4%
? 5
 
1.2%
- 4
 
1.0%
K 2
 
0.5%
Other values (10) 12
 
2.9%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

노동조합주소
Text

MISSING 

Distinct2183
Distinct (%)87.9%
Missing193
Missing (%)7.2%
Memory size21.0 KiB
2024-04-21T04:33:03.279545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length46
Mean length28.018116
Min length16

Characters and Unicode

Total characters69597
Distinct characters507
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1968 ?
Unique (%)79.2%

Sample

1st row서울특별시 강동구 강일동 679번지 2호 MIDC빌딩
2nd row서울특별시 송파구 문정동 645번지 에이치비지니스파크
3rd row서울특별시 송파구 방이동 88번지 15호 한국체육대학교
4th row서울특별시 송파구 마천동 194번지 1호 덕왕기업
5th row서울특별시 강남구 역삼동 648번지 9호 21층
ValueCountFrequency (%)
서울특별시 2480
 
18.7%
강남구 416
 
3.1%
1호 291
 
2.2%
중구 254
 
1.9%
영등포구 215
 
1.6%
마포구 158
 
1.2%
2호 150
 
1.1%
서초구 137
 
1.0%
강서구 132
 
1.0%
송파구 128
 
1.0%
Other values (2471) 8915
67.2%
2024-04-21T04:33:03.773539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17025
24.5%
3016
 
4.3%
2660
 
3.8%
2619
 
3.8%
2565
 
3.7%
2545
 
3.7%
2539
 
3.6%
2491
 
3.6%
2483
 
3.6%
2420
 
3.5%
Other values (497) 29234
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41436
59.5%
Space Separator 17025
24.5%
Decimal Number 10428
 
15.0%
Uppercase Letter 322
 
0.5%
Dash Punctuation 178
 
0.3%
Open Punctuation 57
 
0.1%
Close Punctuation 57
 
0.1%
Lowercase Letter 51
 
0.1%
Other Punctuation 34
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3016
 
7.3%
2660
 
6.4%
2619
 
6.3%
2565
 
6.2%
2545
 
6.1%
2539
 
6.1%
2491
 
6.0%
2483
 
6.0%
2420
 
5.8%
1710
 
4.1%
Other values (437) 16388
39.6%
Uppercase Letter
ValueCountFrequency (%)
S 36
11.2%
T 33
10.2%
K 32
9.9%
B 31
 
9.6%
C 24
 
7.5%
M 20
 
6.2%
I 20
 
6.2%
D 19
 
5.9%
E 16
 
5.0%
R 12
 
3.7%
Other values (14) 79
24.5%
Lowercase Letter
ValueCountFrequency (%)
e 11
21.6%
s 8
15.7%
r 5
9.8%
t 4
 
7.8%
c 4
 
7.8%
i 4
 
7.8%
o 3
 
5.9%
a 3
 
5.9%
n 3
 
5.9%
b 2
 
3.9%
Other values (4) 4
 
7.8%
Decimal Number
ValueCountFrequency (%)
1 2230
21.4%
2 1285
12.3%
3 1148
11.0%
4 972
9.3%
5 949
9.1%
6 912
8.7%
7 806
 
7.7%
0 791
 
7.6%
8 691
 
6.6%
9 644
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 13
38.2%
. 10
29.4%
/ 9
26.5%
& 1
 
2.9%
1
 
2.9%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
17025
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 178
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41436
59.5%
Common 27785
39.9%
Latin 376
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3016
 
7.3%
2660
 
6.4%
2619
 
6.3%
2565
 
6.2%
2545
 
6.1%
2539
 
6.1%
2491
 
6.0%
2483
 
6.0%
2420
 
5.8%
1710
 
4.1%
Other values (437) 16388
39.6%
Latin
ValueCountFrequency (%)
S 36
 
9.6%
T 33
 
8.8%
K 32
 
8.5%
B 31
 
8.2%
C 24
 
6.4%
M 20
 
5.3%
I 20
 
5.3%
D 19
 
5.1%
E 16
 
4.3%
R 12
 
3.2%
Other values (30) 133
35.4%
Common
ValueCountFrequency (%)
17025
61.3%
1 2230
 
8.0%
2 1285
 
4.6%
3 1148
 
4.1%
4 972
 
3.5%
5 949
 
3.4%
6 912
 
3.3%
7 806
 
2.9%
0 791
 
2.8%
8 691
 
2.5%
Other values (10) 976
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41436
59.5%
ASCII 28157
40.5%
Number Forms 3
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17025
60.5%
1 2230
 
7.9%
2 1285
 
4.6%
3 1148
 
4.1%
4 972
 
3.5%
5 949
 
3.4%
6 912
 
3.2%
7 806
 
2.9%
0 791
 
2.8%
8 691
 
2.5%
Other values (47) 1348
 
4.8%
Hangul
ValueCountFrequency (%)
3016
 
7.3%
2660
 
6.4%
2619
 
6.3%
2565
 
6.2%
2545
 
6.1%
2539
 
6.1%
2491
 
6.0%
2483
 
6.0%
2420
 
5.8%
1710
 
4.1%
Other values (437) 16388
39.6%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
None
ValueCountFrequency (%)
1
100.0%

Interactions

2024-04-21T04:32:45.619205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T04:33:03.865427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호민원구분
인허가번호1.0000.181
민원구분0.1811.000
2024-04-21T04:33:03.933933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호민원구분
인허가번호1.0001.000
민원구분1.0001.000

Missing values

2024-04-21T04:33:00.567058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T04:33:00.689389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T04:33:00.781985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

인허가번호민원구분노동조합단체명사업장명소속단체명노동조합주소
02024324029026100003노동조합변경신고삼성이앤에이 노동조합 &U삼성이앤에이<NA><NA>
12024324029026100002노동조합변경신고안정호한국종합기술전국건설산업노동조합연맹<NA>
22024324029026100001노동조합변경신고도원에프앤지㈜노동조합도원에프앤지<NA>서울특별시 강동구 강일동 679번지 2호 MIDC빌딩
32024323029126100004노동조합설립신고클래시스생산본부노동조합문정공장<NA>서울특별시 송파구 문정동 645번지 에이치비지니스파크
42024323029126100002노동조합설립신고교권수호 한국체육대학교 교수 노동조합한국체육대학교<NA>서울특별시 송파구 방이동 88번지 15호 한국체육대학교
52024323029126100001노동조합설립신고덕왕기업㈜ 노동조합덕왕기업㈜<NA>서울특별시 송파구 마천동 194번지 1호 덕왕기업
62024322025026100002노동조합설립신고유니티테크놀로지코리아(유) 노동조합유니티테크놀로지<NA>서울특별시 강남구 역삼동 648번지 9호 21층
72024322025026100001노동조합설립신고태화태화용역<NA>서울특별시 강남구 일원동 639번지 1호 동아빌딩-B01
82024321019526100001노동조합설립신고금호익스프레스(주)노동조합금호익스프레스(주)<NA>서울특별시 서초구 반포동 19번지 4호 강남고속버스터미널 9층
92024317024926100002노동조합설립신고대한주택관리사협회 전국사무노조대한주택관리사협회<NA>서울특별시 금천구 가산동 60번지 73호 벽산디지털밸리5차-1514
인허가번호민원구분노동조합단체명사업장명소속단체명노동조합주소
26671969307011826100001노동조합해산신고대진여객(주) 노동조합대진여객(주)전국자동차노동조합연맹서울특별시 성북구 정릉동 820번지 18호
26681968303010326100002노동조합해산신고서울버스노동조합 태진운수지부서울버스노동조합<NA>서울특별시 성동구 성수동2가 649번지 1호
26691967301010026101201노동조합변경신고뱅크오브아메리카서울지점뱅크오브아메리카서울지점전국민주금융노동조합서울특별시 중구 태평로1가 84번지 파이낸스빌딩
26701967301010026101025노동조합설립신고서울클럽사단법인 서울클럽<NA>서울특별시 중구 장충동2가 208번지 서울클럽
26711966307011826100001노동조합해산신고대진여객 노동조합대진여객전국자동차노동조합서울특별시 성북구 정릉동 818번지
26721965307011826100001노동조합해산신고도원교통 노동조합도원교통(주)전자노련 서울버스서울특별시 성북구 정릉동 893번지 1호
26731963320009926100001노동조합설립신고한남여객지부(주)한남운수전국자동차노동조합연맹서울특별시 관악구 신림동 241번지 42호
26741963301010026100308노동조합해산신고국립중앙의료원 노동조합국립중앙의료원전국보건의료노동조합서울특별시 중구 을지로6가 18번지 79호
267519626110000101900001노동조합변경신고서울특별시청노동조합서울특별시청한국노총서울특별시 성동구 마장동 527번지
26761961307011826100001노동조합해산신고상진운수 노동조합상진운수주식회사서울시버스노동조합서울특별시 성북구 석관동 124번지 9호