Overview

Dataset statistics

Number of variables27
Number of observations10000
Missing cells41935
Missing cells (%)15.5%
Duplicate rows289
Duplicate rows (%)2.9%
Total size in memory2.2 MiB
Average record size in memory227.0 B

Variable types

DateTime6
Categorical11
Text9
Numeric1

Dataset

Description한국산업안전보건공단에서 제공하는 한국산업안전보건공단 인증원 민원접수 처리내역에 대한 내용으로심사종류, 사업장명, 대상품 대중소 분류, 관할지사에 대한 데이터를 제공합니다.
Author한국산업안전보건공단
URLhttps://www.data.go.kr/data/15087797/fileData.do

Alerts

Dataset has 289 (2.9%) duplicate rowsDuplicates
신청구분 is highly imbalanced (87.5%)Imbalance
재검사구분 is highly imbalanced (91.3%)Imbalance
수입품여부 is highly imbalanced (75.2%)Imbalance
변경여부 is highly imbalanced (89.6%)Imbalance
사업장개시번호 is highly imbalanced (99.9%)Imbalance
제조사업장개시번호 is highly imbalanced (99.8%)Imbalance
심사희망일 has 5459 (54.6%) missing valuesMissing
심사예정일 has 5766 (57.7%) missing valuesMissing
근로자수 has 1081 (10.8%) missing valuesMissing
업태 has 4137 (41.4%) missing valuesMissing
종목 has 4436 (44.4%) missing valuesMissing
제조사업장명 has 2043 (20.4%) missing valuesMissing
제조사업자등록번호 has 2051 (20.5%) missing valuesMissing
제조사업장관리번호 has 2049 (20.5%) missing valuesMissing
제조 국가 has 9987 (99.9%) missing valuesMissing
설치일자 has 4924 (49.2%) missing valuesMissing
근로자수 is highly skewed (γ1 = 35.53102684)Skewed
근로자수 has 379 (3.8%) zerosZeros

Reproduction

Analysis started2023-12-12 15:14:16.065649
Analysis finished2023-12-12 15:14:17.445781
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct208
Distinct (%)2.1%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
Minimum2022-01-24 00:00:00
Maximum2023-06-29 00:00:00
2023-12-13T00:14:17.514436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:14:17.669156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기관
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
한국승강기안전공단
3718 
대한산업안전협회
3300 
안전보건공단
2982 

Length

Max length9
Median length8
Mean length7.7754
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안전보건공단
2nd row대한산업안전협회
3rd row한국승강기안전공단
4th row한국승강기안전공단
5th row대한산업안전협회

Common Values

ValueCountFrequency (%)
한국승강기안전공단 3718
37.2%
대한산업안전협회 3300
33.0%
안전보건공단 2982
29.8%

Length

2023-12-13T00:14:17.821336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:17.936726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국승강기안전공단 3718
37.2%
대한산업안전협회 3300
33.0%
안전보건공단 2982
29.8%

지사
Categorical

Distinct42
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울지역본부
1507 
경기지역본부
1216 
대구경북지역본부
791 
인천광역본부
664 
중앙회
621 
Other values (37)
5201 

Length

Max length8
Median length6
Mean length5.9623
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전세종광역본부
2nd row경기지역본부
3rd row서울지역본부
4th row천안지사
5th row성남지회

Common Values

ValueCountFrequency (%)
서울지역본부 1507
15.1%
경기지역본부 1216
 
12.2%
대구경북지역본부 791
 
7.9%
인천광역본부 664
 
6.6%
중앙회 621
 
6.2%
부산광역본부 535
 
5.3%
경기강원지역본부 448
 
4.5%
부산지역본부 413
 
4.1%
대전지역본부 379
 
3.8%
경인지역본부 275
 
2.8%
Other values (32) 3151
31.5%

Length

2023-12-13T00:14:18.077059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울지역본부 1507
15.1%
경기지역본부 1216
 
12.2%
대구경북지역본부 791
 
7.9%
인천광역본부 664
 
6.6%
중앙회 621
 
6.2%
부산광역본부 535
 
5.3%
경기강원지역본부 448
 
4.5%
부산지역본부 413
 
4.1%
대전지역본부 379
 
3.8%
경인지역본부 275
 
2.8%
Other values (32) 3151
31.5%

노동관서
Categorical

Distinct49
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기
1281 
부천
714 
안산
690 
대구서부
 
512
양산
 
484
Other values (44)
6319 

Length

Max length9
Median length2
Mean length2.8542
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전청(7000)
2nd row부천
3rd row대전청(7000)
4th row서산
5th row성남

Common Values

ValueCountFrequency (%)
경기 1281
 
12.8%
부천 714
 
7.1%
안산 690
 
6.9%
대구서부 512
 
5.1%
양산 484
 
4.8%
부산북부 427
 
4.3%
천안 410
 
4.1%
평택 334
 
3.3%
창원 301
 
3.0%
중부청 274
 
2.7%
Other values (39) 4573
45.7%

Length

2023-12-13T00:14:18.264890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 1281
 
12.8%
부천 714
 
7.1%
안산 690
 
6.9%
대구서부 512
 
5.1%
양산 484
 
4.8%
부산북부 427
 
4.3%
천안 410
 
4.1%
평택 334
 
3.3%
창원 301
 
3.0%
중부청 274
 
2.7%
Other values (39) 4573
45.7%
Distinct206
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-24 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T00:14:18.441618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:14:18.637918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

신청구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Off-Line
9829 
Web
 
171

Length

Max length8
Median length8
Mean length7.9145
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOff-Line
2nd rowOff-Line
3rd rowOff-Line
4th rowOff-Line
5th rowOff-Line

Common Values

ValueCountFrequency (%)
Off-Line 9829
98.3%
Web 171
 
1.7%

Length

2023-12-13T00:14:18.797603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:18.924895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
off-line 9829
98.3%
web 171
 
1.7%

심사구분
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
제품심사(개별)
5739 
서면심사
3629 
제품심사(제작중)
 
516
제품심사(형식별)
 
114
기술능력 및 생산체계심사
 
2

Length

Max length13
Median length8
Mean length6.6124
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제품심사(제작중)
2nd row서면심사
3rd row서면심사
4th row제품심사(개별)
5th row제품심사(개별)

Common Values

ValueCountFrequency (%)
제품심사(개별) 5739
57.4%
서면심사 3629
36.3%
제품심사(제작중) 516
 
5.2%
제품심사(형식별) 114
 
1.1%
기술능력 및 생산체계심사 2
 
< 0.1%

Length

2023-12-13T00:14:19.060009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:19.193328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제품심사(개별 5739
57.4%
서면심사 3629
36.3%
제품심사(제작중 516
 
5.2%
제품심사(형식별 114
 
1.1%
기술능력 2
 
< 0.1%
2
 
< 0.1%
생산체계심사 2
 
< 0.1%

재검사구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신규신청
9890 
재검사
 
110

Length

Max length4
Median length4
Mean length3.989
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규신청
2nd row신규신청
3rd row신규신청
4th row신규신청
5th row신규신청

Common Values

ValueCountFrequency (%)
신규신청 9890
98.9%
재검사 110
 
1.1%

Length

2023-12-13T00:14:19.340374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:19.455530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규신청 9890
98.9%
재검사 110
 
1.1%

수입품여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
국내품
9587 
수입품
 
413

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내품
2nd row국내품
3rd row국내품
4th row국내품
5th row국내품

Common Values

ValueCountFrequency (%)
국내품 9587
95.9%
수입품 413
 
4.1%

Length

2023-12-13T00:14:19.590654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:19.717545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내품 9587
95.9%
수입품 413
 
4.1%

변경여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9863 
변경신청
 
137

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9863
98.6%
변경신청 137
 
1.4%

Length

2023-12-13T00:14:19.870320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:19.989981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9863
98.6%
변경신청 137
 
1.4%

심사희망일
Date

MISSING 

Distinct206
Distinct (%)4.5%
Missing5459
Missing (%)54.6%
Memory size156.2 KiB
Minimum2022-01-26 00:00:00
Maximum2023-12-28 00:00:00
2023-12-13T00:14:20.123319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:14:20.302618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

심사예정일
Date

MISSING 

Distinct154
Distinct (%)3.6%
Missing5766
Missing (%)57.7%
Memory size156.2 KiB
Minimum2022-09-07 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T00:14:20.471811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:14:20.635098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct147
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-01-02 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T00:14:20.775661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:14:20.952878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

대상품_대
Categorical

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
크레인
4765 
리프트
2071 
압력용기
1979 
곤돌라
 
473
프레스
 
180
Other values (5)
532 

Length

Max length5
Median length3
Mean length3.2599
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row압력용기
2nd row리프트
3rd row크레인
4th row크레인
5th row리프트

Common Values

ValueCountFrequency (%)
크레인 4765
47.6%
리프트 2071
20.7%
압력용기 1979
19.8%
곤돌라 473
 
4.7%
프레스 180
 
1.8%
고소작업대 164
 
1.6%
사출성형기 146
 
1.5%
절곡기 143
 
1.4%
전단기 71
 
0.7%
롤러기 8
 
0.1%

Length

2023-12-13T00:14:21.119421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:21.268062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
크레인 4765
47.6%
리프트 2071
20.7%
압력용기 1979
19.8%
곤돌라 473
 
4.7%
프레스 180
 
1.8%
고소작업대 164
 
1.6%
사출성형기 146
 
1.5%
절곡기 143
 
1.4%
전단기 71
 
0.7%
롤러기 8
 
0.1%
Distinct1755
Distinct (%)17.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:14:21.507651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length50
Mean length8.0654
Min length2

Characters and Unicode

Total characters80654
Distinct characters429
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique585 ?
Unique (%)5.9%

Sample

1st row강우산업
2nd row(주)부흥산전
3rd row삼공(주)
4th row극동호이스트(주)
5th row(주)모아기계산업정공
ValueCountFrequency (%)
주식회사 281
 
2.6%
국제산업렌탈(주 167
 
1.5%
주)대오정공 99
 
0.9%
이진산업(주 97
 
0.9%
주)동호리프트 96
 
0.9%
주)한국호이스트 91
 
0.8%
주)금강 83
 
0.8%
주)고려호이스트 77
 
0.7%
주)국제곤도라 64
 
0.6%
주)우리곤도라 61
 
0.6%
Other values (1841) 9693
89.7%
2023-12-13T00:14:22.287815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6806
 
8.4%
( 6152
 
7.6%
) 6147
 
7.6%
3932
 
4.9%
2761
 
3.4%
2279
 
2.8%
2114
 
2.6%
1883
 
2.3%
1818
 
2.3%
1751
 
2.2%
Other values (419) 45011
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62373
77.3%
Open Punctuation 6160
 
7.6%
Close Punctuation 6160
 
7.6%
Uppercase Letter 2900
 
3.6%
Lowercase Letter 1784
 
2.2%
Space Separator 810
 
1.0%
Other Punctuation 367
 
0.5%
Other Symbol 61
 
0.1%
Dash Punctuation 25
 
< 0.1%
Decimal Number 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6806
 
10.9%
3932
 
6.3%
2761
 
4.4%
2279
 
3.7%
2114
 
3.4%
1883
 
3.0%
1818
 
2.9%
1751
 
2.8%
1212
 
1.9%
1140
 
1.8%
Other values (353) 36677
58.8%
Uppercase Letter
ValueCountFrequency (%)
A 299
 
10.3%
N 274
 
9.4%
C 246
 
8.5%
E 212
 
7.3%
L 210
 
7.2%
T 191
 
6.6%
I 184
 
6.3%
G 181
 
6.2%
M 162
 
5.6%
S 137
 
4.7%
Other values (15) 804
27.7%
Lowercase Letter
ValueCountFrequency (%)
o 206
11.5%
i 187
 
10.5%
n 184
 
10.3%
e 130
 
7.3%
t 129
 
7.2%
a 128
 
7.2%
c 103
 
5.8%
r 91
 
5.1%
d 85
 
4.8%
g 73
 
4.1%
Other values (15) 468
26.2%
Other Punctuation
ValueCountFrequency (%)
. 216
58.9%
, 98
26.7%
& 48
 
13.1%
/ 5
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 6152
99.9%
7
 
0.1%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 6147
99.8%
12
 
0.2%
] 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 6
42.9%
2 5
35.7%
9 3
21.4%
Space Separator
ValueCountFrequency (%)
810
100.0%
Other Symbol
ValueCountFrequency (%)
61
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62434
77.4%
Common 13536
 
16.8%
Latin 4684
 
5.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6806
 
10.9%
3932
 
6.3%
2761
 
4.4%
2279
 
3.7%
2114
 
3.4%
1883
 
3.0%
1818
 
2.9%
1751
 
2.8%
1212
 
1.9%
1140
 
1.8%
Other values (354) 36738
58.8%
Latin
ValueCountFrequency (%)
A 299
 
6.4%
N 274
 
5.8%
C 246
 
5.3%
E 212
 
4.5%
L 210
 
4.5%
o 206
 
4.4%
T 191
 
4.1%
i 187
 
4.0%
n 184
 
3.9%
I 184
 
3.9%
Other values (40) 2491
53.2%
Common
ValueCountFrequency (%)
( 6152
45.4%
) 6147
45.4%
810
 
6.0%
. 216
 
1.6%
, 98
 
0.7%
& 48
 
0.4%
- 25
 
0.2%
12
 
0.1%
7
 
0.1%
1 6
 
< 0.1%
Other values (5) 15
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62373
77.3%
ASCII 18201
 
22.6%
None 80
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6806
 
10.9%
3932
 
6.3%
2761
 
4.4%
2279
 
3.7%
2114
 
3.4%
1883
 
3.0%
1818
 
2.9%
1751
 
2.8%
1212
 
1.9%
1140
 
1.8%
Other values (353) 36677
58.8%
ASCII
ValueCountFrequency (%)
( 6152
33.8%
) 6147
33.8%
810
 
4.5%
A 299
 
1.6%
N 274
 
1.5%
C 246
 
1.4%
. 216
 
1.2%
E 212
 
1.2%
L 210
 
1.2%
o 206
 
1.1%
Other values (53) 3429
18.8%
None
ValueCountFrequency (%)
61
76.2%
12
 
15.0%
7
 
8.8%
Distinct1801
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:14:22.581170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters100000
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique588 ?
Unique (%)5.9%

Sample

1st row3053371539
2nd row1218605946
3rd row3058601325
4th row1348101335
5th row4038126095
ValueCountFrequency (%)
1078160797 167
 
1.7%
1288119387 99
 
1.0%
1178153328 97
 
1.0%
1178135470 96
 
1.0%
6068163933 86
 
0.9%
2108153267 83
 
0.8%
1348132648 77
 
0.8%
1388108965 64
 
0.6%
1268121174 61
 
0.6%
1258175231 56
 
0.6%
Other values (1791) 9114
91.1%
2023-12-13T00:14:22.988250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 18900
18.9%
8 12878
12.9%
0 12380
12.4%
3 10099
10.1%
6 9246
9.2%
2 9077
9.1%
5 7609
7.6%
4 7487
 
7.5%
7 6867
 
6.9%
9 5296
 
5.3%
Other values (5) 161
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99839
99.8%
Uppercase Letter 161
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 18900
18.9%
8 12878
12.9%
0 12380
12.4%
3 10099
10.1%
6 9246
9.3%
2 9077
9.1%
5 7609
7.6%
4 7487
 
7.5%
7 6867
 
6.9%
9 5296
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
E 127
78.9%
B 11
 
6.8%
A 11
 
6.8%
M 10
 
6.2%
C 2
 
1.2%

Most occurring scripts

ValueCountFrequency (%)
Common 99839
99.8%
Latin 161
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 18900
18.9%
8 12878
12.9%
0 12380
12.4%
3 10099
10.1%
6 9246
9.3%
2 9077
9.1%
5 7609
7.6%
4 7487
 
7.5%
7 6867
 
6.9%
9 5296
 
5.3%
Latin
ValueCountFrequency (%)
E 127
78.9%
B 11
 
6.8%
A 11
 
6.8%
M 10
 
6.2%
C 2
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 18900
18.9%
8 12878
12.9%
0 12380
12.4%
3 10099
10.1%
6 9246
9.2%
2 9077
9.1%
5 7609
7.6%
4 7487
 
7.5%
7 6867
 
6.9%
9 5296
 
5.3%
Other values (5) 161
 
0.2%
Distinct1857
Distinct (%)18.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:14:23.318243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length10.9999
Min length10

Characters and Unicode

Total characters109999
Distinct characters31
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique628 ?
Unique (%)6.3%

Sample

1st row92211336417
2nd row12186059460
3rd row30586013256
4th row13481013350
5th row40381260950
ValueCountFrequency (%)
10781607970 125
 
1.2%
11781533280 97
 
1.0%
11781354700 96
 
1.0%
60681639330 86
 
0.9%
90700281171 83
 
0.8%
13481326480 71
 
0.7%
90700239501 63
 
0.6%
12881193870 61
 
0.6%
12681211740 61
 
0.6%
30381391880 56
 
0.6%
Other values (1847) 9201
92.0%
2023-12-13T00:14:23.840959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 22452
20.4%
1 18720
17.0%
8 11944
10.9%
2 10074
9.2%
3 9207
8.4%
6 8945
 
8.1%
5 7011
 
6.4%
7 6995
 
6.4%
4 6953
 
6.3%
9 5974
 
5.4%
Other values (21) 1724
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 108275
98.4%
Uppercase Letter 1724
 
1.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 880
51.0%
C 326
 
18.9%
B 184
 
10.7%
A 151
 
8.8%
M 47
 
2.7%
G 26
 
1.5%
I 23
 
1.3%
F 19
 
1.1%
U 17
 
1.0%
J 11
 
0.6%
Other values (11) 40
 
2.3%
Decimal Number
ValueCountFrequency (%)
0 22452
20.7%
1 18720
17.3%
8 11944
11.0%
2 10074
9.3%
3 9207
8.5%
6 8945
 
8.3%
5 7011
 
6.5%
7 6995
 
6.5%
4 6953
 
6.4%
9 5974
 
5.5%

Most occurring scripts

ValueCountFrequency (%)
Common 108275
98.4%
Latin 1724
 
1.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 880
51.0%
C 326
 
18.9%
B 184
 
10.7%
A 151
 
8.8%
M 47
 
2.7%
G 26
 
1.5%
I 23
 
1.3%
F 19
 
1.1%
U 17
 
1.0%
J 11
 
0.6%
Other values (11) 40
 
2.3%
Common
ValueCountFrequency (%)
0 22452
20.7%
1 18720
17.3%
8 11944
11.0%
2 10074
9.3%
3 9207
8.5%
6 8945
 
8.3%
5 7011
 
6.5%
7 6995
 
6.5%
4 6953
 
6.4%
9 5974
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 109999
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 22452
20.4%
1 18720
17.0%
8 11944
10.9%
2 10074
9.2%
3 9207
8.4%
6 8945
 
8.1%
5 7011
 
6.4%
7 6995
 
6.4%
4 6953
 
6.3%
9 5974
 
5.4%
Other values (21) 1724
 
1.6%

사업장개시번호
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9999 
91301592387
 
1

Length

Max length11
Median length1
Mean length1.001
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9999
> 99.9%
91301592387 1
 
< 0.1%

Length

2023-12-13T00:14:23.992043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:24.076122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9999
> 99.9%
91301592387 1
 
< 0.1%

근로자수
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct146
Distinct (%)1.6%
Missing1081
Missing (%)10.8%
Infinite0
Infinite (%)0.0%
Mean29.372351
Minimum0
Maximum12147
Zeros379
Zeros (%)3.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T00:14:24.182643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median7
Q320
95-th percentile77
Maximum12147
Range12147
Interquartile range (IQR)18

Descriptive statistics

Standard deviation294.24116
Coefficient of variation (CV)10.017624
Kurtosis1392.0956
Mean29.372351
Median Absolute Deviation (MAD)6
Skewness35.531027
Sum261972
Variance86577.858
MonotonicityNot monotonic
2023-12-13T00:14:24.312346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1369
 
13.7%
5 765
 
7.6%
3 595
 
5.9%
2 511
 
5.1%
21 470
 
4.7%
4 406
 
4.1%
0 379
 
3.8%
10 339
 
3.4%
6 333
 
3.3%
11 298
 
3.0%
Other values (136) 3454
34.5%
(Missing) 1081
 
10.8%
ValueCountFrequency (%)
0 379
 
3.8%
1 1369
13.7%
2 511
 
5.1%
3 595
5.9%
4 406
 
4.1%
5 765
7.6%
6 333
 
3.3%
7 152
 
1.5%
8 196
 
2.0%
9 229
 
2.3%
ValueCountFrequency (%)
12147 2
< 0.1%
12144 2
< 0.1%
8867 1
 
< 0.1%
4600 1
 
< 0.1%
3500 2
< 0.1%
2820 1
 
< 0.1%
2078 2
< 0.1%
1896 2
< 0.1%
1851 2
< 0.1%
1500 4
< 0.1%

업태
Text

MISSING 

Distinct77
Distinct (%)1.3%
Missing4137
Missing (%)41.4%
Memory size156.2 KiB
2023-12-13T00:14:24.497715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length3
Mean length3.1175166
Min length1

Characters and Unicode

Total characters18278
Distinct characters47
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)0.2%

Sample

1st row제조업외
2nd row제조업
3rd row제조업
4th row제조
5th row제조
ValueCountFrequency (%)
제조업 2480
39.7%
제조 1621
25.9%
건설업 294
 
4.7%
제조업외 286
 
4.6%
건설외 257
 
4.1%
164
 
2.6%
159
 
2.5%
건설 145
 
2.3%
제조외 107
 
1.7%
서비스,제조 56
 
0.9%
Other values (62) 684
 
10.9%
2023-12-13T00:14:24.840130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4848
26.5%
4846
26.5%
3309
18.1%
916
 
5.0%
819
 
4.5%
800
 
4.4%
390
 
2.1%
, 390
 
2.1%
244
 
1.3%
235
 
1.3%
Other values (37) 1481
 
8.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17334
94.8%
Other Punctuation 410
 
2.2%
Space Separator 390
 
2.1%
Decimal Number 144
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4848
28.0%
4846
28.0%
3309
19.1%
916
 
5.3%
819
 
4.7%
800
 
4.6%
244
 
1.4%
235
 
1.4%
223
 
1.3%
216
 
1.2%
Other values (30) 878
 
5.1%
Decimal Number
ValueCountFrequency (%)
1 73
50.7%
6 35
24.3%
0 29
 
20.1%
2 7
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 390
95.1%
. 20
 
4.9%
Space Separator
ValueCountFrequency (%)
390
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17334
94.8%
Common 944
 
5.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4848
28.0%
4846
28.0%
3309
19.1%
916
 
5.3%
819
 
4.7%
800
 
4.6%
244
 
1.4%
235
 
1.4%
223
 
1.3%
216
 
1.2%
Other values (30) 878
 
5.1%
Common
ValueCountFrequency (%)
390
41.3%
, 390
41.3%
1 73
 
7.7%
6 35
 
3.7%
0 29
 
3.1%
. 20
 
2.1%
2 7
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17334
94.8%
ASCII 944
 
5.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4848
28.0%
4846
28.0%
3309
19.1%
916
 
5.3%
819
 
4.7%
800
 
4.6%
244
 
1.4%
235
 
1.4%
223
 
1.3%
216
 
1.2%
Other values (30) 878
 
5.1%
ASCII
ValueCountFrequency (%)
390
41.3%
, 390
41.3%
1 73
 
7.7%
6 35
 
3.7%
0 29
 
3.1%
. 20
 
2.1%
2 7
 
0.7%

종목
Text

MISSING 

Distinct498
Distinct (%)9.0%
Missing4436
Missing (%)44.4%
Memory size156.2 KiB
2023-12-13T00:14:25.081260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length6.4705248
Min length2

Characters and Unicode

Total characters36002
Distinct characters234
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)2.4%

Sample

1st row승강기제조외
2nd row운반하역기계
3rd row건설장비
4th row호이스트철구조물
5th row산업기계
ValueCountFrequency (%)
284
 
4.2%
호이스트 281
 
4.1%
산업기계 268
 
3.9%
158
 
2.3%
건설기계대여외 155
 
2.3%
임대 150
 
2.2%
압력용기 139
 
2.0%
운반하역기계 132
 
1.9%
크레인 125
 
1.8%
물품취급장비외 111
 
1.6%
Other values (477) 4994
73.5%
2023-12-13T00:14:25.528042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3033
 
8.4%
1819
 
5.1%
1449
 
4.0%
1233
 
3.4%
1192
 
3.3%
1034
 
2.9%
928
 
2.6%
900
 
2.5%
877
 
2.4%
875
 
2.4%
Other values (224) 22662
62.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34006
94.5%
Space Separator 1233
 
3.4%
Other Punctuation 724
 
2.0%
Uppercase Letter 21
 
0.1%
Decimal Number 10
 
< 0.1%
Close Punctuation 4
 
< 0.1%
Open Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3033
 
8.9%
1819
 
5.3%
1449
 
4.3%
1192
 
3.5%
1034
 
3.0%
928
 
2.7%
900
 
2.6%
877
 
2.6%
875
 
2.6%
853
 
2.5%
Other values (210) 21046
61.9%
Uppercase Letter
ValueCountFrequency (%)
C 10
47.6%
D 4
 
19.0%
L 4
 
19.0%
N 3
 
14.3%
Decimal Number
ValueCountFrequency (%)
2 7
70.0%
6 1
 
10.0%
1 1
 
10.0%
4 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 698
96.4%
/ 24
 
3.3%
. 2
 
0.3%
Space Separator
ValueCountFrequency (%)
1233
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34006
94.5%
Common 1975
 
5.5%
Latin 21
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3033
 
8.9%
1819
 
5.3%
1449
 
4.3%
1192
 
3.5%
1034
 
3.0%
928
 
2.7%
900
 
2.6%
877
 
2.6%
875
 
2.6%
853
 
2.5%
Other values (210) 21046
61.9%
Common
ValueCountFrequency (%)
1233
62.4%
, 698
35.3%
/ 24
 
1.2%
2 7
 
0.4%
) 4
 
0.2%
( 4
 
0.2%
. 2
 
0.1%
6 1
 
0.1%
1 1
 
0.1%
4 1
 
0.1%
Latin
ValueCountFrequency (%)
C 10
47.6%
D 4
 
19.0%
L 4
 
19.0%
N 3
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34006
94.5%
ASCII 1996
 
5.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3033
 
8.9%
1819
 
5.3%
1449
 
4.3%
1192
 
3.5%
1034
 
3.0%
928
 
2.7%
900
 
2.6%
877
 
2.6%
875
 
2.6%
853
 
2.5%
Other values (210) 21046
61.9%
ASCII
ValueCountFrequency (%)
1233
61.8%
, 698
35.0%
/ 24
 
1.2%
C 10
 
0.5%
2 7
 
0.4%
) 4
 
0.2%
( 4
 
0.2%
D 4
 
0.2%
L 4
 
0.2%
N 3
 
0.2%
Other values (4) 5
 
0.3%

제조사업장명
Text

MISSING 

Distinct1610
Distinct (%)20.2%
Missing2043
Missing (%)20.4%
Memory size156.2 KiB
2023-12-13T00:14:25.742722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length50
Mean length8.2426794
Min length2

Characters and Unicode

Total characters65587
Distinct characters419
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique636 ?
Unique (%)8.0%

Sample

1st row강우산업
2nd row(주)부흥산전
3rd row삼공(주)
4th row(주)모아기계산업정공
5th row삼성기계호이스트
ValueCountFrequency (%)
주식회사 198
 
2.3%
국제산업렌탈(주 122
 
1.4%
이진산업(주 79
 
0.9%
주)고려호이스트 66
 
0.8%
주)대오정공 65
 
0.7%
주)국제곤도라 62
 
0.7%
주)금강 61
 
0.7%
ltd 59
 
0.7%
co 56
 
0.6%
주)경민호이스트 52
 
0.6%
Other values (1722) 7931
90.6%
2023-12-13T00:14:26.233489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5340
 
8.1%
( 4853
 
7.4%
) 4849
 
7.4%
3107
 
4.7%
2191
 
3.3%
1765
 
2.7%
1629
 
2.5%
1513
 
2.3%
1384
 
2.1%
1364
 
2.1%
Other values (409) 37592
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49382
75.3%
Open Punctuation 4860
 
7.4%
Close Punctuation 4860
 
7.4%
Uppercase Letter 3552
 
5.4%
Lowercase Letter 1692
 
2.6%
Space Separator 795
 
1.2%
Other Punctuation 369
 
0.6%
Other Symbol 42
 
0.1%
Dash Punctuation 23
 
< 0.1%
Decimal Number 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5340
 
10.8%
3107
 
6.3%
2191
 
4.4%
1765
 
3.6%
1629
 
3.3%
1513
 
3.1%
1384
 
2.8%
1364
 
2.8%
983
 
2.0%
916
 
1.9%
Other values (343) 29190
59.1%
Uppercase Letter
ValueCountFrequency (%)
A 371
 
10.4%
C 322
 
9.1%
N 317
 
8.9%
T 247
 
7.0%
I 244
 
6.9%
O 240
 
6.8%
L 223
 
6.3%
E 210
 
5.9%
M 209
 
5.9%
G 170
 
4.8%
Other values (16) 999
28.1%
Lowercase Letter
ValueCountFrequency (%)
o 202
11.9%
n 179
10.6%
i 174
 
10.3%
t 126
 
7.4%
e 125
 
7.4%
a 117
 
6.9%
c 94
 
5.6%
r 83
 
4.9%
d 80
 
4.7%
h 71
 
4.2%
Other values (15) 441
26.1%
Other Punctuation
ValueCountFrequency (%)
. 215
58.3%
, 100
27.1%
& 47
 
12.7%
/ 7
 
1.9%
Decimal Number
ValueCountFrequency (%)
2 5
45.5%
1 4
36.4%
9 2
 
18.2%
Open Punctuation
ValueCountFrequency (%)
( 4853
99.9%
7
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 4849
99.8%
11
 
0.2%
Space Separator
ValueCountFrequency (%)
795
100.0%
Other Symbol
ValueCountFrequency (%)
42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49424
75.4%
Common 10919
 
16.6%
Latin 5244
 
8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5340
 
10.8%
3107
 
6.3%
2191
 
4.4%
1765
 
3.6%
1629
 
3.3%
1513
 
3.1%
1384
 
2.8%
1364
 
2.8%
983
 
2.0%
916
 
1.9%
Other values (344) 29232
59.1%
Latin
ValueCountFrequency (%)
A 371
 
7.1%
C 322
 
6.1%
N 317
 
6.0%
T 247
 
4.7%
I 244
 
4.7%
O 240
 
4.6%
L 223
 
4.3%
E 210
 
4.0%
M 209
 
4.0%
o 202
 
3.9%
Other values (41) 2659
50.7%
Common
ValueCountFrequency (%)
( 4853
44.4%
) 4849
44.4%
795
 
7.3%
. 215
 
2.0%
, 100
 
0.9%
& 47
 
0.4%
- 23
 
0.2%
11
 
0.1%
7
 
0.1%
/ 7
 
0.1%
Other values (4) 12
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49382
75.3%
ASCII 16145
 
24.6%
None 60
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5340
 
10.8%
3107
 
6.3%
2191
 
4.4%
1765
 
3.6%
1629
 
3.3%
1513
 
3.1%
1384
 
2.8%
1364
 
2.8%
983
 
2.0%
916
 
1.9%
Other values (343) 29190
59.1%
ASCII
ValueCountFrequency (%)
( 4853
30.1%
) 4849
30.0%
795
 
4.9%
A 371
 
2.3%
C 322
 
2.0%
N 317
 
2.0%
T 247
 
1.5%
I 244
 
1.5%
O 240
 
1.5%
L 223
 
1.4%
Other values (53) 3684
22.8%
None
ValueCountFrequency (%)
42
70.0%
11
 
18.3%
7
 
11.7%
Distinct1633
Distinct (%)20.5%
Missing2051
Missing (%)20.5%
Memory size156.2 KiB
2023-12-13T00:14:26.554606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters79490
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique618 ?
Unique (%)7.8%

Sample

1st row3053371539
2nd row1218605946
3rd row3058601325
4th row4038126095
5th row6062759723
ValueCountFrequency (%)
1078160797 122
 
1.5%
1178153328 79
 
1.0%
1348132648 66
 
0.8%
1288119387 65
 
0.8%
1388108965 62
 
0.8%
2108153267 61
 
0.8%
6228125192 52
 
0.7%
1268121174 51
 
0.6%
1178135470 50
 
0.6%
1258175231 48
 
0.6%
Other values (1623) 7293
91.7%
2023-12-13T00:14:26.945116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 15265
19.2%
8 10207
12.8%
0 9738
12.3%
3 7958
10.0%
2 7363
9.3%
6 7137
9.0%
5 6029
 
7.6%
4 6005
 
7.6%
7 5408
 
6.8%
9 4190
 
5.3%
Other values (4) 190
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79300
99.8%
Uppercase Letter 190
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 15265
19.2%
8 10207
12.9%
0 9738
12.3%
3 7958
10.0%
2 7363
9.3%
6 7137
9.0%
5 6029
 
7.6%
4 6005
 
7.6%
7 5408
 
6.8%
9 4190
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
E 162
85.3%
B 11
 
5.8%
A 11
 
5.8%
M 6
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
Common 79300
99.8%
Latin 190
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 15265
19.2%
8 10207
12.9%
0 9738
12.3%
3 7958
10.0%
2 7363
9.3%
6 7137
9.0%
5 6029
 
7.6%
4 6005
 
7.6%
7 5408
 
6.8%
9 4190
 
5.3%
Latin
ValueCountFrequency (%)
E 162
85.3%
B 11
 
5.8%
A 11
 
5.8%
M 6
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79490
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 15265
19.2%
8 10207
12.8%
0 9738
12.3%
3 7958
10.0%
2 7363
9.3%
6 7137
9.0%
5 6029
 
7.6%
4 6005
 
7.6%
7 5408
 
6.8%
9 4190
 
5.3%
Other values (4) 190
 
0.2%
Distinct1684
Distinct (%)21.2%
Missing2049
Missing (%)20.5%
Memory size156.2 KiB
2023-12-13T00:14:27.230352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length10.998617
Min length1

Characters and Unicode

Total characters87450
Distinct characters30
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique666 ?
Unique (%)8.4%

Sample

1st row92211336417
2nd row12186059460
3rd row30586013256
4th row40381260950
5th row60627597230
ValueCountFrequency (%)
10781607970 97
 
1.2%
11781533280 79
 
1.0%
13481326480 62
 
0.8%
90700281171 61
 
0.8%
90700239501 59
 
0.7%
62281251920 52
 
0.7%
12681211740 51
 
0.6%
12881193870 51
 
0.6%
11781354700 50
 
0.6%
12581752310 48
 
0.6%
Other values (1674) 7341
92.3%
2023-12-13T00:14:27.607720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 17700
20.2%
1 15083
17.2%
8 9493
10.9%
2 8180
9.4%
3 7269
8.3%
6 6962
 
8.0%
4 5600
 
6.4%
5 5536
 
6.3%
7 5462
 
6.2%
9 4667
 
5.3%
Other values (20) 1498
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 85952
98.3%
Uppercase Letter 1497
 
1.7%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 804
53.7%
C 270
 
18.0%
B 156
 
10.4%
A 140
 
9.4%
G 24
 
1.6%
I 21
 
1.4%
M 19
 
1.3%
F 15
 
1.0%
U 9
 
0.6%
J 8
 
0.5%
Other values (9) 31
 
2.1%
Decimal Number
ValueCountFrequency (%)
0 17700
20.6%
1 15083
17.5%
8 9493
11.0%
2 8180
9.5%
3 7269
8.5%
6 6962
 
8.1%
4 5600
 
6.5%
5 5536
 
6.4%
7 5462
 
6.4%
9 4667
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 85953
98.3%
Latin 1497
 
1.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 804
53.7%
C 270
 
18.0%
B 156
 
10.4%
A 140
 
9.4%
G 24
 
1.6%
I 21
 
1.4%
M 19
 
1.3%
F 15
 
1.0%
U 9
 
0.6%
J 8
 
0.5%
Other values (9) 31
 
2.1%
Common
ValueCountFrequency (%)
0 17700
20.6%
1 15083
17.5%
8 9493
11.0%
2 8180
9.5%
3 7269
8.5%
6 6962
 
8.1%
4 5600
 
6.5%
5 5536
 
6.4%
7 5462
 
6.4%
9 4667
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 87450
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17700
20.2%
1 15083
17.2%
8 9493
10.9%
2 8180
9.4%
3 7269
8.3%
6 6962
 
8.0%
4 5600
 
6.4%
5 5536
 
6.3%
7 5462
 
6.2%
9 4667
 
5.3%
Other values (20) 1498
 
1.7%

제조사업장개시번호
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9998 
91907318607
 
1
92124998367
 
1

Length

Max length11
Median length4
Mean length4.0014
Min length4

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9998
> 99.9%
91907318607 1
 
< 0.1%
92124998367 1
 
< 0.1%

Length

2023-12-13T00:14:27.733813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:14:27.818779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9998
> 99.9%
91907318607 1
 
< 0.1%
92124998367 1
 
< 0.1%

제조 국가
Text

MISSING 

Distinct9
Distinct (%)69.2%
Missing9987
Missing (%)99.9%
Memory size156.2 KiB
2023-12-13T00:14:27.926030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length4.3076923
Min length2

Characters and Unicode

Total characters56
Distinct characters29
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)53.8%

Sample

1st rowGermany
2nd rowCHINA
3rd row한국
4th rowTURKEY
5th row한국
ValueCountFrequency (%)
한국 4
30.8%
china 3
23.1%
germany 1
 
7.7%
turkey 1
 
7.7%
대한민국 1
 
7.7%
spain 1
 
7.7%
japan 1
 
7.7%
france 1
 
7.7%
2023-12-13T00:14:28.168183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
 
8.9%
a 5
 
8.9%
5
 
8.9%
C 4
 
7.1%
n 4
 
7.1%
A 3
 
5.4%
N 3
 
5.4%
R 2
 
3.6%
E 2
 
3.6%
I 2
 
3.6%
Other values (19) 21
37.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 26
46.4%
Lowercase Letter 18
32.1%
Other Letter 12
21.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 4
15.4%
A 3
11.5%
N 3
11.5%
R 2
 
7.7%
E 2
 
7.7%
I 2
 
7.7%
J 1
 
3.8%
P 1
 
3.8%
S 1
 
3.8%
Y 1
 
3.8%
Other values (6) 6
23.1%
Lowercase Letter
ValueCountFrequency (%)
a 5
27.8%
n 4
22.2%
i 2
 
11.1%
h 2
 
11.1%
p 1
 
5.6%
e 1
 
5.6%
y 1
 
5.6%
m 1
 
5.6%
r 1
 
5.6%
Other Letter
ValueCountFrequency (%)
5
41.7%
5
41.7%
1
 
8.3%
1
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 44
78.6%
Hangul 12
 
21.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 5
 
11.4%
C 4
 
9.1%
n 4
 
9.1%
A 3
 
6.8%
N 3
 
6.8%
R 2
 
4.5%
E 2
 
4.5%
I 2
 
4.5%
i 2
 
4.5%
h 2
 
4.5%
Other values (15) 15
34.1%
Hangul
ValueCountFrequency (%)
5
41.7%
5
41.7%
1
 
8.3%
1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44
78.6%
Hangul 12
 
21.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
41.7%
5
41.7%
1
 
8.3%
1
 
8.3%
ASCII
ValueCountFrequency (%)
a 5
 
11.4%
C 4
 
9.1%
n 4
 
9.1%
A 3
 
6.8%
N 3
 
6.8%
R 2
 
4.5%
E 2
 
4.5%
I 2
 
4.5%
i 2
 
4.5%
h 2
 
4.5%
Other values (15) 15
34.1%

설치일자
Date

MISSING 

Distinct143
Distinct (%)2.8%
Missing4924
Missing (%)49.2%
Memory size156.2 KiB
Minimum2023-01-02 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T00:14:28.295477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:14:28.436477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Sample

신청일기관지사노동관서접수일신청구분심사구분재검사구분수입품여부변경여부심사희망일심사예정일심사완료일대상품_대사업장명사업자등록번호산재관리번호사업장개시번호근로자수업태종목제조사업장명제조사업자등록번호제조사업장관리번호제조사업장개시번호제조 국가설치일자
125532023-01-13안전보건공단대전세종광역본부대전청(7000)2023-01-13Off-Line제품심사(제작중)신규신청국내품<NA>2023-01-13<NA>2023-02-09압력용기강우산업30533715399221133641701<NA><NA>강우산업305337153992211336417<NA><NA>2023-02-09
75842023-03-28대한산업안전협회경기지역본부부천2023-03-28Off-Line서면심사신규신청국내품<NA><NA><NA>2023-04-10리프트(주)부흥산전121860594612186059460013제조업외승강기제조외(주)부흥산전121860594612186059460<NA><NA><NA>
131622023-01-30한국승강기안전공단서울지역본부대전청(7000)2023-01-30Off-Line서면심사신규신청국내품<NA><NA><NA>2023-02-02크레인삼공(주)3058601325305860132560<NA><NA><NA>삼공(주)305860132530586013256<NA><NA><NA>
107522023-02-14한국승강기안전공단천안지사서산2023-02-20Off-Line제품심사(개별)신규신청국내품<NA>2023-03-032023-03-032023-03-03크레인극동호이스트(주)134810133513481013350020제조업운반하역기계<NA><NA><NA><NA><NA>2023-03-03
83922023-03-13대한산업안전협회성남지회성남2023-03-13Off-Line제품심사(개별)신규신청국내품<NA>2023-03-312023-03-312023-03-31리프트(주)모아기계산업정공40381260954038126095008제조업건설장비(주)모아기계산업정공403812609540381260950<NA><NA>2023-03-31
44732023-05-04대한산업안전협회부산지역본부부산북부2023-05-04Off-Line서면심사신규신청국내품<NA><NA><NA>2023-05-15크레인삼성기계호이스트60627597236062759723003제조호이스트철구조물삼성기계호이스트606275972360627597230<NA><NA><NA>
96632023-03-07한국승강기안전공단부산경남지역본부부산북부2023-03-07Off-Line제품심사(개별)신규신청국내품<NA><NA><NA>2023-03-16크레인신성중공업(주)606860880460686088040010제조산업기계<NA><NA><NA><NA><NA>2023-03-16
89482023-03-17한국승강기안전공단서울지역본부중부청2023-03-17Off-Line서면심사신규신청국내품<NA><NA><NA>2023-03-24크레인현대호이스트 인천12202322951220232295005도소매,서비스호이스트,크레인현대호이스트 인천122023229512202322950<NA><NA><NA>
125502022-12-09안전보건공단경기지역본부경기2022-12-09Off-Line제품심사(제작중)신규신청국내품<NA>2022-12-12<NA>2023-02-09압력용기천복테크(주)86886003038688600303009제조업기계<NA><NA><NA><NA><NA><NA>
113972022-12-13안전보건공단경기지역본부성남2022-12-12Off-Line제품심사(제작중)신규신청국내품<NA>2022-12-22<NA>2023-02-22압력용기(주)제이에프씨312813623031281362300014<NA><NA><NA><NA><NA><NA><NA><NA>
신청일기관지사노동관서접수일신청구분심사구분재검사구분수입품여부변경여부심사희망일심사예정일심사완료일대상품_대사업장명사업자등록번호산재관리번호사업장개시번호근로자수업태종목제조사업장명제조사업자등록번호제조사업장관리번호제조사업장개시번호제조 국가설치일자
114912023-02-08대한산업안전협회충북지회청주2023-02-08Off-Line제품심사(개별)신규신청국내품<NA><NA>2023-02-222023-02-22리프트이진산업(주)117815332811781533280011건설업리프트,타워크레인 임대<NA><NA><NA><NA><NA>2023-02-22
128802022-12-12안전보건공단경기지역본부경기2022-12-12Off-Line제품심사(제작중)신규신청국내품<NA>2022-12-132022-12-132023-02-06압력용기한강산업주식회사130862789013086278900011<NA><NA><NA><NA><NA><NA><NA><NA>
44532023-05-09한국승강기안전공단경기강원지역본부경기2023-05-09Off-Line제품심사(개별)신규신청국내품<NA>2023-05-152023-05-152023-05-15크레인동해기계12443735621244373562001<NA><NA>동해기계124437356212443735620<NA><NA>2023-05-15
96062023-03-07안전보건공단인천광역본부의정부2023-03-07Off-Line제품심사(개별)신규신청국내품<NA>2023-03-082023-03-082023-03-16압력용기주식회사 진우기계45287018224528701822001제조압력용기주식회사 진우기계452870182245287018220<NA><NA><NA>
42952023-05-12대한산업안전협회경기지역본부중부청2023-05-12Off-Line서면심사신규신청국내품<NA><NA><NA>2023-05-16리프트(주)우리피엠아이13186355441318635544005제조업,도매 및 소매업산업용기계,전기자재(주)우리피엠아이131863554413186355440<NA><NA><NA>
127842023-01-26안전보건공단경기지역본부경기2023-01-26Off-Line제품심사(개별)신규신청국내품<NA>2023-01-312023-02-062023-02-07압력용기지에이텍(주)14081078961408107896008제조공압기계및부품지에이텍(주)140810789614081078960<NA><NA><NA>
57842023-04-17한국승강기안전공단경기강원지역본부경기2023-04-17Off-Line제품심사(개별)신규신청국내품<NA>2023-04-282023-04-282023-04-28크레인대상산업1430163425143016342500<NA>제조업호이스트대상산업143016342514301634250<NA><NA>2023-04-28
28682023-05-17한국승강기안전공단서울지역본부안산2023-05-17Off-Line서면심사신규신청국내품<NA><NA><NA>2023-06-01크레인(주)고려호이스트13481326481348132648005제조업물품취급장비외(주)고려호이스트134813264813481326480<NA><NA><NA>
30792023-05-23한국승강기안전공단서울지역본부성남2023-05-23Off-Line서면심사신규신청국내품<NA><NA><NA>2023-05-30리프트(주)태영엘리베이터써비스1298659183129865918300<NA><NA><NA>(주)태영엘리베이터써비스129865918312986591830<NA><NA><NA>
24492023-05-26한국승강기안전공단대구경북지역본부양산2023-05-26Off-Line서면심사신규신청국내품<NA><NA><NA>2023-06-07크레인에스텍시스템61511714676151171467004<NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

신청일기관지사노동관서접수일신청구분심사구분재검사구분수입품여부변경여부심사희망일심사예정일심사완료일대상품_대사업장명사업자등록번호산재관리번호사업장개시번호근로자수업태종목제조사업장명제조사업자등록번호제조사업장관리번호제조사업장개시번호제조 국가설치일자# duplicates
22022-08-23안전보건공단경기지역본부안산2022-08-23Off-Line제품심사(제작중)신규신청국내품<NA>2022-08-25<NA>2023-01-03압력용기(주)일신엔지니어링133814196913381419690017제조.도매화학장치기계<NA><NA><NA><NA><NA><NA>5
62022-10-12안전보건공단경기지역본부경기2022-10-12Off-Line제품심사(제작중)신규신청국내품<NA>2022-10-20<NA>2023-01-17압력용기피티케이(주)51381311965138131196609제조플랜트장치<NA><NA><NA><NA><NA><NA>5
102022-11-09안전보건공단경기지역본부경기2022-11-09Off-Line제품심사(제작중)신규신청국내품<NA>2022-11-10<NA>2023-02-21압력용기(주)우리이엠아이143810187214381018720024제조특정설비<NA><NA><NA><NA><NA><NA>4
912023-02-07안전보건공단인천광역본부부천2023-02-07Off-Line제품심사(제작중)신규신청국내품<NA>2023-02-09<NA>2023-06-20압력용기(주)템텍52381004055238100405004제조업외밸브류 외<NA><NA><NA><NA><NA><NA>4
1192023-02-27한국승강기안전공단서울지역본부부천2023-02-27Off-Line서면심사신규신청국내품<NA><NA><NA>2023-03-06크레인동성중공업(주)13781897421378189742004<NA><NA>동성중공업(주)137818974213781897420<NA><NA><NA>4
1292023-03-03안전보건공단경기지역본부안산2023-03-03Off-Line제품심사(제작중)신규신청국내품<NA>2023-03-03<NA>2023-05-04압력용기(주)태인에프앤씨140811939214081193920043제조산업용기계<NA><NA><NA><NA><NA><NA>4
1542023-03-20안전보건공단경기지역본부안산2023-03-20Off-Line제품심사(제작중)신규신청국내품<NA>2023-03-28<NA>2023-06-07압력용기(주)중앙이엔지140815741714081574170013제조업산업용기계제작 및 관련업<NA><NA><NA><NA><NA><NA>4
1822023-04-07한국승강기안전공단경인지역본부중부청2023-04-07Off-Line제품심사(개별)신규신청국내품<NA><NA><NA>2023-04-25크레인준호이스트12118205171211820517000<NA><NA>준호이스트121182051712118205170<NA><NA>2023-04-254
2512023-05-19한국승강기안전공단서울지역본부부천2023-05-19Off-Line서면심사신규신청국내품<NA><NA><NA>2023-05-30크레인극동기계크레인130383134013038313400016제조업하역운반기계외극동기계크레인130383134013038313400<NA><NA><NA>4
2532023-05-19한국승강기안전공단호남지역본부여수2023-05-19Off-Line제품심사(개별)신규신청국내품<NA><NA><NA>2023-06-12크레인(주)대유41281194489080019516102제조크레인제작<NA><NA><NA><NA><NA>2023-06-124