Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells311
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory947.3 KiB
Average record size in memory97.0 B

Variable types

Text5
Numeric1
Categorical5

Dataset

Description비공개 기록물 공개 재분류 목록 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=LXHGDSEN8JGLFODSHUOQ20287932&infSeq=1

Alerts

공개제한내용 is highly overall correlated with 공개구분명 and 2 other fieldsHigh correlation
공개재분류의견 is highly overall correlated with 공개구분명 and 2 other fieldsHigh correlation
공개재분류-최종의견-비공개호수명 is highly overall correlated with 공개구분명 and 2 other fieldsHigh correlation
생산일자 is highly overall correlated with 기존공개구분명High correlation
기존공개구분명 is highly overall correlated with 생산일자High correlation
공개구분명 is highly overall correlated with 공개재분류-최종의견-비공개호수명 and 2 other fieldsHigh correlation
기존공개구분명 is highly imbalanced (79.7%)Imbalance
공개구분명 is highly imbalanced (73.4%)Imbalance
공개재분류-최종의견-비공개호수명 is highly imbalanced (54.2%)Imbalance
공개제한내용 is highly imbalanced (54.5%)Imbalance
공개재분류의견 is highly imbalanced (56.0%)Imbalance
생산등록번호 has 311 (3.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:31:54.285281
Analysis finished2023-12-10 22:31:56.837104
Duration2.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct146
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:31:57.035567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length15
Mean length16.0352
Min length11

Characters and Unicode

Total characters160352
Distinct characters167
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)0.3%

Sample

1st row경기도 이천소방서 현장대응단
2nd row경기도 양주소방서 재난예방과
3rd row경기도 평택소방서 재난예방과
4th row경기도 양주소방서 재난예방과
5th row경기도 안산소방서 재난예방과
ValueCountFrequency (%)
경기도 10000
32.3%
재난예방과 6624
21.4%
수원소방서 2663
 
8.6%
안양소방서 1164
 
3.8%
일산소방서 984
 
3.2%
고양소방서 736
 
2.4%
안산소방서 682
 
2.2%
소방행정과 509
 
1.6%
양주소방서 471
 
1.5%
정책기획관 437
 
1.4%
Other values (148) 6672
21.6%
2023-12-11T07:31:57.510057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20942
13.1%
15376
 
9.6%
11336
 
7.1%
10915
 
6.8%
10469
 
6.5%
8859
 
5.5%
8770
 
5.5%
7972
 
5.0%
7224
 
4.5%
6885
 
4.3%
Other values (157) 51604
32.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 139158
86.8%
Space Separator 20942
 
13.1%
Decimal Number 243
 
0.2%
Uppercase Letter 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15376
 
11.0%
11336
 
8.1%
10915
 
7.8%
10469
 
7.5%
8859
 
6.4%
8770
 
6.3%
7972
 
5.7%
7224
 
5.2%
6885
 
4.9%
6636
 
4.8%
Other values (151) 44716
32.1%
Uppercase Letter
ValueCountFrequency (%)
D 3
33.3%
M 3
33.3%
Z 3
33.3%
Decimal Number
ValueCountFrequency (%)
1 162
66.7%
9 81
33.3%
Space Separator
ValueCountFrequency (%)
20942
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 139158
86.8%
Common 21185
 
13.2%
Latin 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15376
 
11.0%
11336
 
8.1%
10915
 
7.8%
10469
 
7.5%
8859
 
6.4%
8770
 
6.3%
7972
 
5.7%
7224
 
5.2%
6885
 
4.9%
6636
 
4.8%
Other values (151) 44716
32.1%
Common
ValueCountFrequency (%)
20942
98.9%
1 162
 
0.8%
9 81
 
0.4%
Latin
ValueCountFrequency (%)
D 3
33.3%
M 3
33.3%
Z 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 139158
86.8%
ASCII 21194
 
13.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20942
98.8%
1 162
 
0.8%
9 81
 
0.4%
D 3
 
< 0.1%
M 3
 
< 0.1%
Z 3
 
< 0.1%
Hangul
ValueCountFrequency (%)
15376
 
11.0%
11336
 
8.1%
10915
 
7.8%
10469
 
7.5%
8859
 
6.4%
8770
 
6.3%
7972
 
5.7%
7224
 
5.2%
6885
 
4.9%
6636
 
4.8%
Other values (151) 44716
32.1%
Distinct159
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:31:57.798364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length13
Mean length14.6966
Min length11

Characters and Unicode

Total characters146966
Distinct characters168
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)0.3%

Sample

1st row경기도 이천소방서 현장지휘과
2nd row경기도 양주소방서 예방과
3rd row경기도 평택소방서 예방과
4th row경기도 양주소방서 예방과
5th row경기도 안산소방서 예방과
ValueCountFrequency (%)
경기도 10000
32.5%
예방과 6620
21.5%
수원소방서 2663
 
8.7%
안양소방서 1164
 
3.8%
일산소방서 984
 
3.2%
고양소방서 736
 
2.4%
안산소방서 682
 
2.2%
현장지휘과 599
 
1.9%
소방행정과 509
 
1.7%
양주소방서 471
 
1.5%
Other values (159) 6322
20.6%
2023-12-11T07:31:58.243681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20750
14.1%
15586
 
10.6%
11316
 
7.7%
10914
 
7.4%
10486
 
7.1%
9325
 
6.3%
8810
 
6.0%
7972
 
5.4%
6655
 
4.5%
2948
 
2.0%
Other values (158) 42204
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125962
85.7%
Space Separator 20750
 
14.1%
Decimal Number 245
 
0.2%
Uppercase Letter 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15586
 
12.4%
11316
 
9.0%
10914
 
8.7%
10486
 
8.3%
9325
 
7.4%
8810
 
7.0%
7972
 
6.3%
6655
 
5.3%
2948
 
2.3%
2778
 
2.2%
Other values (151) 39172
31.1%
Decimal Number
ValueCountFrequency (%)
1 162
66.1%
9 81
33.1%
2 2
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
M 3
33.3%
Z 3
33.3%
D 3
33.3%
Space Separator
ValueCountFrequency (%)
20750
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 125962
85.7%
Common 20995
 
14.3%
Latin 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15586
 
12.4%
11316
 
9.0%
10914
 
8.7%
10486
 
8.3%
9325
 
7.4%
8810
 
7.0%
7972
 
6.3%
6655
 
5.3%
2948
 
2.3%
2778
 
2.2%
Other values (151) 39172
31.1%
Common
ValueCountFrequency (%)
20750
98.8%
1 162
 
0.8%
9 81
 
0.4%
2 2
 
< 0.1%
Latin
ValueCountFrequency (%)
M 3
33.3%
Z 3
33.3%
D 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 125962
85.7%
ASCII 21004
 
14.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20750
98.8%
1 162
 
0.8%
9 81
 
0.4%
M 3
 
< 0.1%
Z 3
 
< 0.1%
D 3
 
< 0.1%
2 2
 
< 0.1%
Hangul
ValueCountFrequency (%)
15586
 
12.4%
11316
 
9.0%
10914
 
8.7%
10486
 
8.3%
9325
 
7.4%
8810
 
7.0%
7972
 
6.3%
6655
 
5.3%
2948
 
2.3%
2778
 
2.2%
Other values (151) 39172
31.1%
Distinct1042
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:31:58.488956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length52
Mean length11.0856
Min length2

Characters and Unicode

Total characters110856
Distinct characters448
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique713 ?
Unique (%)7.1%

Sample

1st row화재조사일반
2nd row소방시설완공검사
3rd row위험물제조소등일반
4th row소방시설완공검사
5th row소방시설업등록변경
ValueCountFrequency (%)
소방시설완공검사 1003
 
8.9%
소방건축허가동의 549
 
4.8%
안전시설등완비증명발급 419
 
3.7%
소방시설시공신고 379
 
3.3%
소방시설공사착공(변경)신고 364
 
3.2%
건축허가동의 353
 
3.1%
소방시설완공검사(감리 347
 
3.1%
방염후처리물품방염성능검사관리 324
 
2.9%
소방시설업등록변경 317
 
2.8%
소방시설착공신고 314
 
2.8%
Other values (1160) 6952
61.4%
2023-12-11T07:31:58.963462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6302
 
5.7%
4985
 
4.5%
4898
 
4.4%
4488
 
4.0%
4111
 
3.7%
4031
 
3.6%
2535
 
2.3%
2287
 
2.1%
2198
 
2.0%
2191
 
2.0%
Other values (438) 72830
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 96937
87.4%
Decimal Number 5759
 
5.2%
Close Punctuation 2092
 
1.9%
Open Punctuation 2092
 
1.9%
Space Separator 1322
 
1.2%
Dash Punctuation 1095
 
1.0%
Connector Punctuation 768
 
0.7%
Other Punctuation 591
 
0.5%
Uppercase Letter 159
 
0.1%
Lowercase Letter 17
 
< 0.1%
Other values (2) 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6302
 
6.5%
4985
 
5.1%
4898
 
5.1%
4488
 
4.6%
4111
 
4.2%
4031
 
4.2%
2535
 
2.6%
2287
 
2.4%
2198
 
2.3%
2191
 
2.3%
Other values (390) 58911
60.8%
Uppercase Letter
ValueCountFrequency (%)
C 102
64.2%
G 10
 
6.3%
F 8
 
5.0%
I 7
 
4.4%
L 6
 
3.8%
N 5
 
3.1%
H 4
 
2.5%
O 3
 
1.9%
D 3
 
1.9%
M 3
 
1.9%
Other values (4) 8
 
5.0%
Decimal Number
ValueCountFrequency (%)
1 1530
26.6%
2 1401
24.3%
0 1119
19.4%
3 510
 
8.9%
4 455
 
7.9%
5 234
 
4.1%
6 192
 
3.3%
7 114
 
2.0%
8 113
 
2.0%
9 91
 
1.6%
Lowercase Letter
ValueCountFrequency (%)
o 6
35.3%
m 2
 
11.8%
s 2
 
11.8%
u 1
 
5.9%
l 1
 
5.9%
h 1
 
5.9%
c 1
 
5.9%
r 1
 
5.9%
i 1
 
5.9%
e 1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
. 490
82.9%
, 69
 
11.7%
: 22
 
3.7%
/ 9
 
1.5%
· 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 2051
98.0%
] 41
 
2.0%
Open Punctuation
ValueCountFrequency (%)
( 2050
98.0%
[ 42
 
2.0%
Space Separator
ValueCountFrequency (%)
1322
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1095
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 768
100.0%
Math Symbol
ValueCountFrequency (%)
~ 16
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 96945
87.5%
Common 13735
 
12.4%
Latin 176
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6302
 
6.5%
4985
 
5.1%
4898
 
5.1%
4488
 
4.6%
4111
 
4.2%
4031
 
4.2%
2535
 
2.6%
2287
 
2.4%
2198
 
2.3%
2191
 
2.3%
Other values (391) 58919
60.8%
Latin
ValueCountFrequency (%)
C 102
58.0%
G 10
 
5.7%
F 8
 
4.5%
I 7
 
4.0%
L 6
 
3.4%
o 6
 
3.4%
N 5
 
2.8%
H 4
 
2.3%
O 3
 
1.7%
D 3
 
1.7%
Other values (14) 22
 
12.5%
Common
ValueCountFrequency (%)
) 2051
14.9%
( 2050
14.9%
1 1530
11.1%
2 1401
10.2%
1322
9.6%
0 1119
8.1%
- 1095
8.0%
_ 768
 
5.6%
3 510
 
3.7%
. 490
 
3.6%
Other values (13) 1399
10.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 96937
87.4%
ASCII 13910
 
12.5%
None 9
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6302
 
6.5%
4985
 
5.1%
4898
 
5.1%
4488
 
4.6%
4111
 
4.2%
4031
 
4.2%
2535
 
2.6%
2287
 
2.4%
2198
 
2.3%
2191
 
2.3%
Other values (390) 58911
60.8%
ASCII
ValueCountFrequency (%)
) 2051
14.7%
( 2050
14.7%
1 1530
11.0%
2 1401
10.1%
1322
9.5%
0 1119
8.0%
- 1095
7.9%
_ 768
 
5.5%
3 510
 
3.7%
. 490
 
3.5%
Other values (36) 1574
11.3%
None
ValueCountFrequency (%)
8
88.9%
· 1
 
11.1%

생산등록번호
Text

MISSING 

Distinct8972
Distinct (%)92.6%
Missing311
Missing (%)3.1%
Memory size156.2 KiB
2023-12-11T07:31:59.274355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length9.0227062
Min length1

Characters and Unicode

Total characters87421
Distinct characters146
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8319 ?
Unique (%)85.9%

Sample

1st row현장지휘과-4409
2nd row예방과-15414
3rd row예방과-10279
4th row예방과-7594
5th row예방과-17573
ValueCountFrequency (%)
화재조사분석과 7
 
0.1%
소방행정과 5
 
0.1%
현장지휘과 5
 
0.1%
예방과 4
 
< 0.1%
종무과-400122 4
 
< 0.1%
종무과-400061 4
 
< 0.1%
예방과-212 4
 
< 0.1%
종무과-400069 4
 
< 0.1%
예방과-3886 3
 
< 0.1%
예방과-8882 3
 
< 0.1%
Other values (8962) 9646
99.6%
2023-12-11T07:31:59.797323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 9350
 
10.7%
9030
 
10.3%
1 7295
 
8.3%
7218
 
8.3%
6530
 
7.5%
2 5081
 
5.8%
0 4591
 
5.3%
3 4145
 
4.7%
4 4105
 
4.7%
5 3794
 
4.3%
Other values (136) 26282
30.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 43454
49.7%
Other Letter 34608
39.6%
Dash Punctuation 9350
 
10.7%
Uppercase Letter 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9030
26.1%
7218
20.9%
6530
18.9%
924
 
2.7%
725
 
2.1%
606
 
1.8%
605
 
1.7%
599
 
1.7%
599
 
1.7%
540
 
1.6%
Other values (122) 7232
20.9%
Decimal Number
ValueCountFrequency (%)
1 7295
16.8%
2 5081
11.7%
0 4591
10.6%
3 4145
9.5%
4 4105
9.4%
5 3794
8.7%
9 3788
8.7%
6 3651
8.4%
7 3507
8.1%
8 3497
8.0%
Uppercase Letter
ValueCountFrequency (%)
D 3
33.3%
M 3
33.3%
Z 3
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 9350
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 52804
60.4%
Hangul 34608
39.6%
Latin 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9030
26.1%
7218
20.9%
6530
18.9%
924
 
2.7%
725
 
2.1%
606
 
1.8%
605
 
1.7%
599
 
1.7%
599
 
1.7%
540
 
1.6%
Other values (122) 7232
20.9%
Common
ValueCountFrequency (%)
- 9350
17.7%
1 7295
13.8%
2 5081
9.6%
0 4591
8.7%
3 4145
7.8%
4 4105
7.8%
5 3794
7.2%
9 3788
7.2%
6 3651
 
6.9%
7 3507
 
6.6%
Latin
ValueCountFrequency (%)
D 3
33.3%
M 3
33.3%
Z 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52813
60.4%
Hangul 34608
39.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 9350
17.7%
1 7295
13.8%
2 5081
9.6%
0 4591
8.7%
3 4145
7.8%
4 4105
7.8%
5 3794
7.2%
9 3788
7.2%
6 3651
 
6.9%
7 3507
 
6.6%
Other values (4) 3506
 
6.6%
Hangul
ValueCountFrequency (%)
9030
26.1%
7218
20.9%
6530
18.9%
924
 
2.7%
725
 
2.1%
606
 
1.8%
605
 
1.7%
599
 
1.7%
599
 
1.7%
540
 
1.6%
Other values (122) 7232
20.9%
Distinct8337
Distinct (%)83.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:32:00.162089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length50
Mean length26.3031
Min length3

Characters and Unicode

Total characters263031
Distinct characters874
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7811 ?
Unique (%)78.1%

Sample

1st row2012년 화재피해주민 주택재건축사업 추천대상 현지실사 계획 알림
2nd row소방시설완공검사신청서
3rd row위험물 이동탱크저장소 허가내역 협조요청
4th row소방시설완공검사필증 교부사항 알림
5th row소방시설공사업 등록사항 변경신고 처리에 따른 보고(통보)(주.고산)
ValueCountFrequency (%)
따른 1524
 
3.8%
소방시설 1073
 
2.7%
대한 837
 
2.1%
결과 724
 
1.8%
신청에 645
 
1.6%
안전시설등 593
 
1.5%
586
 
1.5%
완공검사 568
 
1.4%
건물 554
 
1.4%
소방시설업 532
 
1.3%
Other values (10890) 32403
80.9%
2023-12-11T07:32:00.755703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30052
 
11.4%
) 7640
 
2.9%
( 7620
 
2.9%
6442
 
2.4%
6342
 
2.4%
6081
 
2.3%
5869
 
2.2%
5344
 
2.0%
5307
 
2.0%
5064
 
1.9%
Other values (864) 177270
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 198227
75.4%
Space Separator 30052
 
11.4%
Open Punctuation 11102
 
4.2%
Close Punctuation 11101
 
4.2%
Decimal Number 8711
 
3.3%
Dash Punctuation 1458
 
0.6%
Uppercase Letter 1077
 
0.4%
Other Punctuation 1075
 
0.4%
Lowercase Letter 115
 
< 0.1%
Math Symbol 56
 
< 0.1%
Other values (6) 57
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6442
 
3.2%
6342
 
3.2%
6081
 
3.1%
5869
 
3.0%
5344
 
2.7%
5307
 
2.7%
5064
 
2.6%
4951
 
2.5%
4634
 
2.3%
3679
 
1.9%
Other values (772) 144514
72.9%
Uppercase Letter
ValueCountFrequency (%)
C 256
23.8%
P 119
11.0%
B 80
 
7.4%
A 75
 
7.0%
S 74
 
6.9%
E 47
 
4.4%
L 41
 
3.8%
I 37
 
3.4%
O 36
 
3.3%
F 34
 
3.2%
Other values (15) 278
25.8%
Lowercase Letter
ValueCountFrequency (%)
e 14
12.2%
o 12
10.4%
p 11
9.6%
c 10
 
8.7%
l 9
 
7.8%
a 8
 
7.0%
i 8
 
7.0%
n 7
 
6.1%
t 6
 
5.2%
s 5
 
4.3%
Other values (10) 25
21.7%
Other Punctuation
ValueCountFrequency (%)
: 576
53.6%
. 172
 
16.0%
, 144
 
13.4%
· 72
 
6.7%
/ 60
 
5.6%
& 17
 
1.6%
* 14
 
1.3%
" 10
 
0.9%
; 5
 
0.5%
# 3
 
0.3%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 2159
24.8%
2 2001
23.0%
0 1307
15.0%
3 620
 
7.1%
4 500
 
5.7%
8 434
 
5.0%
5 434
 
5.0%
6 420
 
4.8%
7 419
 
4.8%
9 417
 
4.8%
Close Punctuation
ValueCountFrequency (%)
) 7640
68.8%
] 3366
30.3%
29
 
0.3%
26
 
0.2%
21
 
0.2%
19
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 7620
68.6%
[ 3390
30.5%
26
 
0.2%
26
 
0.2%
21
 
0.2%
19
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 47
83.9%
< 4
 
7.1%
> 4
 
7.1%
1
 
1.8%
Modifier Symbol
ValueCountFrequency (%)
16
88.9%
` 2
 
11.1%
Space Separator
ValueCountFrequency (%)
30052
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1458
100.0%
Other Symbol
ValueCountFrequency (%)
28
100.0%
Initial Punctuation
ValueCountFrequency (%)
5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 198249
75.4%
Common 63583
 
24.2%
Latin 1194
 
0.5%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6442
 
3.2%
6342
 
3.2%
6081
 
3.1%
5869
 
3.0%
5344
 
2.7%
5307
 
2.7%
5064
 
2.6%
4951
 
2.5%
4634
 
2.3%
3679
 
1.9%
Other values (768) 144536
72.9%
Latin
ValueCountFrequency (%)
C 256
21.4%
P 119
 
10.0%
B 80
 
6.7%
A 75
 
6.3%
S 74
 
6.2%
E 47
 
3.9%
L 41
 
3.4%
I 37
 
3.1%
O 36
 
3.0%
F 34
 
2.8%
Other values (37) 395
33.1%
Common
ValueCountFrequency (%)
30052
47.3%
) 7640
 
12.0%
( 7620
 
12.0%
[ 3390
 
5.3%
] 3366
 
5.3%
1 2159
 
3.4%
2 2001
 
3.1%
- 1458
 
2.3%
0 1307
 
2.1%
3 620
 
1.0%
Other values (35) 3970
 
6.2%
Han
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 198214
75.4%
ASCII 64492
 
24.5%
None 305
 
0.1%
Compat Jamo 7
 
< 0.1%
Punctuation 6
 
< 0.1%
CJK 5
 
< 0.1%
Number Forms 1
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30052
46.6%
) 7640
 
11.8%
( 7620
 
11.8%
[ 3390
 
5.3%
] 3366
 
5.2%
1 2159
 
3.3%
2 2001
 
3.1%
- 1458
 
2.3%
0 1307
 
2.0%
3 620
 
1.0%
Other values (66) 4879
 
7.6%
Hangul
ValueCountFrequency (%)
6442
 
3.3%
6342
 
3.2%
6081
 
3.1%
5869
 
3.0%
5344
 
2.7%
5307
 
2.7%
5064
 
2.6%
4951
 
2.5%
4634
 
2.3%
3679
 
1.9%
Other values (766) 144501
72.9%
None
ValueCountFrequency (%)
· 72
23.6%
29
9.5%
28
 
9.2%
26
 
8.5%
26
 
8.5%
26
 
8.5%
21
 
6.9%
21
 
6.9%
19
 
6.2%
19
 
6.2%
Other values (3) 18
 
5.9%
Compat Jamo
ValueCountFrequency (%)
7
100.0%
Punctuation
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
CJK
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%

생산일자
Real number (ℝ)

HIGH CORRELATION 

Distinct828
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20122380
Minimum20120101
Maximum20171108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:00.927379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20120101
5-th percentile20120126
Q120120409
median20120713
Q320121031
95-th percentile20140123
Maximum20171108
Range51007
Interquartile range (IQR)622

Descriptive statistics

Standard deviation6194.9291
Coefficient of variation (CV)0.00030786264
Kurtosis18.94226
Mean20122380
Median Absolute Deviation (MAD)310
Skewness4.1891345
Sum2.012238 × 1011
Variance38377147
MonotonicityNot monotonic
2023-12-11T07:32:01.122317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20120316 60
 
0.6%
20121228 56
 
0.6%
20120409 55
 
0.5%
20121227 54
 
0.5%
20121101 54
 
0.5%
20120410 53
 
0.5%
20120523 52
 
0.5%
20120531 52
 
0.5%
20120319 51
 
0.5%
20120305 51
 
0.5%
Other values (818) 9462
94.6%
ValueCountFrequency (%)
20120101 1
 
< 0.1%
20120102 27
0.3%
20120103 32
0.3%
20120104 39
0.4%
20120105 27
0.3%
20120106 35
0.4%
20120109 33
0.3%
20120110 38
0.4%
20120111 16
0.2%
20120112 25
0.2%
ValueCountFrequency (%)
20171108 1
< 0.1%
20170926 1
< 0.1%
20170831 1
< 0.1%
20170817 1
< 0.1%
20170613 1
< 0.1%
20170501 1
< 0.1%
20170220 1
< 0.1%
20170201 1
< 0.1%
20170124 1
< 0.1%
20161230 1
< 0.1%

기존공개구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부분공개
9682 
비공개
 
318

Length

Max length4
Median length4
Mean length3.9682
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부분공개
2nd row부분공개
3rd row부분공개
4th row부분공개
5th row부분공개

Common Values

ValueCountFrequency (%)
부분공개 9682
96.8%
비공개 318
 
3.2%

Length

2023-12-11T07:32:01.329806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:32:01.424259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부분공개 9682
96.8%
비공개 318
 
3.2%

공개구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부분공개
9153 
공개
 
844
부분공개
 
3

Length

Max length5
Median length4
Mean length3.8315
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부분공개
2nd row부분공개
3rd row부분공개
4th row부분공개
5th row부분공개

Common Values

ValueCountFrequency (%)
부분공개 9153
91.5%
공개 844
 
8.4%
부분공개 3
 
< 0.1%

Length

2023-12-11T07:32:01.549656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:32:01.658717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부분공개 9156
91.6%
공개 844
 
8.4%

공개재분류-최종의견-비공개호수명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6호
5232 
7호
2567 
6,7호
1166 
<NA>
843 
6
 
149
Other values (10)
 
43

Length

Max length6
Median length2
Mean length2.3895
Min length1

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row6호
2nd row6호
3rd row6호
4th row6호
5th row6호

Common Values

ValueCountFrequency (%)
6호 5232
52.3%
7호 2567
25.7%
6,7호 1166
 
11.7%
<NA> 843
 
8.4%
6 149
 
1.5%
5호 20
 
0.2%
5,6호 9
 
0.1%
2호 6
 
0.1%
8호 2
 
< 0.1%
해당없음 1
 
< 0.1%
Other values (5) 5
 
0.1%

Length

2023-12-11T07:32:01.781636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
6호 5232
52.3%
7호 2567
25.7%
6,7호 1166
 
11.7%
na 843
 
8.4%
6 149
 
1.5%
5호 20
 
0.2%
5,6호 9
 
0.1%
2호 6
 
0.1%
8호 2
 
< 0.1%
해당없음 1
 
< 0.1%
Other values (5) 5
 
< 0.1%

공개제한내용
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
개인정보
5201 
업체정보
2121 
<NA>
843 
개인정보 및 업체정보
829 
단체정보
 
434
Other values (20)
572 

Length

Max length35
Median length4
Mean length4.9125
Min length4

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row개인정보
2nd row개인정보
3rd row개인정보
4th row개인정보
5th row개인정보

Common Values

ValueCountFrequency (%)
개인정보 5201
52.0%
업체정보 2121
21.2%
<NA> 843
 
8.4%
개인정보 및 업체정보 829
 
8.3%
단체정보 434
 
4.3%
개인정보 및 단체정보 315
 
3.1%
민원인 개인정보 149
 
1.5%
개인정보 및 업체고유정보 23
 
0.2%
개인정보 22
 
0.2%
법률자문관련정보 17
 
0.2%
Other values (15) 46
 
0.5%

Length

2023-12-11T07:32:01.911350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
개인정보 6557
52.4%
업체정보 2950
23.6%
1178
 
9.4%
na 843
 
6.7%
단체정보 749
 
6.0%
민원인 149
 
1.2%
업체고유정보 31
 
0.2%
법률자문관련정보 25
 
0.2%
국가안전보장정보 6
 
< 0.1%
법인관련정보 5
 
< 0.1%
Other values (20) 26
 
0.2%

공개재분류의견
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
개인정보 일부 포함되어 부분공개함
5226 
업체정보 일부 포함되어 부분공개함
2121 
비공개 대상 정보 미포함되어 공개함
843 
개인정보 및 업체정보 일부 포함되어 부분공개함
830 
단체정보 일부 포함되어 부분공개함
 
434
Other values (22)
546 

Length

Max length49
Median length18
Mean length18.7783
Min length1

Unique

Unique11 ?
Unique (%)0.1%

Sample

1st row개인정보 일부 포함되어 부분공개함
2nd row개인정보 일부 포함되어 부분공개함
3rd row개인정보 일부 포함되어 부분공개함
4th row개인정보 일부 포함되어 부분공개함
5th row개인정보 일부 포함되어 부분공개함

Common Values

ValueCountFrequency (%)
개인정보 일부 포함되어 부분공개함 5226
52.3%
업체정보 일부 포함되어 부분공개함 2121
21.2%
비공개 대상 정보 미포함되어 공개함 843
 
8.4%
개인정보 및 업체정보 일부 포함되어 부분공개함 830
 
8.3%
단체정보 일부 포함되어 부분공개함 434
 
4.3%
개인정보 및 단체정보 일부 포함되어 부분공개함 315
 
3.1%
개인정보 보호 149
 
1.5%
개인정보 및 업체고유정보 일부 포함되어 부분공개함 23
 
0.2%
법률자문관련 정보 일부 포함되어 부분공개함 17
 
0.2%
업체고유정보 일부 포함되어 부분공개함 8
 
0.1%
Other values (17) 34
 
0.3%

Length

2023-12-11T07:32:02.222169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
포함되어 9000
20.9%
부분공개함 8992
20.9%
일부 8991
20.9%
개인정보 6556
15.3%
업체정보 2951
 
6.9%
1178
 
2.7%
정보 879
 
2.0%
비공개 853
 
2.0%
대상 843
 
2.0%
미포함되어 843
 
2.0%
Other values (47) 1880
 
4.4%

Interactions

2023-12-11T07:31:56.405501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:32:02.312027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
생산일자기존공개구분명공개구분명공개재분류-최종의견-비공개호수명공개제한내용공개재분류의견
생산일자1.0000.7930.2380.2790.4810.494
기존공개구분명0.7931.0000.2430.0530.2360.512
공개구분명0.2380.2431.0000.8400.9170.873
공개재분류-최종의견-비공개호수명0.2790.0530.8401.0000.9940.994
공개제한내용0.4810.2360.9170.9941.0000.993
공개재분류의견0.4940.5120.8730.9940.9931.000
2023-12-11T07:32:02.417890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기존공개구분명공개구분명공개제한내용공개재분류의견공개재분류-최종의견-비공개호수명
기존공개구분명1.0000.3960.1870.4080.042
공개구분명0.3961.0000.7080.7080.706
공개제한내용0.1870.7081.0000.9040.951
공개재분류의견0.4080.7080.9041.0000.952
공개재분류-최종의견-비공개호수명0.0420.7060.9510.9521.000
2023-12-11T07:32:02.507672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
생산일자기존공개구분명공개구분명공개재분류-최종의견-비공개호수명공개제한내용공개재분류의견
생산일자1.0000.6270.1460.1160.1980.202
기존공개구분명0.6271.0000.3960.0420.1870.408
공개구분명0.1460.3961.0000.7060.7080.708
공개재분류-최종의견-비공개호수명0.1160.0420.7061.0000.9510.952
공개제한내용0.1980.1870.7080.9511.0000.904
공개재분류의견0.2020.4080.7080.9520.9041.000

Missing values

2023-12-11T07:31:56.576211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:31:56.758623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

담당부서정보생산부서정보철제목생산등록번호건제목생산일자기존공개구분명공개구분명공개재분류-최종의견-비공개호수명공개제한내용공개재분류의견
35810경기도 이천소방서 현장대응단경기도 이천소방서 현장지휘과화재조사일반현장지휘과-44092012년 화재피해주민 주택재건축사업 추천대상 현지실사 계획 알림20120601부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
18564경기도 양주소방서 재난예방과경기도 양주소방서 예방과소방시설완공검사예방과-15414소방시설완공검사신청서20121228부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
33613경기도 평택소방서 재난예방과경기도 평택소방서 예방과위험물제조소등일반예방과-10279위험물 이동탱크저장소 허가내역 협조요청20120913부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
12949경기도 양주소방서 재난예방과경기도 양주소방서 예방과소방시설완공검사예방과-7594소방시설완공검사필증 교부사항 알림20120629부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
23778경기도 안산소방서 재난예방과경기도 안산소방서 예방과소방시설업등록변경예방과-17573소방시설공사업 등록사항 변경신고 처리에 따른 보고(통보)(주.고산)20120820부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
13577경기도 수원소방서 재난예방과경기도 수원소방서 예방과방염후처리물품방염성능검사관리예방과-22515안전시설등 완공신고서(휴일식)20120905부분공개부분공개7호업체정보업체정보 일부 포함되어 부분공개함
35657경기도 문화체육관광국 체육과경기도 문화체육관광국 체육과고양한양대중CC사업계획(변경)승인신청서(2-2)체육과-400312한양컨트리클럽[회원제36홀/대중제9홀]사업계획변경승인의제협의요청에대한검토보고20120907부분공개부분공개6,7호개인정보 및 업체정보개인정보 및 업체정보 일부 포함되어 부분공개함
41813경기도 안양소방서 재난예방과경기도 안양소방서 예방과소방시설완공검사예방과-6857소방시설 완공검사증명서 교부[김두수외 3인건물]20120507부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
32854경기도 의정부소방서 화재조사분석과경기도 의정부소방서 화재조사분석과화재조사일반화재조사분석과-39399월중 멘토링제 운영계획 알림20120921부분공개공개<NA><NA>비공개 대상 정보 미포함되어 공개함
37880경기도 기획조정실 정책기획관 정보통신보안담당관경기도 기획조정실 정책기획관 정보통신보안담당관2014-정보통신공사업등록관련-31551정보통신공사업 등록기준신고서-(주)무아레전자통신20140212부분공개부분공개7호업체정보업체정보 일부 포함되어 부분공개함
담당부서정보생산부서정보철제목생산등록번호건제목생산일자기존공개구분명공개구분명공개재분류-최종의견-비공개호수명공개제한내용공개재분류의견
28227경기도 안성소방서 재난예방과경기도 안성소방서 예방과소방시설업일반예방과-16454방염처리업 신규 등록업체 보고(통보)[소방나라119]20121214부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
5630경기도 자치행정국 총무과경기도 자치행정국 총무과기록관리기준표관리총무과-233842012년도 기록관리기준표 제출20121017부분공개공개<NA><NA>비공개 대상 정보 미포함되어 공개함
21886경기도 안산소방서 재난예방과경기도 안산소방서 예방과소방시설업등록변경예방과-21514소방시설업 등록사항 변경신고 검토결과 보고[한두이앤지㈜]20121017부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
8692경기도 수원소방서 재난예방과경기도 수원소방서 예방과소방시설완공검사예방과-1239소방시설 완공검사 신청에 따른 결과 통보[조춘연]20120117부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
40629경기도 북부소방재난본부 북부특수대응단경기도 북부소방재난본부 방호구조과화재조사일반방호구조과-5167화재조사 사례연구 논문 작성 보고20120619부분공개공개<NA><NA>비공개 대상 정보 미포함되어 공개함
37164경기도 도시주택실 도시정책관 지역정책과경기도 경기도위원회 경기도지방토지수용위원회지방토지수용위원회 개최 및 수용재결경기도지방토지수용위원회-1762토지수용재결신청서 열람 및 공고 결과 제출20120827부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
2930경기도 과천소방서 현장대응단경기도 과천소방서 현장지휘과소방시설업일반현장지휘과-1266소방시설업 등록 결격사유 확인의뢰20120209부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
45592경기도 양주소방서 소방행정과경기도 양주소방서 소방행정과계약공사일반소방행정과-12565신용카드 사용내역(생명존중문화 확산 MOU 체결 현수막 외 1종 제작)20121231부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
36160경기도 도시주택실 도시정책관 지역정책과경기도 경기도위원회 경기도지방토지수용위원회지방토지수용위원회 개최 및 수용재결경기도지방토지수용위원회-1807감정평가 제출기한 및 가격시점 변경 통보20120903부분공개부분공개6호개인정보개인정보 일부 포함되어 부분공개함
38252경기도 기획조정실 정책기획관 정보통신보안담당관경기도 기획조정실 정책기획관 정보통신보안담당관2015-정보통신공사업등록관련-5229정보통신공사업 재교부신청서-대유플러스(주)20150109부분공개부분공개7호업체정보업체정보 일부 포함되어 부분공개함