Overview

Dataset statistics

Number of variables7
Number of observations117
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory57.1 B

Variable types

Categorical4
Text2
Boolean1

Dataset

Description경상북도 울진군의 대표홈페이지, 울진군정 스마트알리미(모바일앱), 울진여행(모바일앱)의 카테고리 현황(카테고리코드, 카테고리명 등)을 제공합니다.
URLhttps://www.data.go.kr/data/15119030/fileData.do

Alerts

사용여부 has constant value ""Constant
카테고리명_1차 is highly overall correlated with 카테고리코드_1차 and 2 other fieldsHigh correlation
카테고리코드_1차 is highly overall correlated with 카테고리명_1차 and 2 other fieldsHigh correlation
카테고리명_2차 is highly overall correlated with 카테고리코드_1차 and 2 other fieldsHigh correlation
카테고리코드_2차 is highly overall correlated with 카테고리코드_1차 and 2 other fieldsHigh correlation
카테고리코드_1차 is highly imbalanced (73.2%)Imbalance
카테고리명_1차 is highly imbalanced (73.2%)Imbalance

Reproduction

Analysis started2023-12-12 13:03:04.442420
Analysis finished2023-12-12 13:03:04.862218
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

카테고리코드_1차
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
BOARD
109 
PAY
 
5
DISCOUNT
 
3

Length

Max length8
Median length5
Mean length4.991453
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDISCOUNT
2nd rowPAY
3rd rowDISCOUNT
4th rowPAY
5th rowDISCOUNT

Common Values

ValueCountFrequency (%)
BOARD 109
93.2%
PAY 5
 
4.3%
DISCOUNT 3
 
2.6%

Length

2023-12-12T22:03:04.928010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:03:05.026336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
board 109
93.2%
pay 5
 
4.3%
discount 3
 
2.6%

카테고리명_1차
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
사전정보공표
109 
대관/대여 이용료
 
5
대관/대여 할인_미사용
 
3

Length

Max length12
Median length6
Mean length6.2820513
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대관/대여 할인_미사용
2nd row대관/대여 이용료
3rd row대관/대여 할인_미사용
4th row대관/대여 이용료
5th row대관/대여 할인_미사용

Common Values

ValueCountFrequency (%)
사전정보공표 109
93.2%
대관/대여 이용료 5
 
4.3%
대관/대여 할인_미사용 3
 
2.6%

Length

2023-12-12T22:03:05.121821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:03:05.213752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사전정보공표 109
87.2%
대관/대여 8
 
6.4%
이용료 5
 
4.0%
할인_미사용 3
 
2.4%

카테고리코드_2차
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)18.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
PART12
17 
PART03
13 
PART14
12 
PART08
11 
PART09
Other values (17)
55 

Length

Max length6
Median length6
Mean length5.7264957
Min length2

Unique

Unique4 ?
Unique (%)3.4%

Sample

1st row12
2nd row12
3rd row12
4th row12
5th row12

Common Values

ValueCountFrequency (%)
PART12 17
14.5%
PART03 13
11.1%
PART14 12
10.3%
PART08 11
 
9.4%
PART09 9
 
7.7%
12 6
 
5.1%
PART13 6
 
5.1%
PART11 6
 
5.1%
PART04 4
 
3.4%
PART06 4
 
3.4%
Other values (12) 29
24.8%

Length

2023-12-12T22:03:05.331778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
part12 17
14.5%
part03 13
11.1%
part14 12
10.3%
part08 11
 
9.4%
part09 9
 
7.7%
12 6
 
5.1%
part13 6
 
5.1%
part11 6
 
5.1%
part17 4
 
3.4%
part01 4
 
3.4%
Other values (12) 29
24.8%

카테고리명_2차
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)19.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
일자리경제과
17 
사회복지과
13 
건설과
12 
환경위생과
11 
농정과
Other values (18)
55 

Length

Max length9
Median length8
Mean length4.8803419
Min length3

Unique

Unique4 ?
Unique (%)3.4%

Sample

1st row안전체험관 할인
2nd row안전체험관 이용료
3rd row안전체험관 할인
4th row안전체험관 이용료
5th row안전체험관 할인

Common Values

ValueCountFrequency (%)
일자리경제과 17
14.5%
사회복지과 13
11.1%
건설과 12
 
10.3%
환경위생과 11
 
9.4%
농정과 9
 
7.7%
도시새마을과 6
 
5.1%
해양수산과 6
 
5.1%
재무과 4
 
3.4%
산림과 4
 
3.4%
문화관광과 4
 
3.4%
Other values (13) 31
26.5%

Length

2023-12-12T22:03:05.465990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일자리경제과 17
13.6%
사회복지과 13
 
10.4%
건설과 12
 
9.6%
환경위생과 11
 
8.8%
농정과 9
 
7.2%
도시새마을과 6
 
4.8%
해양수산과 6
 
4.8%
안전체험관 6
 
4.8%
이용료 5
 
4.0%
맑은물사업소 4
 
3.2%
Other values (14) 36
28.8%
Distinct112
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T22:03:05.722820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.8119658
Min length1

Characters and Unicode

Total characters446
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)93.2%

Sample

1st rowP
2nd rowP
3rd rowG
4th rowG
5th row01
ValueCountFrequency (%)
p 3
 
2.6%
g 3
 
2.6%
01 2
 
1.7%
a82z 1
 
0.9%
a64z 1
 
0.9%
a63z 1
 
0.9%
a62z 1
 
0.9%
a61z 1
 
0.9%
a60z 1
 
0.9%
a58z 1
 
0.9%
Other values (102) 102
87.2%
2023-12-12T22:03:06.072315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 94
21.1%
Z 94
21.1%
1 41
9.2%
0 34
 
7.6%
2 23
 
5.2%
4 21
 
4.7%
6 20
 
4.5%
5 20
 
4.5%
9 20
 
4.5%
8 20
 
4.5%
Other values (6) 59
13.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 237
53.1%
Uppercase Letter 209
46.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 41
17.3%
0 34
14.3%
2 23
9.7%
4 21
8.9%
6 20
8.4%
5 20
8.4%
9 20
8.4%
8 20
8.4%
3 19
8.0%
7 19
8.0%
Uppercase Letter
ValueCountFrequency (%)
A 94
45.0%
Z 94
45.0%
B 14
 
6.7%
P 3
 
1.4%
G 3
 
1.4%
C 1
 
0.5%

Most occurring scripts

ValueCountFrequency (%)
Common 237
53.1%
Latin 209
46.9%

Most frequent character per script

Common
ValueCountFrequency (%)
1 41
17.3%
0 34
14.3%
2 23
9.7%
4 21
8.9%
6 20
8.4%
5 20
8.4%
9 20
8.4%
8 20
8.4%
3 19
8.0%
7 19
8.0%
Latin
ValueCountFrequency (%)
A 94
45.0%
Z 94
45.0%
B 14
 
6.7%
P 3
 
1.4%
G 3
 
1.4%
C 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 446
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 94
21.1%
Z 94
21.1%
1 41
9.2%
0 34
 
7.6%
2 23
 
5.2%
4 21
 
4.7%
6 20
 
4.5%
5 20
 
4.5%
9 20
 
4.5%
8 20
 
4.5%
Other values (6) 59
13.2%
Distinct112
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T22:03:06.289030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length9.2222222
Min length2

Characters and Unicode

Total characters1079
Distinct characters204
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)93.2%

Sample

1st row개인
2nd row개인
3rd row단체
4th row단체
5th row울진군민
ValueCountFrequency (%)
현황 57
 
20.7%
7
 
2.5%
안내 7
 
2.5%
지원사업 4
 
1.4%
단체 4
 
1.4%
통계 3
 
1.1%
결과 3
 
1.1%
개인 3
 
1.1%
민방위 2
 
0.7%
상수도 2
 
0.7%
Other values (173) 184
66.7%
2023-12-12T22:03:06.634185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
159
 
14.7%
70
 
6.5%
70
 
6.5%
44
 
4.1%
22
 
2.0%
19
 
1.8%
17
 
1.6%
16
 
1.5%
14
 
1.3%
14
 
1.3%
Other values (194) 634
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 920
85.3%
Space Separator 159
 
14.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
 
7.6%
70
 
7.6%
44
 
4.8%
22
 
2.4%
19
 
2.1%
17
 
1.8%
16
 
1.7%
14
 
1.5%
14
 
1.5%
14
 
1.5%
Other values (193) 620
67.4%
Space Separator
ValueCountFrequency (%)
159
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 920
85.3%
Common 159
 
14.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
 
7.6%
70
 
7.6%
44
 
4.8%
22
 
2.4%
19
 
2.1%
17
 
1.8%
16
 
1.7%
14
 
1.5%
14
 
1.5%
14
 
1.5%
Other values (193) 620
67.4%
Common
ValueCountFrequency (%)
159
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 918
85.1%
ASCII 159
 
14.7%
Compat Jamo 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
159
100.0%
Hangul
ValueCountFrequency (%)
70
 
7.6%
70
 
7.6%
44
 
4.8%
22
 
2.4%
19
 
2.1%
17
 
1.9%
16
 
1.7%
14
 
1.5%
14
 
1.5%
14
 
1.5%
Other values (192) 618
67.3%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size249.0 B
True
117 
ValueCountFrequency (%)
True 117
100.0%
2023-12-12T22:03:06.720370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:03:06.764527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리코드_1차카테고리명_1차카테고리코드_2차카테고리명_2차
카테고리코드_1차1.0001.0000.8711.000
카테고리명_1차1.0001.0000.8711.000
카테고리코드_2차0.8710.8711.0001.000
카테고리명_2차1.0001.0001.0001.000
2023-12-12T22:03:06.838389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리명_1차카테고리코드_1차카테고리명_2차카테고리코드_2차
카테고리명_1차1.0001.0000.9080.653
카테고리코드_1차1.0001.0000.9080.653
카테고리명_2차0.9080.9081.0000.995
카테고리코드_2차0.6530.6530.9951.000
2023-12-12T22:03:06.918056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리코드_1차카테고리명_1차카테고리코드_2차카테고리명_2차
카테고리코드_1차1.0001.0000.6530.908
카테고리명_1차1.0001.0000.6530.908
카테고리코드_2차0.6530.6531.0000.995
카테고리명_2차0.9080.9080.9951.000

Missing values

2023-12-12T22:03:04.719278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:03:04.817452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

카테고리코드_1차카테고리명_1차카테고리코드_2차카테고리명_2차카테고리코드_3차카테고리명_3차사용여부
0DISCOUNT대관/대여 할인_미사용12안전체험관 할인P개인Y
1PAY대관/대여 이용료12안전체험관 이용료P개인Y
2DISCOUNT대관/대여 할인_미사용12안전체험관 할인G단체Y
3PAY대관/대여 이용료12안전체험관 이용료G단체Y
4DISCOUNT대관/대여 할인_미사용12안전체험관 할인01울진군민Y
5PAY대관/대여 이용료12안전체험관 이용료01울진군민Y
6BOARD사전정보공표PART01기획예산실A39Z군정백서Y
7BOARD사전정보공표PART01기획예산실A01Z위원회현황Y
8BOARD사전정보공표PART01기획예산실A02Z출입언론 현황Y
9BOARD사전정보공표PART01기획예산실B200업무추진비 자료Y
카테고리코드_1차카테고리명_1차카테고리코드_2차카테고리명_2차카테고리코드_3차카테고리명_3차사용여부
107BOARD사전정보공표PART17맑은물사업소A97Z상수도 통계Y
108BOARD사전정보공표PART17맑은물사업소A98Z상수도 관련 공사 현황Y
109BOARD사전정보공표PART17맑은물사업소A99Z하수도 통계Y
110BOARD사전정보공표PART17맑은물사업소B100하수관거 공사추진 현황Y
111BOARD사전정보공표PART18왕피천공원사업소B105엑스포공원 이용안내Y
112BOARD사전정보공표PART19체육진흥사업소A27Z체육 육성지원 현황Y
113BOARD사전정보공표PART21의회사무과B101의원발의 조례안Y
114BOARD사전정보공표PART21의회사무과B102의원 국외연수 보고서Y
115PAY대관/대여 이용료17다도체험관 이용료P개인Y
116PAY대관/대여 이용료17다도체험관 이용료G단체Y