Overview

Dataset statistics

Number of variables6
Number of observations1843
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory86.5 KiB
Average record size in memory48.1 B

Variable types

Categorical5
Text1

Dataset

Description중장기개방계획에따른 경상남도 경남도립거창대학 데이터자료입니다.(대분류, 중분류, 소분류, 세분류, 세세분류, 직업명등의 데이터를 포함하고있습니다.)
Author경상남도
URLhttps://www.data.go.kr/data/15066703/fileData.do

Reproduction

Analysis started2023-12-12 15:17:55.561309
Analysis finished2023-12-12 15:17:56.233708
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대분류
Categorical

Distinct10
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
2
648 
8
341 
7
304 
1
122 
4
121 
Other values (5)
307 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 648
35.2%
8 341
18.5%
7 304
16.5%
1 122
 
6.6%
4 121
 
6.6%
3 97
 
5.3%
9 91
 
4.9%
5 59
 
3.2%
6 50
 
2.7%
A 10
 
0.5%

Length

2023-12-13T00:17:56.300352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:17:56.406070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 648
35.2%
8 341
18.5%
7 304
16.5%
1 122
 
6.6%
4 121
 
6.6%
3 97
 
5.3%
9 91
 
4.9%
5 59
 
3.2%
6 50
 
2.7%
a 10
 
0.5%

중분류
Categorical

Distinct10
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
3
310 
1
251 
2
247 
4
242 
5
216 
Other values (5)
577 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row8
2nd row8
3rd row8
4th row8
5th row8

Common Values

ValueCountFrequency (%)
3 310
16.8%
1 251
13.6%
2 247
13.4%
4 242
13.1%
5 216
11.7%
7 201
10.9%
8 160
8.7%
9 125
6.8%
6 81
 
4.4%
- 10
 
0.5%

Length

2023-12-13T00:17:56.513228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:17:56.622308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 310
16.8%
1 251
13.6%
2 247
13.4%
4 242
13.1%
5 216
11.7%
7 201
10.9%
8 160
8.7%
9 125
6.8%
6 81
 
4.4%
10
 
0.5%

소분류
Categorical

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
1
507 
2
452 
3
246 
4
163 
0
124 
Other values (6)
351 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 507
27.5%
2 452
24.5%
3 246
13.3%
4 163
 
8.8%
0 124
 
6.7%
9 106
 
5.8%
5 94
 
5.1%
- 62
 
3.4%
6 48
 
2.6%
7 30
 
1.6%

Length

2023-12-13T00:17:57.052863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 507
27.5%
2 452
24.5%
3 246
13.3%
4 163
 
8.8%
0 124
 
6.7%
9 106
 
5.8%
5 94
 
5.1%
62
 
3.4%
6 48
 
2.6%
7 30
 
1.6%

세분류
Categorical

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
1
419 
2
355 
3
215 
-
211 
0
201 
Other values (6)
442 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row3
4th row3
5th row4

Common Values

ValueCountFrequency (%)
1 419
22.7%
2 355
19.3%
3 215
11.7%
- 211
11.4%
0 201
10.9%
9 170
9.2%
4 142
 
7.7%
5 76
 
4.1%
6 36
 
2.0%
7 15
 
0.8%

Length

2023-12-13T00:17:57.154224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 419
22.7%
2 355
19.3%
3 215
11.7%
211
11.4%
0 201
10.9%
9 170
9.2%
4 142
 
7.7%
5 76
 
4.1%
6 36
 
2.0%
7 15
 
0.8%

세세분류
Categorical

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
-
637 
1
297 
2
277 
9
169 
3
158 
Other values (6)
305 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row0
3rd row-
4th row0
5th row-

Common Values

ValueCountFrequency (%)
- 637
34.6%
1 297
16.1%
2 277
15.0%
9 169
 
9.2%
3 158
 
8.6%
0 129
 
7.0%
4 89
 
4.8%
5 44
 
2.4%
6 26
 
1.4%
7 11
 
0.6%

Length

2023-12-13T00:17:57.260471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
637
34.6%
1 297
16.1%
2 277
15.0%
9 169
 
9.2%
3 158
 
8.6%
0 129
 
7.0%
4 89
 
4.8%
5 44
 
2.4%
6 26
 
1.4%
7 11
 
0.6%
Distinct1677
Distinct (%)91.0%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
2023-12-13T00:17:57.502886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length22
Mean length9.9224091
Min length2

Characters and Unicode

Total characters18287
Distinct characters412
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1516 ?
Unique (%)82.3%

Sample

1st row번역가
2nd row번역가
3rd row통역가
4th row통역가
5th row기자 및 논설위원
ValueCountFrequency (%)
624
 
11.4%
조작원 189
 
3.5%
171
 
3.1%
171
 
3.1%
관련 102
 
1.9%
관리자 101
 
1.9%
연구원 98
 
1.8%
기술자 87
 
1.6%
종사원 84
 
1.5%
사무원 69
 
1.3%
Other values (1436) 3759
68.9%
2023-12-13T00:17:57.861124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3612
 
19.8%
1117
 
6.1%
643
 
3.5%
624
 
3.4%
580
 
3.2%
469
 
2.6%
441
 
2.4%
407
 
2.2%
282
 
1.5%
257
 
1.4%
Other values (402) 9855
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14668
80.2%
Space Separator 3612
 
19.8%
Uppercase Letter 4
 
< 0.1%
Decimal Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1117
 
7.6%
643
 
4.4%
624
 
4.3%
580
 
4.0%
469
 
3.2%
441
 
3.0%
407
 
2.8%
282
 
1.9%
257
 
1.8%
251
 
1.7%
Other values (397) 9597
65.4%
Uppercase Letter
ValueCountFrequency (%)
P 2
50.0%
C 2
50.0%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
9 1
33.3%
Space Separator
ValueCountFrequency (%)
3612
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14668
80.2%
Common 3615
 
19.8%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1117
 
7.6%
643
 
4.4%
624
 
4.3%
580
 
4.0%
469
 
3.2%
441
 
3.0%
407
 
2.8%
282
 
1.9%
257
 
1.8%
251
 
1.7%
Other values (397) 9597
65.4%
Common
ValueCountFrequency (%)
3612
99.9%
1 2
 
0.1%
9 1
 
< 0.1%
Latin
ValueCountFrequency (%)
P 2
50.0%
C 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14667
80.2%
ASCII 3619
 
19.8%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3612
99.8%
P 2
 
0.1%
C 2
 
0.1%
1 2
 
0.1%
9 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1117
 
7.6%
643
 
4.4%
624
 
4.3%
580
 
4.0%
469
 
3.2%
441
 
3.0%
407
 
2.8%
282
 
1.9%
257
 
1.8%
251
 
1.7%
Other values (396) 9596
65.4%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-13T00:17:57.945807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대분류중분류소분류세분류세세분류
대분류1.0000.6720.4500.3350.053
중분류0.6721.0000.5430.2630.000
소분류0.4500.5431.0000.5270.271
세분류0.3350.2630.5271.0000.479
세세분류0.0530.0000.2710.4791.000
2023-12-13T00:17:58.031451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세분류세세분류중분류대분류소분류
세분류1.0000.1590.1150.1490.179
세세분류0.1591.0000.0000.0220.083
중분류0.1150.0001.0000.2640.267
대분류0.1490.0220.2641.0000.210
소분류0.1790.0830.2670.2101.000
2023-12-13T00:17:58.120816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대분류중분류소분류세분류세세분류
대분류1.0000.2640.2100.1490.022
중분류0.2641.0000.2670.1150.000
소분류0.2100.2671.0000.1790.083
세분류0.1490.1150.1791.0000.159
세세분류0.0220.0000.0830.1591.000

Missing values

2023-12-13T00:17:56.078614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:17:56.193949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대분류중분류소분류세분류세세분류직업명
02812-번역가
128120번역가
22813-통역가
328130통역가
42814-기자 및 논설위원
528141기자
628142논설위원
728143칼럼니스트
82815-출판물 전문가
928151출판물 기획자
대분류중분류소분류세분류세세분류직업명
1833A----군인
1834A1---군인
1835A11--장교
1836A111-영관급 이상
1837A1110영관급 이상 장교
1838A112-위관급
1839A1120위관급 장교
1840A12--장기 부사관 및 준위
1841A120-장기 부사관 및 준위
1842A1200장기 부사관 및 준위