Overview

Dataset statistics

Number of variables3
Number of observations685
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.9 KiB
Average record size in memory25.2 B

Variable types

Numeric1
Categorical1
Text1

Dataset

Description상훈법, 상훈법시행령, 정부포상업무지침 등에 근거한 2023년도 과학기술정보통신부 장관표창 및 상장 현황 목록 (2023년 07월말 기준) -컬럼명: 순번, 구분, 공적분야
URLhttps://www.data.go.kr/data/15033646/fileData.do

Alerts

순번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:54:01.981164
Analysis finished2023-12-12 07:54:02.523104
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct685
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean343
Minimum1
Maximum685
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.1 KiB
2023-12-12T16:54:02.588144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile35.2
Q1172
median343
Q3514
95-th percentile650.8
Maximum685
Range684
Interquartile range (IQR)342

Descriptive statistics

Standard deviation197.88675
Coefficient of variation (CV)0.57692931
Kurtosis-1.2
Mean343
Median Absolute Deviation (MAD)171
Skewness0
Sum234955
Variance39159.167
MonotonicityStrictly increasing
2023-12-12T16:54:02.747065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
461 1
 
0.1%
453 1
 
0.1%
454 1
 
0.1%
455 1
 
0.1%
456 1
 
0.1%
457 1
 
0.1%
458 1
 
0.1%
459 1
 
0.1%
460 1
 
0.1%
Other values (675) 675
98.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
685 1
0.1%
684 1
0.1%
683 1
0.1%
682 1
0.1%
681 1
0.1%
680 1
0.1%
679 1
0.1%
678 1
0.1%
677 1
0.1%
676 1
0.1%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
상장
362 
표창
323 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row표창
2nd row표창
3rd row표창
4th row표창
5th row표창

Common Values

ValueCountFrequency (%)
상장 362
52.8%
표창 323
47.2%

Length

2023-12-12T16:54:02.891994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:54:03.002291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상장 362
52.8%
표창 323
47.2%
Distinct682
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T16:54:03.296549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length42
Mean length14.252555
Min length4

Characters and Unicode

Total characters9763
Distinct characters465
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique679 ?
Unique (%)99.1%

Sample

1st row정책기획 및 수행 유공
2nd row예·결산 업무유공
3rd row정부업무 평가유공
4th row정부혁신유공
5th row공공기관 혁신유공
ValueCountFrequency (%)
유공 186
 
9.8%
경진대회 42
 
2.2%
공모전 37
 
1.9%
32
 
1.7%
대한민국 26
 
1.4%
2023 26
 
1.4%
대상 15
 
0.8%
ict 15
 
0.8%
인공지능 13
 
0.7%
활성화 12
 
0.6%
Other values (1132) 1502
78.8%
2023-12-12T16:54:03.876283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1242
 
12.7%
438
 
4.5%
332
 
3.4%
238
 
2.4%
187
 
1.9%
174
 
1.8%
172
 
1.8%
149
 
1.5%
148
 
1.5%
120
 
1.2%
Other values (455) 6563
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7264
74.4%
Space Separator 1242
 
12.7%
Uppercase Letter 615
 
6.3%
Decimal Number 237
 
2.4%
Lowercase Letter 214
 
2.2%
Other Punctuation 75
 
0.8%
Open Punctuation 47
 
0.5%
Close Punctuation 47
 
0.5%
Dash Punctuation 21
 
0.2%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
438
 
6.0%
332
 
4.6%
238
 
3.3%
187
 
2.6%
174
 
2.4%
172
 
2.4%
149
 
2.1%
148
 
2.0%
120
 
1.7%
115
 
1.6%
Other values (383) 5191
71.5%
Uppercase Letter
ValueCountFrequency (%)
I 86
14.0%
S 80
13.0%
T 70
11.4%
C 57
9.3%
A 55
8.9%
W 44
 
7.2%
D 32
 
5.2%
R 25
 
4.1%
O 24
 
3.9%
K 21
 
3.4%
Other values (14) 121
19.7%
Lowercase Letter
ValueCountFrequency (%)
a 31
14.5%
e 26
12.1%
t 22
10.3%
o 17
 
7.9%
r 17
 
7.9%
c 14
 
6.5%
n 13
 
6.1%
i 12
 
5.6%
h 9
 
4.2%
w 8
 
3.7%
Other values (14) 45
21.0%
Decimal Number
ValueCountFrequency (%)
2 102
43.0%
0 47
19.8%
3 43
18.1%
1 16
 
6.8%
4 9
 
3.8%
5 6
 
2.5%
6 5
 
2.1%
9 4
 
1.7%
7 3
 
1.3%
8 2
 
0.8%
Other Punctuation
ValueCountFrequency (%)
· 22
29.3%
, 14
18.7%
. 13
17.3%
& 10
13.3%
/ 6
 
8.0%
' 4
 
5.3%
" 4
 
5.3%
1
 
1.3%
! 1
 
1.3%
Space Separator
ValueCountFrequency (%)
1242
100.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7263
74.4%
Common 1670
 
17.1%
Latin 829
 
8.5%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
438
 
6.0%
332
 
4.6%
238
 
3.3%
187
 
2.6%
174
 
2.4%
172
 
2.4%
149
 
2.1%
148
 
2.0%
120
 
1.7%
115
 
1.6%
Other values (382) 5190
71.5%
Latin
ValueCountFrequency (%)
I 86
 
10.4%
S 80
 
9.7%
T 70
 
8.4%
C 57
 
6.9%
A 55
 
6.6%
W 44
 
5.3%
D 32
 
3.9%
a 31
 
3.7%
e 26
 
3.1%
R 25
 
3.0%
Other values (38) 323
39.0%
Common
ValueCountFrequency (%)
1242
74.4%
2 102
 
6.1%
( 47
 
2.8%
) 47
 
2.8%
0 47
 
2.8%
3 43
 
2.6%
· 22
 
1.3%
- 21
 
1.3%
1 16
 
1.0%
, 14
 
0.8%
Other values (14) 69
 
4.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7263
74.4%
ASCII 2476
 
25.4%
None 23
 
0.2%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1242
50.2%
2 102
 
4.1%
I 86
 
3.5%
S 80
 
3.2%
T 70
 
2.8%
C 57
 
2.3%
A 55
 
2.2%
( 47
 
1.9%
) 47
 
1.9%
0 47
 
1.9%
Other values (60) 643
26.0%
Hangul
ValueCountFrequency (%)
438
 
6.0%
332
 
4.6%
238
 
3.3%
187
 
2.6%
174
 
2.4%
172
 
2.4%
149
 
2.1%
148
 
2.0%
120
 
1.7%
115
 
1.6%
Other values (382) 5190
71.5%
None
ValueCountFrequency (%)
· 22
95.7%
1
 
4.3%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T16:54:02.296282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:54:03.984087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분
순번1.0000.998
구분0.9981.000
2023-12-12T16:54:04.077746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분
순번1.0000.951
구분0.9511.000

Missing values

2023-12-12T16:54:02.426803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:54:02.494342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번구분공적분야
01표창정책기획 및 수행 유공
12표창예·결산 업무유공
23표창정부업무 평가유공
34표창정부혁신유공
45표창공공기관 혁신유공
56표창제안제도 유공
67표창국회업무유공
78표창규제개혁 및 법제업무유공
89표창민원서비스유공
910표창정보화유공
순번구분공적분야
675676상장2022 SF공모전
676677상장전국청소년과학송경연대회
677678상장꼬마피카소 그림그리기 대회
678679상장어린이 과학 미션캠프
679680상장우주전파재난 예측 인공지능 경진대회
680681상장양자컴퓨터활용경진대회
681682상장출연연 테크노믹스 오디션
682683상장_2023년 제11회 전국대학교 스마트로봇 경진대회, 2023년 제6회 전국대학교 인공지능 드론 경진대회
683684상장대학정보보호동아리
684685상장청소년정보보호페스티벌