Overview

Dataset statistics

Number of variables4
Number of observations622
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.8 KiB
Average record size in memory34.2 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description중소벤처기업진흥공단 컨설팅 대학원 사업 지원을 통해 발간된 연구논문 발간 리스트(순번, 논문명, 년도, 대학명)에 관한 데이터
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15018348/fileData.do

Alerts

순번 is highly overall correlated with 년도High correlation
년도 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2024-04-19 06:56:23.842776
Analysis finished2024-04-19 06:56:24.639115
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct622
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean311.5
Minimum1
Maximum622
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2024-04-19T15:56:24.715898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile32.05
Q1156.25
median311.5
Q3466.75
95-th percentile590.95
Maximum622
Range621
Interquartile range (IQR)310.5

Descriptive statistics

Standard deviation179.70021
Coefficient of variation (CV)0.57688672
Kurtosis-1.2
Mean311.5
Median Absolute Deviation (MAD)155.5
Skewness0
Sum193753
Variance32292.167
MonotonicityStrictly increasing
2024-04-19T15:56:24.873389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
419 1
 
0.2%
412 1
 
0.2%
413 1
 
0.2%
414 1
 
0.2%
415 1
 
0.2%
416 1
 
0.2%
417 1
 
0.2%
418 1
 
0.2%
420 1
 
0.2%
Other values (612) 612
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
622 1
0.2%
621 1
0.2%
620 1
0.2%
619 1
0.2%
618 1
0.2%
617 1
0.2%
616 1
0.2%
615 1
0.2%
614 1
0.2%
613 1
0.2%
Distinct616
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2024-04-19T15:56:25.217955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length211
Median length134
Mean length52.916399
Min length13

Characters and Unicode

Total characters32914
Distinct characters527
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique610 ?
Unique (%)98.1%

Sample

1st row국내 프랜차이즈 기업의 원가 행태에 관한 연구
2nd rowCritical Success Factors (CSF) on eCommerce Adoption in Banglade
3rd row학생들의 참여가 회계교육에 미치는 영향
4th rowThe effect of personal value on CSV(creating shared value)
5th row3D 프린팅 기술을 활용한 음파 증폭 무지향성 스피커 구조 및 조형에 관한 연구
ValueCountFrequency (%)
연구 209
 
3.5%
미치는 180
 
3.0%
관한 177
 
3.0%
of 165
 
2.8%
on 141
 
2.4%
the 112
 
1.9%
and 94
 
1.6%
영향에 83
 
1.4%
영향 80
 
1.3%
64
 
1.1%
Other values (2382) 4629
78.0%
2024-04-19T15:56:26.002976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5364
 
16.3%
e 1440
 
4.4%
n 1350
 
4.1%
o 1271
 
3.9%
t 1145
 
3.5%
i 1037
 
3.2%
a 941
 
2.9%
r 752
 
2.3%
s 750
 
2.3%
c 624
 
1.9%
Other values (517) 18240
55.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 12907
39.2%
Other Letter 12597
38.3%
Space Separator 5364
16.3%
Uppercase Letter 1788
 
5.4%
Other Punctuation 124
 
0.4%
Decimal Number 63
 
0.2%
Open Punctuation 28
 
0.1%
Close Punctuation 28
 
0.1%
Final Punctuation 11
 
< 0.1%
Dash Punctuation 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
469
 
3.7%
440
 
3.5%
365
 
2.9%
319
 
2.5%
311
 
2.5%
301
 
2.4%
296
 
2.3%
275
 
2.2%
271
 
2.2%
269
 
2.1%
Other values (445) 9281
73.7%
Lowercase Letter
ValueCountFrequency (%)
e 1440
11.2%
n 1350
10.5%
o 1271
9.8%
t 1145
 
8.9%
i 1037
 
8.0%
a 941
 
7.3%
r 752
 
5.8%
s 750
 
5.8%
c 624
 
4.8%
l 521
 
4.0%
Other values (16) 3076
23.8%
Uppercase Letter
ValueCountFrequency (%)
C 231
12.9%
S 200
11.2%
A 145
 
8.1%
E 135
 
7.6%
P 134
 
7.5%
I 128
 
7.2%
T 119
 
6.7%
M 112
 
6.3%
R 96
 
5.4%
F 77
 
4.3%
Other values (16) 411
23.0%
Decimal Number
ValueCountFrequency (%)
1 20
31.7%
2 17
27.0%
0 7
 
11.1%
6 6
 
9.5%
3 5
 
7.9%
5 4
 
6.3%
9 3
 
4.8%
7 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 77
62.1%
& 20
 
16.1%
· 11
 
8.9%
' 10
 
8.1%
. 6
 
4.8%
Space Separator
ValueCountFrequency (%)
5364
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Final Punctuation
ValueCountFrequency (%)
11
100.0%
Dash Punctuation
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 14695
44.6%
Hangul 12540
38.1%
Common 5622
 
17.1%
Han 57
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
469
 
3.7%
440
 
3.5%
365
 
2.9%
319
 
2.5%
311
 
2.5%
301
 
2.4%
296
 
2.4%
275
 
2.2%
271
 
2.2%
269
 
2.1%
Other values (399) 9224
73.6%
Latin
ValueCountFrequency (%)
e 1440
 
9.8%
n 1350
 
9.2%
o 1271
 
8.6%
t 1145
 
7.8%
i 1037
 
7.1%
a 941
 
6.4%
r 752
 
5.1%
s 750
 
5.1%
c 624
 
4.2%
l 521
 
3.5%
Other values (42) 4864
33.1%
Han
ValueCountFrequency (%)
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
Other values (36) 36
63.2%
Common
ValueCountFrequency (%)
5364
95.4%
, 77
 
1.4%
( 28
 
0.5%
) 28
 
0.5%
1 20
 
0.4%
& 20
 
0.4%
2 17
 
0.3%
11
 
0.2%
· 11
 
0.2%
' 10
 
0.2%
Other values (10) 36
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20293
61.7%
Hangul 12540
38.1%
CJK 56
 
0.2%
Punctuation 13
 
< 0.1%
None 11
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5364
26.4%
e 1440
 
7.1%
n 1350
 
6.7%
o 1271
 
6.3%
t 1145
 
5.6%
i 1037
 
5.1%
a 941
 
4.6%
r 752
 
3.7%
s 750
 
3.7%
c 624
 
3.1%
Other values (59) 5619
27.7%
Hangul
ValueCountFrequency (%)
469
 
3.7%
440
 
3.5%
365
 
2.9%
319
 
2.5%
311
 
2.5%
301
 
2.4%
296
 
2.4%
275
 
2.2%
271
 
2.2%
269
 
2.1%
Other values (399) 9224
73.6%
Punctuation
ValueCountFrequency (%)
11
84.6%
2
 
15.4%
None
ValueCountFrequency (%)
· 11
100.0%
CJK
ValueCountFrequency (%)
3
 
5.4%
3
 
5.4%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
2
 
3.6%
1
 
1.8%
Other values (35) 35
62.5%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

년도
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2014.6061
Minimum2009
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2024-04-19T15:56:26.122065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2009
5-th percentile2010
Q12012
median2015
Q32017
95-th percentile2019
Maximum2019
Range10
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.6932542
Coefficient of variation (CV)0.0013368639
Kurtosis-0.81920094
Mean2014.6061
Median Absolute Deviation (MAD)2
Skewness-0.21204038
Sum1253085
Variance7.253618
MonotonicityDecreasing
2024-04-19T15:56:26.224030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2015 129
20.7%
2016 69
11.1%
2013 68
10.9%
2012 65
10.5%
2017 62
10.0%
2018 59
9.5%
2019 46
 
7.4%
2011 46
 
7.4%
2014 32
 
5.1%
2010 24
 
3.9%
ValueCountFrequency (%)
2009 22
 
3.5%
2010 24
 
3.9%
2011 46
 
7.4%
2012 65
10.5%
2013 68
10.9%
2014 32
 
5.1%
2015 129
20.7%
2016 69
11.1%
2017 62
10.0%
2018 59
9.5%
ValueCountFrequency (%)
2019 46
 
7.4%
2018 59
9.5%
2017 62
10.0%
2016 69
11.1%
2015 129
20.7%
2014 32
 
5.1%
2013 68
10.9%
2012 65
10.5%
2011 46
 
7.4%
2010 24
 
3.9%

대학교
Categorical

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
한성대
289 
금오공대
132 
대전대
101 
한양대
100 

Length

Max length4
Median length3
Mean length3.2122186
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금오공대
2nd row금오공대
3rd row금오공대
4th row금오공대
5th row금오공대

Common Values

ValueCountFrequency (%)
한성대 289
46.5%
금오공대 132
21.2%
대전대 101
 
16.2%
한양대 100
 
16.1%

Length

2024-04-19T15:56:26.413475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:56:26.584154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한성대 289
46.5%
금오공대 132
21.2%
대전대 101
 
16.2%
한양대 100
 
16.1%

Interactions

2024-04-19T15:56:24.313470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:56:24.154609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:56:24.406404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:56:24.237159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T15:56:26.670884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번년도대학교
순번1.0000.9750.544
년도0.9751.0000.525
대학교0.5440.5251.000
2024-04-19T15:56:26.784876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번년도대학교
순번1.000-0.9920.355
년도-0.9921.0000.337
대학교0.3550.3371.000

Missing values

2024-04-19T15:56:24.518828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T15:56:24.604937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번논문명년도대학교
01국내 프랜차이즈 기업의 원가 행태에 관한 연구2019금오공대
12Critical Success Factors (CSF) on eCommerce Adoption in Banglade2019금오공대
23학생들의 참여가 회계교육에 미치는 영향2019금오공대
34The effect of personal value on CSV(creating shared value)2019금오공대
453D 프린팅 기술을 활용한 음파 증폭 무지향성 스피커 구조 및 조형에 관한 연구2019금오공대
56Virtual RealityBased Ergonomic Modeling and Evaluation Framework2019금오공대
67surface reconstruction from FE mesh model2019금오공대
78경량 모델을 활용한 해양구조물 시공 용접장 산출2019금오공대
89인력구조조정이 구성원의 정서적 몰입에 미치는 영향 고용불안정성 지각의 매개효과와 경력정체의 조절효과2019대전대
910근거이론을 통한 일학습병행의 성공적 정착 및 지속발전 모형 연구2019대전대
순번논문명년도대학교
612613중소소매점의 영업특성과 경쟁점포 출점의 영업위협도에 관한 연구2009한성대
613614중소컨설팅기업의 CRS경영이 재무성과에 미치는 영향에 관한 연구2009한성대
614615중소컨설팅기업의 R&D활동과 제품화 능력이 경쟁력에 미치는 영향2009한성대
615616중소컨설팅기업의 마케팅 능력과 분석이 시장경쟁력에 미치는 영향2009한성대
616617중소컨설팅기업의 생산력과 조직혁신체제가 기술축적에 미치는 영향2009한성대
617618중소컨설팅기업의 인프라 구축과 생산능력이 기술정책성과에 미치는 영향2009한성대
618619지식재산정보의 서비스 품질이 고객만족과 고객 충성도에 미치는 영향에 관한 연구2009한성대
619620컨설팅 관계혜택이 고객충성도에 미치는 영향2009한성대
620621컨설팅 제조기업의 경영능력과 기술축적이 기업내부성과에 미치는 영향2009한성대
621622쿠폰제 컨설팅 성공요인에 관한 연구2009한성대