Overview

Dataset statistics

Number of variables6
Number of observations280
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.8 KiB
Average record size in memory50.5 B

Variable types

Categorical3
Text2
Numeric1

Alerts

집계년도 is highly overall correlated with 실과명High correlation
실과명 is highly overall correlated with 집계년도 and 1 other fieldsHigh correlation
실국명 is highly overall correlated with 실과명High correlation

Reproduction

Analysis started2023-12-10 22:31:39.193312
Analysis finished2023-12-10 22:31:39.827413
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2015
154 
2014
126 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015
2nd row2015
3rd row2015
4th row2015
5th row2015

Common Values

ValueCountFrequency (%)
2015 154
55.0%
2014 126
45.0%

Length

2023-12-11T07:31:39.891562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:31:39.968204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2015 154
55.0%
2014 126
45.0%
Distinct145
Distinct (%)51.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-11T07:31:40.109003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length19
Mean length11.732143
Min length3

Characters and Unicode

Total characters3285
Distinct characters259
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)32.1%

Sample

1st row(사)경기벤처기업협회
2nd row(사)경기중소기업연합회
3rd row한민족민속6일장문화계승전국중앙협의회경기도지부
4th row한국소비자연맹경기지회
5th row(사)경기중소기업연합회
ValueCountFrequency (%)
경기도새마을회 12
 
4.3%
광복회경기도지부 11
 
3.9%
무공수훈자회경기도지부 9
 
3.2%
전몰군경유족회경기도지부 8
 
2.9%
상이군경회경기도지부 7
 
2.5%
한국자유총연맹경기도지부 7
 
2.5%
6.15공동선언실천남측위원회경기본부 7
 
2.5%
특수임무유공자회경기도지부 6
 
2.1%
월남전참전자회경기도지부 6
 
2.1%
전몰군경미망인회경기도지부 6
 
2.1%
Other values (133) 201
71.8%
2023-12-11T07:31:40.381307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
267
 
8.1%
254
 
7.7%
224
 
6.8%
170
 
5.2%
121
 
3.7%
118
 
3.6%
96
 
2.9%
) 75
 
2.3%
( 71
 
2.2%
63
 
1.9%
Other values (249) 1826
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3025
92.1%
Close Punctuation 75
 
2.3%
Open Punctuation 71
 
2.2%
Decimal Number 65
 
2.0%
Uppercase Letter 29
 
0.9%
Other Punctuation 20
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
267
 
8.8%
254
 
8.4%
224
 
7.4%
170
 
5.6%
121
 
4.0%
118
 
3.9%
96
 
3.2%
63
 
2.1%
62
 
2.0%
53
 
1.8%
Other values (224) 1597
52.8%
Uppercase Letter
ValueCountFrequency (%)
S 4
13.8%
C 4
13.8%
O 3
10.3%
T 2
6.9%
A 2
6.9%
I 2
6.9%
W 2
6.9%
Y 2
6.9%
E 2
6.9%
U 2
6.9%
Other values (3) 4
13.8%
Decimal Number
ValueCountFrequency (%)
6 22
33.8%
5 13
20.0%
2 12
18.5%
1 10
15.4%
4 2
 
3.1%
9 2
 
3.1%
7 2
 
3.1%
0 1
 
1.5%
8 1
 
1.5%
Close Punctuation
ValueCountFrequency (%)
) 75
100.0%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Other Punctuation
ValueCountFrequency (%)
. 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3025
92.1%
Common 231
 
7.0%
Latin 29
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
267
 
8.8%
254
 
8.4%
224
 
7.4%
170
 
5.6%
121
 
4.0%
118
 
3.9%
96
 
3.2%
63
 
2.1%
62
 
2.0%
53
 
1.8%
Other values (224) 1597
52.8%
Latin
ValueCountFrequency (%)
S 4
13.8%
C 4
13.8%
O 3
10.3%
T 2
6.9%
A 2
6.9%
I 2
6.9%
W 2
6.9%
Y 2
6.9%
E 2
6.9%
U 2
6.9%
Other values (3) 4
13.8%
Common
ValueCountFrequency (%)
) 75
32.5%
( 71
30.7%
6 22
 
9.5%
. 20
 
8.7%
5 13
 
5.6%
2 12
 
5.2%
1 10
 
4.3%
4 2
 
0.9%
9 2
 
0.9%
7 2
 
0.9%
Other values (2) 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3025
92.1%
ASCII 260
 
7.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
267
 
8.8%
254
 
8.4%
224
 
7.4%
170
 
5.6%
121
 
4.0%
118
 
3.9%
96
 
3.2%
63
 
2.1%
62
 
2.0%
53
 
1.8%
Other values (224) 1597
52.8%
ASCII
ValueCountFrequency (%)
) 75
28.8%
( 71
27.3%
6 22
 
8.5%
. 20
 
7.7%
5 13
 
5.0%
2 12
 
4.6%
1 10
 
3.8%
S 4
 
1.5%
C 4
 
1.5%
O 3
 
1.2%
Other values (15) 26
 
10.0%

실국명
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
보건복지국
88 
자치행정국
67 
문화체육관광국
40 
경제투자실
14 
여성가족국
13 
Other values (14)
58 

Length

Max length8
Median length5
Mean length5.1857143
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경제실
2nd row경제실
3rd row경제실
4th row경제실
5th row경제실

Common Values

ValueCountFrequency (%)
보건복지국 88
31.4%
자치행정국 67
23.9%
문화체육관광국 40
14.3%
경제투자실 14
 
5.0%
여성가족국 13
 
4.6%
경제실 11
 
3.9%
대변인실 7
 
2.5%
교통국 6
 
2.1%
복지여성실 5
 
1.8%
균형발전기획실 4
 
1.4%
Other values (9) 25
 
8.9%

Length

2023-12-11T07:31:40.490262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보건복지국 88
31.4%
자치행정국 67
23.9%
문화체육관광국 40
14.3%
경제투자실 14
 
5.0%
여성가족국 13
 
4.6%
경제실 11
 
3.9%
대변인실 7
 
2.5%
교통국 6
 
2.1%
복지여성실 5
 
1.8%
축산산림국 4
 
1.4%
Other values (9) 25
 
8.9%

실과명
Categorical

HIGH CORRELATION 

Distinct37
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
<NA>
126 
무한돌봄복지과
38 
자치행정과
33 
문화정책과
15 
기업지원과
 
7
Other values (32)
61 

Length

Max length10
Median length9
Mean length4.9714286
Min length3

Unique

Unique16 ?
Unique (%)5.7%

Sample

1st row기업지원과
2nd row기업지원과
3rd row사회적경제과
4th row사회적경제과
5th row기업지원과

Common Values

ValueCountFrequency (%)
<NA> 126
45.0%
무한돌봄복지과 38
 
13.6%
자치행정과 33
 
11.8%
문화정책과 15
 
5.4%
기업지원과 7
 
2.5%
장애인복지과 6
 
2.1%
교통정책과 5
 
1.8%
언론담당관 4
 
1.4%
아동청소년과 4
 
1.4%
통일기반조성담당관실 3
 
1.1%
Other values (27) 39
 
13.9%

Length

2023-12-11T07:31:40.595763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 126
44.7%
무한돌봄복지과 38
 
13.5%
자치행정과 33
 
11.7%
문화정책과 15
 
5.3%
기업지원과 7
 
2.5%
장애인복지과 6
 
2.1%
교통정책과 5
 
1.8%
언론담당관 4
 
1.4%
아동청소년과 4
 
1.4%
통일기반조성담당관실 3
 
1.1%
Other values (28) 41
 
14.5%
Distinct249
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-11T07:31:40.814210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length39
Mean length17.871429
Min length3

Characters and Unicode

Total characters5004
Distinct characters428
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique224 ?
Unique (%)80.0%

Sample

1st row중소·벤처기업 성공전략 설명회
2nd row경기중소기업 경쟁력 강화 프로젝트
3rd row「친절?청결?질서」 교육?홍보?실천 캠페인 운동전개
4th row온라인 소비자고발센터 운영
5th row중소기업 애로 발굴 토크콘서트
ValueCountFrequency (%)
57
 
5.4%
위한 17
 
1.6%
순례 16
 
1.5%
전적지 14
 
1.3%
경기도 14
 
1.3%
운영비 12
 
1.1%
청소년 11
 
1.1%
교육 8
 
0.8%
캠페인 8
 
0.8%
8
 
0.8%
Other values (615) 881
84.2%
2023-12-11T07:31:41.167796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
853
 
17.0%
126
 
2.5%
112
 
2.2%
100
 
2.0%
82
 
1.6%
68
 
1.4%
64
 
1.3%
62
 
1.2%
62
 
1.2%
61
 
1.2%
Other values (418) 3414
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3749
74.9%
Space Separator 853
 
17.0%
Decimal Number 208
 
4.2%
Other Punctuation 80
 
1.6%
Uppercase Letter 42
 
0.8%
Close Punctuation 25
 
0.5%
Open Punctuation 24
 
0.5%
Lowercase Letter 20
 
0.4%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
3.4%
112
 
3.0%
100
 
2.7%
82
 
2.2%
68
 
1.8%
64
 
1.7%
62
 
1.7%
62
 
1.7%
61
 
1.6%
60
 
1.6%
Other values (365) 2952
78.7%
Uppercase Letter
ValueCountFrequency (%)
O 11
26.2%
E 5
11.9%
C 5
11.9%
K 4
 
9.5%
B 4
 
9.5%
I 2
 
4.8%
S 2
 
4.8%
U 1
 
2.4%
T 1
 
2.4%
J 1
 
2.4%
Other values (6) 6
14.3%
Lowercase Letter
ValueCountFrequency (%)
a 3
15.0%
t 3
15.0%
s 2
10.0%
e 2
10.0%
m 2
10.0%
r 2
10.0%
p 1
 
5.0%
f 1
 
5.0%
n 1
 
5.0%
c 1
 
5.0%
Other values (2) 2
10.0%
Decimal Number
ValueCountFrequency (%)
1 47
22.6%
2 38
18.3%
5 32
15.4%
0 26
12.5%
6 17
 
8.2%
4 16
 
7.7%
3 13
 
6.2%
8 8
 
3.8%
9 7
 
3.4%
7 4
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 20
25.0%
· 18
22.5%
" 14
17.5%
' 10
12.5%
. 6
 
7.5%
! 5
 
6.2%
? 5
 
6.2%
/ 1
 
1.2%
: 1
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 23
92.0%
2
 
8.0%
Open Punctuation
ValueCountFrequency (%)
( 22
91.7%
2
 
8.3%
Space Separator
ValueCountFrequency (%)
853
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3744
74.8%
Common 1193
 
23.8%
Latin 62
 
1.2%
Han 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
3.4%
112
 
3.0%
100
 
2.7%
82
 
2.2%
68
 
1.8%
64
 
1.7%
62
 
1.7%
62
 
1.7%
61
 
1.6%
60
 
1.6%
Other values (360) 2947
78.7%
Latin
ValueCountFrequency (%)
O 11
17.7%
E 5
 
8.1%
C 5
 
8.1%
K 4
 
6.5%
B 4
 
6.5%
a 3
 
4.8%
t 3
 
4.8%
I 2
 
3.2%
S 2
 
3.2%
s 2
 
3.2%
Other values (18) 21
33.9%
Common
ValueCountFrequency (%)
853
71.5%
1 47
 
3.9%
2 38
 
3.2%
5 32
 
2.7%
0 26
 
2.2%
) 23
 
1.9%
( 22
 
1.8%
, 20
 
1.7%
· 18
 
1.5%
6 17
 
1.4%
Other values (15) 97
 
8.1%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3744
74.8%
ASCII 1233
 
24.6%
None 22
 
0.4%
CJK 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
853
69.2%
1 47
 
3.8%
2 38
 
3.1%
5 32
 
2.6%
0 26
 
2.1%
) 23
 
1.9%
( 22
 
1.8%
, 20
 
1.6%
6 17
 
1.4%
4 16
 
1.3%
Other values (40) 139
 
11.3%
Hangul
ValueCountFrequency (%)
126
 
3.4%
112
 
3.0%
100
 
2.7%
82
 
2.2%
68
 
1.8%
64
 
1.7%
62
 
1.7%
62
 
1.7%
61
 
1.6%
60
 
1.6%
Other values (360) 2947
78.7%
None
ValueCountFrequency (%)
· 18
81.8%
2
 
9.1%
2
 
9.1%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

지원액(천원)
Real number (ℝ)

Distinct135
Distinct (%)48.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6195.7857
Minimum0
Maximum45000
Zeros2
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-11T07:31:41.284194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1000
Q12500
median4066.5
Q36800
95-th percentile20050
Maximum45000
Range45000
Interquartile range (IQR)4300

Descriptive statistics

Standard deviation7264.4392
Coefficient of variation (CV)1.1724807
Kurtosis12.550407
Mean6195.7857
Median Absolute Deviation (MAD)1995
Skewness3.3764714
Sum1734820
Variance52772077
MonotonicityNot monotonic
2023-12-11T07:31:41.393533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000 18
 
6.4%
2000 17
 
6.1%
10000 15
 
5.4%
3000 15
 
5.4%
4000 11
 
3.9%
1500 9
 
3.2%
3500 7
 
2.5%
1000 6
 
2.1%
5500 6
 
2.1%
6500 5
 
1.8%
Other values (125) 171
61.1%
ValueCountFrequency (%)
0 2
 
0.7%
500 2
 
0.7%
600 2
 
0.7%
700 1
 
0.4%
760 1
 
0.4%
950 1
 
0.4%
1000 6
2.1%
1100 2
 
0.7%
1200 1
 
0.4%
1210 1
 
0.4%
ValueCountFrequency (%)
45000 1
 
0.4%
42960 1
 
0.4%
40000 3
1.1%
39500 1
 
0.4%
36000 1
 
0.4%
35200 1
 
0.4%
30000 1
 
0.4%
29000 1
 
0.4%
26000 1
 
0.4%
25080 1
 
0.4%

Interactions

2023-12-11T07:31:39.610645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:31:41.463357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집계년도실국명실과명지원액(천원)
집계년도1.0000.435NaN0.438
실국명0.4351.0001.0000.420
실과명NaN1.0001.0000.000
지원액(천원)0.4380.4200.0001.000
2023-12-11T07:31:41.543851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집계년도실과명실국명
집계년도1.0001.0000.374
실과명1.0001.0000.925
실국명0.3740.9251.000
2023-12-11T07:31:41.612358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지원액(천원)집계년도실국명실과명
지원액(천원)1.0000.3320.1670.000
집계년도0.3321.0000.3741.000
실국명0.1670.3741.0000.925
실과명0.0001.0000.9251.000

Missing values

2023-12-11T07:31:39.700896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:31:39.785467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년도단체명실국명실과명사업내용지원액(천원)
02015(사)경기벤처기업협회경제실기업지원과중소·벤처기업 성공전략 설명회5000
12015(사)경기중소기업연합회경제실기업지원과경기중소기업 경쟁력 강화 프로젝트4800
22015한민족민속6일장문화계승전국중앙협의회경기도지부경제실사회적경제과「친절?청결?질서」 교육?홍보?실천 캠페인 운동전개5000
32015한국소비자연맹경기지회경제실사회적경제과온라인 소비자고발센터 운영10000
42015(사)경기중소기업연합회경제실기업지원과중소기업 애로 발굴 토크콘서트5000
52015(사)경기벤처기업협회경제실기업지원과2015년 경기벤처기업인의 날9350
62015사)대한웅변협회경기도본부경제실에너지과제18회 에너지절약 나의주장·웅변·글짓기 대회5000
72015환경을사랑하는사람들의모임경제실산업정책과저탄소 녹색에너지 전문강사양성 및 에너지절약 홍보교육12000
82015(사)중소기업융합경기연합회경제실기업지원과중소기업 CEO 융합 워크샵9400
92015(사)IT여성기업인협회경기지회경제실기업지원과여성CEO 경쟁력 강화를 위한 스마트 경영시뮬레이션 포럼4300
집계년도단체명실국명실과명사업내용지원액(천원)
2702014미수복경기도중앙도민회자치행정국<NA>제32회 대통령기 이북도민체육대회1500
2712014미수복경기도중앙도민회자치행정국<NA>제15회 도민의날 대회3500
2722014민주평화통일자문회의경기지역회의자치행정국<NA>평화통일사업 추진을 위한 사무실 운영비10000
2732014민주평화통일자문회의경기지역회의자치행정국<NA>안보평화통일현장 견학1500
2742014경기도지방행정동우회자치행정국<NA>산불예방 캠페인 운동4600
2752014경기도지방행정동우회자치행정국<NA>문화유적지 질서 청결운동9600
2762014경기국학운동시민연합자치행정국<NA>신나는 역사문화 체험특강1000
2772014대한적십자사경기지사자치행정국<NA>제12회 희망나눔 1m 1원 자선걷기대회5000
2782014경기도지방행정동우회자치행정국<NA>도정 시책 홍보를 위한 인터넷 교육5500
2792014(사)자연보호중앙연맹경기도협의회환경국<NA>자연보호헌장 선포 제36주년 기념행사6500