Overview

Dataset statistics

Number of variables4
Number of observations520
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)0.6%
Total size in memory16.9 KiB
Average record size in memory33.3 B

Variable types

DateTime1
Text1
Numeric1
Categorical1

Dataset

Description한국가스공사 기부금 지원 내역 데이터 현황으로, 기부일자와 수령인, 기부금액, 기부사유를 나타내고 2022년 1월부터 2022년 12월까지의 내역을 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15068076/fileData.do

Alerts

Dataset has 3 (0.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 02:46:26.625557
Analysis finished2023-12-12 02:46:27.140998
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct162
Distinct (%)31.2%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
Minimum2022-01-17 00:00:00
Maximum2022-12-30 00:00:00
2023-12-12T11:46:27.239556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:46:27.427468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct281
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T11:46:27.727966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length19
Mean length9.5769231
Min length2

Characters and Unicode

Total characters4980
Distinct characters295
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)38.7%

Sample

1st row통영시종합사회복지관
2nd row참사랑
3rd row자생원
4th row통영시사회복지협의회
5th row신애원
ValueCountFrequency (%)
사회복지법인 49
 
7.2%
사회복지공동모금회 41
 
6.0%
대한적십자사 23
 
3.4%
사단법인 19
 
2.8%
당진시복지재단 18
 
2.6%
삼척시사회복지협의회 18
 
2.6%
사)한국자원봉사센터협회 14
 
2.1%
재단법인 13
 
1.9%
통영시사회복지협의회 13
 
1.9%
삼척시노인복지관 11
 
1.6%
Other values (289) 462
67.8%
2023-12-12T11:46:28.115071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
348
 
7.0%
305
 
6.1%
301
 
6.0%
234
 
4.7%
161
 
3.2%
152
 
3.1%
141
 
2.8%
112
 
2.2%
108
 
2.2%
* 104
 
2.1%
Other values (285) 3014
60.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4457
89.5%
Decimal Number 172
 
3.5%
Space Separator 161
 
3.2%
Other Punctuation 106
 
2.1%
Close Punctuation 39
 
0.8%
Open Punctuation 37
 
0.7%
Uppercase Letter 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
348
 
7.8%
305
 
6.8%
301
 
6.8%
234
 
5.3%
152
 
3.4%
141
 
3.2%
112
 
2.5%
108
 
2.4%
92
 
2.1%
86
 
1.9%
Other values (266) 2578
57.8%
Decimal Number
ValueCountFrequency (%)
1 37
21.5%
2 24
14.0%
3 21
12.2%
8 21
12.2%
7 20
11.6%
5 19
11.0%
0 13
 
7.6%
6 8
 
4.7%
9 5
 
2.9%
4 4
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
A 2
25.0%
C 2
25.0%
M 2
25.0%
Y 2
25.0%
Other Punctuation
ValueCountFrequency (%)
* 104
98.1%
· 2
 
1.9%
Space Separator
ValueCountFrequency (%)
161
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4457
89.5%
Common 515
 
10.3%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
348
 
7.8%
305
 
6.8%
301
 
6.8%
234
 
5.3%
152
 
3.4%
141
 
3.2%
112
 
2.5%
108
 
2.4%
92
 
2.1%
86
 
1.9%
Other values (266) 2578
57.8%
Common
ValueCountFrequency (%)
161
31.3%
* 104
20.2%
) 39
 
7.6%
1 37
 
7.2%
( 37
 
7.2%
2 24
 
4.7%
3 21
 
4.1%
8 21
 
4.1%
7 20
 
3.9%
5 19
 
3.7%
Other values (5) 32
 
6.2%
Latin
ValueCountFrequency (%)
A 2
25.0%
C 2
25.0%
M 2
25.0%
Y 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4457
89.5%
ASCII 521
 
10.5%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
348
 
7.8%
305
 
6.8%
301
 
6.8%
234
 
5.3%
152
 
3.4%
141
 
3.2%
112
 
2.5%
108
 
2.4%
92
 
2.1%
86
 
1.9%
Other values (266) 2578
57.8%
ASCII
ValueCountFrequency (%)
161
30.9%
* 104
20.0%
) 39
 
7.5%
1 37
 
7.1%
( 37
 
7.1%
2 24
 
4.6%
3 21
 
4.0%
8 21
 
4.0%
7 20
 
3.8%
5 19
 
3.6%
Other values (8) 38
 
7.3%
None
ValueCountFrequency (%)
· 2
100.0%

기부금액(원)
Real number (ℝ)

Distinct196
Distinct (%)37.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29744068
Minimum33100
Maximum2.439302 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.7 KiB
2023-12-12T11:46:28.262610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33100
5-th percentile76202.6
Q11000000
median2114800
Q34388900
95-th percentile60925750
Maximum2.439302 × 109
Range2.4392688 × 109
Interquartile range (IQR)3388900

Descriptive statistics

Standard deviation1.755917 × 108
Coefficient of variation (CV)5.9034191
Kurtosis108.47263
Mean29744068
Median Absolute Deviation (MAD)1414800
Skewness9.683602
Sum1.5466915 × 1010
Variance3.0832445 × 1016
MonotonicityNot monotonic
2023-12-12T11:46:28.794460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000000 67
 
12.9%
3000000 61
 
11.7%
1000000 56
 
10.8%
5000000 44
 
8.5%
4000000 31
 
6.0%
9000000 9
 
1.7%
1500000 9
 
1.7%
2500000 8
 
1.5%
6000000 7
 
1.3%
700000 6
 
1.2%
Other values (186) 222
42.7%
ValueCountFrequency (%)
33100 1
0.2%
37160 1
0.2%
41807 1
0.2%
42232 1
0.2%
44517 1
0.2%
47192 1
0.2%
47448 1
0.2%
51023 1
0.2%
55157 1
0.2%
55497 1
0.2%
ValueCountFrequency (%)
2439301950 1
0.2%
2007000000 1
0.2%
1244000000 1
0.2%
1071000000 1
0.2%
1000000000 1
0.2%
919275000 1
0.2%
727000000 1
0.2%
700000000 1
0.2%
513000000 1
0.2%
360000000 1
0.2%

기부사유
Categorical

Distinct12
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
취약계층 지원
233 
지역협력사업 지원
111 
안전, 환경 지원
44 
군경 지원
39 
동반성장사업 지원
32 
Other values (7)
61 

Length

Max length19
Median length13
Mean length7.8519231
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row취약계층 지원
2nd row취약계층 지원
3rd row취약계층 지원
4th row취약계층 지원
5th row취약계층 지원

Common Values

ValueCountFrequency (%)
취약계층 지원 233
44.8%
지역협력사업 지원 111
21.3%
안전, 환경 지원 44
 
8.5%
군경 지원 39
 
7.5%
동반성장사업 지원 32
 
6.2%
재난 지원 20
 
3.8%
업 연계 에너지복지 지원 18
 
3.5%
장학금 지원 7
 
1.3%
2022년 천연가스 기지주변 지원금 7
 
1.3%
의료 지원 4
 
0.8%
Other values (2) 5
 
1.0%

Length

2023-12-12T11:46:28.953270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지원 513
45.0%
취약계층 233
20.5%
지역협력사업 111
 
9.7%
안전 44
 
3.9%
환경 44
 
3.9%
군경 39
 
3.4%
동반성장사업 32
 
2.8%
재난 20
 
1.8%
연계 18
 
1.6%
에너지복지 18
 
1.6%
Other values (11) 67
 
5.9%

Interactions

2023-12-12T11:46:26.857134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:46:29.032917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기부금액(원)기부사유
기부금액(원)1.0000.576
기부사유0.5761.000
2023-12-12T11:46:29.136984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기부금액(원)기부사유
기부금액(원)1.0000.285
기부사유0.2851.000

Missing values

2023-12-12T11:46:26.997152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:46:27.101800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기부일자수령인기부금액(원)기부사유
02022-01-17통영시종합사회복지관2450000취약계층 지원
12022-01-17참사랑2000000취약계층 지원
22022-01-17자생원2000000취약계층 지원
32022-01-17통영시사회복지협의회9000000취약계층 지원
42022-01-17신애원2000000취약계층 지원
52022-01-17꾸러기둥지지역아동센터1000000취약계층 지원
62022-01-17거류지역아동센터1000000취약계층 지원
72022-01-17제8358부대 117연대1000000군경 지원
82022-01-17제8358부대 1대대1000000군경 지원
92022-01-18경산시장애인종합복지관1000000취약계층 지원
기부일자수령인기부금액(원)기부사유
5102022-12-282작전사령부20000000군경 지원
5112022-12-28늘푸른나무복지관5000000업 연계 에너지복지 지원
5122022-12-29201신속대응여단3000000군경 지원
5132022-12-29인천지방경찰청 서부경찰서550000군경 지원
5142022-12-296755부대(50사단)4000000군경 지원
5152022-12-29옥서지역아동센터1684000취약계층 지원
5162022-12-29501여단2대대1000000군경 지원
5172022-12-29해군2함대사령부2000000군경 지원
5182022-12-30인천도시가스1123492취약계층 지원
5192022-12-30사회복지법인 삼척시사회복지협의회15200000지역협력사업 지원

Duplicate rows

Most frequently occurring

기부일자수령인기부금액(원)기부사유# duplicates
12022-09-07(사)한국자원봉사센터협회345000취약계층 지원3
02022-07-15사단법인 커뮤니티와 경제200000000사회적 경제 지원2
22022-10-20삼척시노인복지관4000000취약계층 지원2