Overview

Dataset statistics

Number of variables7
Number of observations6655
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory396.6 KiB
Average record size in memory61.0 B

Variable types

Categorical3
Numeric4

Dataset

Description회계년도,지급처리일,지급금액,분야코드,분야명,부문코드,부문명
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-12256/S/1/datasetView.do

Alerts

분야명 is highly overall correlated with 분야코드 and 2 other fieldsHigh correlation
부문명 is highly overall correlated with 분야코드 and 2 other fieldsHigh correlation
지급처리일 is highly overall correlated with 회계년도High correlation
분야코드 is highly overall correlated with 부문코드 and 2 other fieldsHigh correlation
부문코드 is highly overall correlated with 분야코드 and 2 other fieldsHigh correlation
회계년도 is highly overall correlated with 지급처리일High correlation

Reproduction

Analysis started2024-05-18 03:54:36.499404
Analysis finished2024-05-18 03:54:44.622914
Duration8.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

회계년도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size52.1 KiB
2023
3573 
2024
3082 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024
2nd row2024
3rd row2024
4th row2024
5th row2024

Common Values

ValueCountFrequency (%)
2023 3573
53.7%
2024 3082
46.3%

Length

2024-05-18T12:54:44.811884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T12:54:45.121915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 3573
53.7%
2024 3082
46.3%

지급처리일
Real number (ℝ)

HIGH CORRELATION 

Distinct252
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20235302
Minimum20230519
Maximum20240517
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size58.6 KiB
2024-05-18T12:54:45.586180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20230519
5-th percentile20230609
Q120230825
median20231220
Q320240308
95-th percentile20240502
Maximum20240517
Range9998
Interquartile range (IQR)9483

Descriptive statistics

Standard deviation4706.0139
Coefficient of variation (CV)0.00023256455
Kurtosis-1.9798964
Mean20235302
Median Absolute Deviation (MAD)619
Skewness0.12066717
Sum1.3466593 × 1011
Variance22146567
MonotonicityDecreasing
2024-05-18T12:54:46.051243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20240119 40
 
0.6%
20240229 39
 
0.6%
20231031 38
 
0.6%
20231205 38
 
0.6%
20240125 38
 
0.6%
20240130 38
 
0.6%
20231228 38
 
0.6%
20230908 38
 
0.6%
20240510 38
 
0.6%
20240329 38
 
0.6%
Other values (242) 6272
94.2%
ValueCountFrequency (%)
20230519 22
0.3%
20230522 19
0.3%
20230523 19
0.3%
20230524 27
0.4%
20230525 24
0.4%
20230526 24
0.4%
20230530 21
0.3%
20230531 33
0.5%
20230601 25
0.4%
20230602 21
0.3%
ValueCountFrequency (%)
20240517 36
0.5%
20240516 35
0.5%
20240515 1
 
< 0.1%
20240514 34
0.5%
20240513 36
0.5%
20240510 38
0.6%
20240509 35
0.5%
20240508 36
0.5%
20240507 36
0.5%
20240506 1
 
< 0.1%

지급금액
Real number (ℝ)

Distinct6529
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.9570994 × 109
Minimum-7.2252744 × 109
Maximum1.1413898 × 1012
Zeros0
Zeros (%)0.0%
Negative61
Negative (%)0.9%
Memory size58.6 KiB
2024-05-18T12:54:46.609954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-7.2252744 × 109
5-th percentile300000
Q113854204
median1.741657 × 108
Q39.4063226 × 108
95-th percentile9.5293729 × 109
Maximum1.1413898 × 1012
Range1.1486151 × 1012
Interquartile range (IQR)9.2677806 × 108

Descriptive statistics

Standard deviation3.5197333 × 1010
Coefficient of variation (CV)7.1003889
Kurtosis294.54624
Mean4.9570994 × 109
Median Absolute Deviation (MAD)1.730157 × 108
Skewness14.70015
Sum3.2989496 × 1013
Variance1.2388523 × 1021
MonotonicityNot monotonic
2024-05-18T12:54:47.097149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300000 9
 
0.1%
1400000 7
 
0.1%
400000 6
 
0.1%
100000 6
 
0.1%
50000 5
 
0.1%
200000 4
 
0.1%
600000 4
 
0.1%
2000000 4
 
0.1%
300000000 3
 
< 0.1%
56000 3
 
< 0.1%
Other values (6519) 6604
99.2%
ValueCountFrequency (%)
-7225274410 1
< 0.1%
-2076940000 1
< 0.1%
-1313395884 1
< 0.1%
-1226000000 1
< 0.1%
-977439950 1
< 0.1%
-728883085 1
< 0.1%
-566439550 1
< 0.1%
-421883500 1
< 0.1%
-306753350 1
< 0.1%
-288745360 1
< 0.1%
ValueCountFrequency (%)
1141389790000 1
< 0.1%
689239678250 1
< 0.1%
686153989540 1
< 0.1%
645800722000 1
< 0.1%
611800409000 1
< 0.1%
545808177170 1
< 0.1%
530389826000 1
< 0.1%
453714505630 1
< 0.1%
450602236000 1
< 0.1%
443576868498 1
< 0.1%

분야코드
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107.0263
Minimum10
Maximum900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size58.6 KiB
2024-05-18T12:54:47.441073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile10
Q160
median80
Q3110
95-th percentile150
Maximum900
Range890
Interquartile range (IQR)50

Descriptive statistics

Standard deviation161.20929
Coefficient of variation (CV)1.5062587
Kurtosis19.122403
Mean107.0263
Median Absolute Deviation (MAD)20
Skewness4.4530093
Sum712260
Variance25988.436
MonotonicityNot monotonic
2024-05-18T12:54:47.919190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
80 1094
16.4%
70 909
13.7%
60 873
13.1%
120 601
9.0%
10 597
9.0%
110 512
7.7%
20 500
7.5%
140 372
 
5.6%
90 356
 
5.3%
900 251
 
3.8%
Other values (3) 590
8.9%
ValueCountFrequency (%)
10 597
9.0%
20 500
7.5%
50 248
 
3.7%
60 873
13.1%
70 909
13.7%
80 1094
16.4%
90 356
 
5.3%
100 188
 
2.8%
110 512
7.7%
120 601
9.0%
ValueCountFrequency (%)
900 251
 
3.8%
150 154
 
2.3%
140 372
 
5.6%
120 601
9.0%
110 512
7.7%
100 188
 
2.8%
90 356
 
5.3%
80 1094
16.4%
70 909
13.7%
60 873
13.1%

분야명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size52.1 KiB
사회복지
1094 
환경
909 
문화및관광
873 
교통및물류
601 
일반공공행정
597 
Other values (8)
2581 

Length

Max length11
Median length6
Mean length4.8589031
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육
2nd row사회복지
3rd row사회복지
4th row사회복지
5th row사회복지

Common Values

ValueCountFrequency (%)
사회복지 1094
16.4%
환경 909
13.7%
문화및관광 873
13.1%
교통및물류 601
9.0%
일반공공행정 597
9.0%
산업ㆍ중소기업및에너지 512
7.7%
공공질서및안전 500
7.5%
국토및지역개발 372
 
5.6%
보건 356
 
5.3%
기타 251
 
3.8%
Other values (3) 590
8.9%

Length

2024-05-18T12:54:48.187884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사회복지 1094
16.4%
환경 909
13.7%
문화및관광 873
13.1%
교통및물류 601
9.0%
일반공공행정 597
9.0%
산업ㆍ중소기업및에너지 512
7.7%
공공질서및안전 500
7.5%
국토및지역개발 372
 
5.6%
보건 356
 
5.3%
기타 251
 
3.8%
Other values (3) 590
8.9%

부문코드
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean110.47258
Minimum11
Maximum901
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size58.6 KiB
2024-05-18T12:54:48.532204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile14
Q162
median84
Q3114
95-th percentile153
Maximum901
Range890
Interquartile range (IQR)52

Descriptive statistics

Standard deviation160.66594
Coefficient of variation (CV)1.4543513
Kurtosis19.147552
Mean110.47258
Median Absolute Deviation (MAD)23
Skewness4.4569973
Sum735195
Variance25813.543
MonotonicityNot monotonic
2024-05-18T12:54:48.927836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
901 251
 
3.8%
121 218
 
3.3%
16 216
 
3.2%
142 206
 
3.1%
126 202
 
3.0%
91 202
 
3.0%
26 202
 
3.0%
76 201
 
3.0%
82 199
 
3.0%
61 199
 
3.0%
Other values (29) 4559
68.5%
ValueCountFrequency (%)
11 191
2.9%
13 25
 
0.4%
14 165
2.5%
16 216
3.2%
23 103
1.5%
25 195
2.9%
26 202
3.0%
51 145
2.2%
53 103
1.5%
61 199
3.0%
ValueCountFrequency (%)
901 251
3.8%
153 154
2.3%
142 206
3.1%
141 166
2.5%
126 202
3.0%
123 181
2.7%
121 218
3.3%
116 151
2.3%
114 198
3.0%
113 112
1.7%

부문명
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size52.1 KiB
기타
 
251
도로
 
218
일반행정
 
216
지역및도시
 
206
소방
 
202
Other values (34)
5562 

Length

Max length10
Median length8
Mean length4.8740796
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유아및초중등교육
2nd row사회복지일반
3rd row주택
4th row노동
5th row노인ㆍ청소년

Common Values

ValueCountFrequency (%)
기타 251
 
3.8%
도로 218
 
3.3%
일반행정 216
 
3.2%
지역및도시 206
 
3.1%
소방 202
 
3.0%
대중교통ㆍ물류등기타 202
 
3.0%
보건의료 202
 
3.0%
환경보호일반 201
 
3.0%
취약계층지원 199
 
3.0%
문화예술 199
 
3.0%
Other values (29) 4559
68.5%

Length

2024-05-18T12:54:49.385687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
기타 251
 
3.8%
도로 218
 
3.3%
일반행정 216
 
3.2%
지역및도시 206
 
3.1%
소방 202
 
3.0%
대중교통ㆍ물류등기타 202
 
3.0%
보건의료 202
 
3.0%
환경보호일반 201
 
3.0%
문화예술 199
 
3.0%
취약계층지원 199
 
3.0%
Other values (29) 4559
68.5%

Interactions

2024-05-18T12:54:42.621671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:38.366073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:39.751341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:41.283169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:42.968529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:38.684591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:40.145041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:41.607069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:43.313970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:39.037276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:40.574296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:41.973106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:43.671400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:39.421793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:40.912929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T12:54:42.291116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T12:54:49.664261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회계년도지급처리일지급금액분야코드분야명부문코드부문명
회계년도1.0001.0000.0330.0000.0130.0000.000
지급처리일1.0001.0000.0320.0000.0000.0000.000
지급금액0.0330.0321.0000.0410.1170.0410.415
분야코드0.0000.0000.0411.0001.0001.0001.000
분야명0.0130.0000.1171.0001.0001.0001.000
부문코드0.0000.0000.0411.0001.0001.0001.000
부문명0.0000.0000.4151.0001.0001.0001.000
2024-05-18T12:54:49.962095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야명회계년도부문명
분야명1.0000.0120.998
회계년도0.0121.0000.000
부문명0.9980.0001.000
2024-05-18T12:54:50.209881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지급처리일지급금액분야코드부문코드회계년도분야명부문명
지급처리일1.0000.153-0.005-0.0050.9870.0000.000
지급금액0.1531.0000.0740.0680.0250.0530.171
분야코드-0.0050.0741.0000.9940.0000.9990.997
부문코드-0.0050.0680.9941.0000.0000.9990.997
회계년도0.9870.0250.0000.0001.0000.0120.000
분야명0.0000.0530.9990.9990.0121.0000.998
부문명0.0000.1710.9970.9970.0000.9981.000

Missing values

2024-05-18T12:54:44.032973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T12:54:44.410952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회계년도지급처리일지급금액분야코드분야명부문코드부문명
020242024051732180050교육51유아및초중등교육
120242024051720000080사회복지89사회복지일반
22024202405172979304475080사회복지88주택
3202420240517242622080사회복지86노동
420242024051726648857418080사회복지85노인ㆍ청소년
520242024051718629266279080사회복지84보육ㆍ가족및여성
6202420240517323340875280사회복지82취약계층지원
720242024051778190787070환경76환경보호일반
820242024051715458233070환경74자연
920242024051752963978070환경73대기
회계년도지급처리일지급금액분야코드분야명부문코드부문명
664520232023051920380060문화및관광65문화및관광일반
664620232023051911494960100농림해양수산101농업ㆍ농촌
664720232023051928773895090보건91보건의료
66482023202305193177926020공공질서및안전26소방
664920232023051962733888010일반공공행정11입법및선거관리
6650202320230519437480080사회복지89사회복지일반
66512023202305198092303060문화및관광61문화예술
66522023202305195016750080사회복지85노인ㆍ청소년
6653202320230519687663855080사회복지84보육ㆍ가족및여성
6654202320230519274527439080사회복지82취약계층지원