Overview

Dataset statistics

Number of variables5
Number of observations89
Missing cells202
Missing cells (%)45.4%
Duplicate rows4
Duplicate rows (%)4.5%
Total size in memory3.7 KiB
Average record size in memory42.5 B

Variable types

Text1
Unsupported3
Numeric1

Dataset

Description경남개발공사 년간 결산 내역을 정리한 자료입니다. 본자료는 2021년 결산 내역이며, 회계 예산의 결산내역에 필요한 데이터로 참고하시기 바랍니다.
URLhttps://www.data.go.kr/data/15111599/fileData.do

Alerts

Dataset has 4 (4.5%) duplicate rowsDuplicates
결 산 내 역 has 3 (3.4%) missing valuesMissing
Unnamed: 1 has 27 (30.3%) missing valuesMissing
Unnamed: 2 has 72 (80.9%) missing valuesMissing
Unnamed: 3 has 29 (32.6%) missing valuesMissing
Unnamed: 4 has 71 (79.8%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 12:46:43.418605
Analysis finished2023-12-12 12:46:43.916235
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

결 산 내 역
Text

MISSING 

Distinct80
Distinct (%)93.0%
Missing3
Missing (%)3.4%
Memory size844.0 B
2023-12-12T21:46:44.129195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length9.4186047
Min length5

Characters and Unicode

Total characters810
Distinct characters123
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)89.5%

Sample

1st row제25기 2021년 12월 31일 현재
2nd row제24기 2020년 12월 31일 현재
3rd row경남개발공사
4th row과                     목
5th row자                     산
ValueCountFrequency (%)
1 13
 
7.4%
2 11
 
6.2%
4 8
 
4.5%
3 8
 
4.5%
정부보조금 5
 
2.8%
감가상각누계액 4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
3
 
1.7%
Other values (94) 112
63.6%
2023-12-12T21:46:44.591722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
141
 
17.4%
. 58
 
7.2%
  48
 
5.9%
33
 
4.1%
28
 
3.5%
1 22
 
2.7%
21
 
2.6%
2 20
 
2.5%
18
 
2.2%
14
 
1.7%
Other values (113) 407
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 462
57.0%
Space Separator 189
23.3%
Decimal Number 78
 
9.6%
Other Punctuation 58
 
7.2%
Uppercase Letter 7
 
0.9%
Open Punctuation 6
 
0.7%
Close Punctuation 6
 
0.7%
Letter Number 4
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
7.1%
28
 
6.1%
21
 
4.5%
18
 
3.9%
14
 
3.0%
12
 
2.6%
11
 
2.4%
10
 
2.2%
10
 
2.2%
10
 
2.2%
Other values (93) 295
63.9%
Decimal Number
ValueCountFrequency (%)
1 22
28.2%
2 20
25.6%
3 10
12.8%
4 9
11.5%
0 4
 
5.1%
5 4
 
5.1%
6 3
 
3.8%
7 2
 
2.6%
9 2
 
2.6%
8 2
 
2.6%
Letter Number
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
141
74.6%
  48
 
25.4%
Other Punctuation
ValueCountFrequency (%)
. 58
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 462
57.0%
Common 337
41.6%
Latin 11
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
7.1%
28
 
6.1%
21
 
4.5%
18
 
3.9%
14
 
3.0%
12
 
2.6%
11
 
2.4%
10
 
2.2%
10
 
2.2%
10
 
2.2%
Other values (93) 295
63.9%
Common
ValueCountFrequency (%)
141
41.8%
. 58
17.2%
  48
 
14.2%
1 22
 
6.5%
2 20
 
5.9%
3 10
 
3.0%
4 9
 
2.7%
( 6
 
1.8%
) 6
 
1.8%
0 4
 
1.2%
Other values (5) 13
 
3.9%
Latin
ValueCountFrequency (%)
I 7
63.6%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 462
57.0%
ASCII 296
36.5%
None 48
 
5.9%
Number Forms 4
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
141
47.6%
. 58
19.6%
1 22
 
7.4%
2 20
 
6.8%
3 10
 
3.4%
4 9
 
3.0%
I 7
 
2.4%
( 6
 
2.0%
) 6
 
2.0%
0 4
 
1.4%
Other values (5) 13
 
4.4%
None
ValueCountFrequency (%)
  48
100.0%
Hangul
ValueCountFrequency (%)
33
 
7.1%
28
 
6.1%
21
 
4.5%
18
 
3.9%
14
 
3.0%
12
 
2.6%
11
 
2.4%
10
 
2.2%
10
 
2.2%
10
 
2.2%
Other values (93) 295
63.9%
Number Forms
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing27
Missing (%)30.3%
Memory size844.0 B

Unnamed: 2
Real number (ℝ)

MISSING 

Distinct16
Distinct (%)94.1%
Missing72
Missing (%)80.9%
Infinite0
Infinite (%)0.0%
Mean3.9383244 × 1011
Minimum6.7843821 × 108
Maximum1.1158586 × 1012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2023-12-12T21:46:44.726985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6.7843821 × 108
5-th percentile1.1301044 × 109
Q11.3792948 × 1011
median3.2501406 × 1011
Q34.568924 × 1011
95-th percentile1.1158586 × 1012
Maximum1.1158586 × 1012
Range1.1151801 × 1012
Interquartile range (IQR)3.1896292 × 1011

Descriptive statistics

Standard deviation3.4863778 × 1011
Coefficient of variation (CV)0.88524394
Kurtosis0.40062454
Mean3.9383244 × 1011
Median Absolute Deviation (MAD)1.8708458 × 1011
Skewness1.0058947
Sum6.6951514 × 1012
Variance1.215483 × 1023
MonotonicityNot monotonic
2023-12-12T21:46:44.843608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
1115858570743 2
 
2.2%
425109014229 1
 
1.1%
325014061267 1
 
1.1%
678438209 1
 
1.1%
321096625134 1
 
1.1%
1243020970 1
 
1.1%
1995976954 1
 
1.1%
365735495247 1
 
1.1%
290700737836 1
 
1.1%
368265429541 1
 
1.1%
Other values (6) 6
 
6.7%
(Missing) 72
80.9%
ValueCountFrequency (%)
678438209 1
1.1%
1243020970 1
1.1%
1995976954 1
1.1%
90217776898 1
1.1%
137929480000 1
1.1%
228745146468 1
1.1%
290700737836 1
1.1%
321096625134 1
1.1%
325014061267 1
1.1%
365735495247 1
1.1%
ValueCountFrequency (%)
1115858570743 2
2.2%
790844509476 1
1.1%
658966167377 1
1.1%
456892403366 1
1.1%
425109014229 1
1.1%
368265429541 1
1.1%
365735495247 1
1.1%
325014061267 1
1.1%
321096625134 1
1.1%
290700737836 1
1.1%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing29
Missing (%)32.6%
Memory size844.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing71
Missing (%)79.8%
Memory size844.0 B

Interactions

2023-12-12T21:46:43.522232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:46:44.932431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결 산 내 역Unnamed: 2
결 산 내 역1.0001.000
Unnamed: 21.0001.000

Missing values

2023-12-12T21:46:43.627451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:46:43.735561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:46:43.848343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

결 산 내 역Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4
0<NA>NaN<NA>NaNNaN
1제25기 2021년 12월 31일 현재NaN<NA>NaNNaN
2제24기 2020년 12월 31일 현재NaN<NA>NaNNaN
3<NA>NaN<NA>NaNNaN
4<NA>NaN<NA>NaNNaN
5경남개발공사NaN<NA>NaN(단위 : 원)
6과                     목제25기 말<NA>제24기 말NaN
7자                     산NaN<NA>NaNNaN
8I. 유동자산NaN790844509476NaN814001485605
9(1) 당좌자산NaN365735495247NaN379754440152
결 산 내 역Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4
79Ⅲ. 자본조정0<NA>0NaN
80Ⅳ. 기타포괄손익누계액NaN90217776898NaN90240152649
811. 자산재평가이익90217776898<NA>90240152649NaN
82Ⅴ. 이익잉여금NaN228745146468NaN231553484571
831. 이익준비금47311955000<NA>47210645000NaN
842. 감채적립금80688273300<NA>80232400000NaN
853. 사업준비금103553256271<NA>103097383030NaN
864. 미처분이익잉여금-2808338103<NA>1013056541NaN
87자    본    총    계NaN456892403366NaN459723117220
88부 채 와 자 본 총 계NaN1115858570743NaN1137801445496

Duplicate rows

Most frequently occurring

결 산 내 역Unnamed: 2# duplicates
1감가상각누계액<NA>4
2정부보조금<NA>3
3<NA><NA>3
0정부보조금<NA>2