Overview

Dataset statistics

Number of variables5
Number of observations37
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory46.6 B

Variable types

Text1
Numeric3
Categorical1

Dataset

Description부산도시공사_결산내역_20211231
Author부산도시공사
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15109853

Alerts

2021년(억) is highly overall correlated with 2020년(억) and 1 other fieldsHigh correlation
2020년(억) is highly overall correlated with 2021년(억) and 1 other fieldsHigh correlation
비고 is highly overall correlated with 2021년(억) and 1 other fieldsHigh correlation
과목 has unique valuesUnique
2021년(억) has 1 (2.7%) zerosZeros
2020년(억) has 1 (2.7%) zerosZeros
증감(억) has 2 (5.4%) zerosZeros

Reproduction

Analysis started2023-12-10 16:12:25.634598
Analysis finished2023-12-10 16:12:26.709234
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

과목
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-11T01:12:26.824828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length10.783784
Min length4

Characters and Unicode

Total characters399
Distinct characters69
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row1.유동자산
2nd row1.유동자산 - 당좌자산
3rd row1.유동자산 - 재고자산
4th row2. 비유동자산
5th row2. 비유동자산 - 투자자산
ValueCountFrequency (%)
18
17.8%
2 11
 
10.9%
비유동자산 5
 
5.0%
매출원가 5
 
5.0%
1.매출액 5
 
5.0%
4 4
 
4.0%
1.유동자산 3
 
3.0%
영업이익 3
 
3.0%
3 3
 
3.0%
개발사업 2
 
2.0%
Other values (35) 42
41.6%
2023-12-11T01:12:27.166506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
64
 
16.0%
. 33
 
8.3%
20
 
5.0%
- 18
 
4.5%
15
 
3.8%
13
 
3.3%
13
 
3.3%
12
 
3.0%
12
 
3.0%
2 12
 
3.0%
Other values (59) 187
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 251
62.9%
Space Separator 64
 
16.0%
Other Punctuation 33
 
8.3%
Decimal Number 33
 
8.3%
Dash Punctuation 18
 
4.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
8.0%
15
 
6.0%
13
 
5.2%
13
 
5.2%
12
 
4.8%
12
 
4.8%
12
 
4.8%
11
 
4.4%
11
 
4.4%
10
 
4.0%
Other values (49) 122
48.6%
Decimal Number
ValueCountFrequency (%)
2 12
36.4%
1 10
30.3%
4 4
 
12.1%
3 3
 
9.1%
6 2
 
6.1%
5 1
 
3.0%
7 1
 
3.0%
Space Separator
ValueCountFrequency (%)
64
100.0%
Other Punctuation
ValueCountFrequency (%)
. 33
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 251
62.9%
Common 148
37.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
8.0%
15
 
6.0%
13
 
5.2%
13
 
5.2%
12
 
4.8%
12
 
4.8%
12
 
4.8%
11
 
4.4%
11
 
4.4%
10
 
4.0%
Other values (49) 122
48.6%
Common
ValueCountFrequency (%)
64
43.2%
. 33
22.3%
- 18
 
12.2%
2 12
 
8.1%
1 10
 
6.8%
4 4
 
2.7%
3 3
 
2.0%
6 2
 
1.4%
5 1
 
0.7%
7 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 251
62.9%
ASCII 148
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
64
43.2%
. 33
22.3%
- 18
 
12.2%
2 12
 
8.1%
1 10
 
6.8%
4 4
 
2.7%
3 3
 
2.0%
6 2
 
1.4%
5 1
 
0.7%
7 1
 
0.7%
Hangul
ValueCountFrequency (%)
20
 
8.0%
15
 
6.0%
13
 
5.2%
13
 
5.2%
12
 
4.8%
12
 
4.8%
12
 
4.8%
11
 
4.4%
11
 
4.4%
10
 
4.0%
Other values (49) 122
48.6%

2021년(억)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct35
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5285
Minimum0
Maximum29614
Zeros1
Zeros (%)2.7%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-11T01:12:27.311688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10.2
Q1186
median1043
Q38062
95-th percentile21806.8
Maximum29614
Range29614
Interquartile range (IQR)7876

Descriptive statistics

Standard deviation7994.0323
Coefficient of variation (CV)1.5125889
Kurtosis3.2081846
Mean5285
Median Absolute Deviation (MAD)1040
Skewness1.9172238
Sum195545
Variance63904553
MonotonicityNot monotonic
2023-12-11T01:12:27.421931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
280 2
 
5.4%
29614 2
 
5.4%
19855 1
 
2.7%
461 1
 
2.7%
42 1
 
2.7%
3732 1
 
2.7%
1043 1
 
2.7%
2283 1
 
2.7%
390 1
 
2.7%
16 1
 
2.7%
Other values (25) 25
67.6%
ValueCountFrequency (%)
0 1
2.7%
3 1
2.7%
12 1
2.7%
16 1
2.7%
42 1
2.7%
83 1
2.7%
94 1
2.7%
166 1
2.7%
173 1
2.7%
186 1
2.7%
ValueCountFrequency (%)
29614 2
5.4%
19855 1
2.7%
19248 1
2.7%
13946 1
2.7%
12364 1
2.7%
10366 1
2.7%
9759 1
2.7%
8392 1
2.7%
8062 1
2.7%
6205 1
2.7%

2020년(억)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct35
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5625.8108
Minimum0
Maximum30863
Zeros1
Zeros (%)2.7%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-11T01:12:27.538383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5.2
Q1328
median1394
Q37906
95-th percentile23398.2
Maximum30863
Range30863
Interquartile range (IQR)7578

Descriptive statistics

Standard deviation8340.8807
Coefficient of variation (CV)1.4826095
Kurtosis3.1225175
Mean5625.8108
Median Absolute Deviation (MAD)1388
Skewness1.9104942
Sum208155
Variance69570291
MonotonicityNot monotonic
2023-12-11T01:12:27.687715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
945 2
 
5.4%
30863 2
 
5.4%
21532 1
 
2.7%
998 1
 
2.7%
12 1
 
2.7%
3899 1
 
2.7%
762 1
 
2.7%
2807 1
 
2.7%
328 1
 
2.7%
2 1
 
2.7%
Other values (25) 25
67.6%
ValueCountFrequency (%)
0 1
2.7%
2 1
2.7%
6 1
2.7%
12 1
2.7%
17 1
2.7%
170 1
2.7%
181 1
2.7%
245 1
2.7%
260 1
2.7%
328 1
2.7%
ValueCountFrequency (%)
30863 2
5.4%
21532 1
2.7%
19493 1
2.7%
15940 1
2.7%
12613 1
2.7%
11370 1
2.7%
9759 1
2.7%
9331 1
2.7%
7906 1
2.7%
6205 1
2.7%

증감(억)
Real number (ℝ)

ZEROS 

Distinct34
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-340.81081
Minimum-1994
Maximum693
Zeros2
Zeros (%)5.4%
Negative24
Negative (%)64.9%
Memory size465.0 B
2023-12-11T01:12:27.837284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1994
5-th percentile-1681
Q1-591
median-167
Q34
95-th percentile439.6
Maximum693
Range2687
Interquartile range (IQR)595

Descriptive statistics

Standard deviation616.87806
Coefficient of variation (CV)-1.8100308
Kurtosis0.83822587
Mean-340.81081
Median Absolute Deviation (MAD)330
Skewness-1.0238364
Sum-12610
Variance380538.55
MonotonicityNot monotonic
2023-12-11T01:12:27.993986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
-665 2
 
5.4%
0 2
 
5.4%
-1249 2
 
5.4%
-1677 1
 
2.7%
-537 1
 
2.7%
-167 1
 
2.7%
281 1
 
2.7%
-524 1
 
2.7%
62 1
 
2.7%
14 1
 
2.7%
Other values (24) 24
64.9%
ValueCountFrequency (%)
-1994 1
2.7%
-1697 1
2.7%
-1677 1
2.7%
-1249 2
5.4%
-1004 1
2.7%
-704 1
2.7%
-665 2
5.4%
-591 1
2.7%
-556 1
2.7%
-537 1
2.7%
ValueCountFrequency (%)
693 1
2.7%
486 1
2.7%
428 1
2.7%
317 1
2.7%
281 1
2.7%
62 1
2.7%
30 1
2.7%
19 1
2.7%
14 1
2.7%
4 1
2.7%

비고
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size428.0 B
손익계산서
19 
재무상태표 [요약]
18 

Length

Max length10
Median length5
Mean length7.4324324
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row재무상태표 [요약]
2nd row재무상태표 [요약]
3rd row재무상태표 [요약]
4th row재무상태표 [요약]
5th row재무상태표 [요약]

Common Values

ValueCountFrequency (%)
손익계산서 19
51.4%
재무상태표 [요약] 18
48.6%

Length

2023-12-11T01:12:28.125569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:12:28.264100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
손익계산서 19
34.5%
재무상태표 18
32.7%
요약 18
32.7%

Interactions

2023-12-11T01:12:26.329808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:25.815418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:26.073698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:26.418239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:25.910972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:26.149362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:26.495727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:25.995327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:12:26.228763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:12:28.362471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과목2021년(억)2020년(억)증감(억)비고
과목1.0001.0001.0001.0001.000
2021년(억)1.0001.0000.9750.7720.543
2020년(억)1.0000.9751.0000.7480.735
증감(억)1.0000.7720.7481.0000.539
비고1.0000.5430.7350.5391.000
2023-12-11T01:12:28.494759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2021년(억)2020년(억)증감(억)비고
2021년(억)1.0000.978-0.3490.537
2020년(억)0.9781.000-0.4690.509
증감(억)-0.349-0.4691.0000.472
비고0.5370.5090.4721.000

Missing values

2023-12-11T01:12:26.590914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:12:26.675171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과목2021년(억)2020년(억)증감(억)비고
01.유동자산1985521532-1677재무상태표 [요약]
11.유동자산 - 당좌자산59095592317재무상태표 [요약]
21.유동자산 - 재고자산1394615940-1994재무상태표 [요약]
32. 비유동자산97599331428재무상태표 [요약]
42. 비유동자산 - 투자자산36-3재무상태표 [요약]
52. 비유동자산 - 유형자산83927906486재무상태표 [요약]
62. 비유동자산 - 무형자산1217-5재무상태표 [요약]
72. 비유동자산 - 기타비유동자산13521402-50재무상태표 [요약]
8자산합계2961430863-1249재무상태표 [요약]
9부채합계1. 유동부채23041611693재무상태표 [요약]
과목2021년(억)2020년(억)증감(억)비고
272. 매출원가 - 기타사업16214손익계산서
283. 매출총이익461998-537손익계산서
293. 매출총이익 - 판매비와 관리비26424519손익계산서
304. 영업이익197753-556손익계산서
314. 영업이익 - 영업외 수익166373-207손익계산서
324. 영업이익 - 영업외 비용83181-98손익계산서
335. 경상이익280945-665손익계산서
346. 세전순이익280945-665손익계산서
356. 세전순이익 - 법인세비용186260-74손익계산서
367. 당기순이익94685-591손익계산서