Overview

Dataset statistics

Number of variables6
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory53.5 B

Variable types

Text1
Categorical2
Numeric3

Dataset

Description부산광역시_지하차도현황_20230822
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15119688

Alerts

시도 has constant value ""Constant
총폭 is highly overall correlated with 높이High correlation
높이 is highly overall correlated with 총폭High correlation

Reproduction

Analysis started2024-04-21 09:56:49.865657
Analysis finished2024-04-21 09:56:51.832439
Duration1.97 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct50
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size544.0 B
2024-04-21T18:56:52.407829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length6.9038462
Min length3

Characters and Unicode

Total characters359
Distinct characters101
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)92.3%

Sample

1st row초량제1지하차도
2nd row초량제1지하차도
3rd row부산진시장 지하차도
4th row범천지하차도
5th row문전교차로 지하차도
ValueCountFrequency (%)
지하차도 12
 
18.2%
초량제1지하차도 2
 
3.0%
장전지하차도 2
 
3.0%
신선대지하차도 1
 
1.5%
와석지하차도 1
 
1.5%
장평지하차도 1
 
1.5%
자유아파트 1
 
1.5%
명지동진 1
 
1.5%
명지지하차도 1
 
1.5%
봉림지하차도 1
 
1.5%
Other values (43) 43
65.2%
2024-04-21T18:56:53.289407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
11.7%
40
 
11.1%
40
 
11.1%
40
 
11.1%
14
 
3.9%
7
 
1.9%
6
 
1.7%
5
 
1.4%
4
 
1.1%
2 4
 
1.1%
Other values (91) 157
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 326
90.8%
Space Separator 14
 
3.9%
Decimal Number 11
 
3.1%
Close Punctuation 4
 
1.1%
Open Punctuation 4
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
12.9%
40
 
12.3%
40
 
12.3%
40
 
12.3%
7
 
2.1%
6
 
1.8%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (84) 134
41.1%
Decimal Number
ValueCountFrequency (%)
2 4
36.4%
1 4
36.4%
3 2
18.2%
5 1
 
9.1%
Space Separator
ValueCountFrequency (%)
14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 326
90.8%
Common 33
 
9.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
12.9%
40
 
12.3%
40
 
12.3%
40
 
12.3%
7
 
2.1%
6
 
1.8%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (84) 134
41.1%
Common
ValueCountFrequency (%)
14
42.4%
2 4
 
12.1%
) 4
 
12.1%
( 4
 
12.1%
1 4
 
12.1%
3 2
 
6.1%
5 1
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 326
90.8%
ASCII 33
 
9.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
12.9%
40
 
12.3%
40
 
12.3%
40
 
12.3%
7
 
2.1%
6
 
1.8%
5
 
1.5%
4
 
1.2%
4
 
1.2%
4
 
1.2%
Other values (84) 134
41.1%
ASCII
ValueCountFrequency (%)
14
42.4%
2 4
 
12.1%
) 4
 
12.1%
( 4
 
12.1%
1 4
 
12.1%
3 2
 
6.1%
5 1
 
3.0%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size544.0 B
부산광역시
52 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 52
100.0%

Length

2024-04-21T18:56:53.507800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T18:56:53.662793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 52
100.0%

시군구
Categorical

Distinct12
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Memory size544.0 B
사상구
11 
해운대구
부산진구
강서구
남구
Other values (7)
18 

Length

Max length4
Median length3
Mean length3.0769231
Min length2

Unique

Unique1 ?
Unique (%)1.9%

Sample

1st row동구
2nd row동구
3rd row동구
4th row부산진구
5th row부산진구

Common Values

ValueCountFrequency (%)
사상구 11
21.2%
해운대구 8
15.4%
부산진구 6
11.5%
강서구 5
9.6%
남구 4
 
7.7%
동구 3
 
5.8%
동래구 3
 
5.8%
북구 3
 
5.8%
금정구 3
 
5.8%
기장군 3
 
5.8%
Other values (2) 3
 
5.8%

Length

2024-04-21T18:56:53.849452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사상구 11
21.2%
해운대구 8
15.4%
부산진구 6
11.5%
강서구 5
9.6%
남구 4
 
7.7%
동구 3
 
5.8%
동래구 3
 
5.8%
북구 3
 
5.8%
금정구 3
 
5.8%
기장군 3
 
5.8%
Other values (2) 3
 
5.8%

총길이
Real number (ℝ)

Distinct47
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean295.81154
Minimum8.2
Maximum2000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size596.0 B
2024-04-21T18:56:54.084533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8.2
5-th percentile17
Q139.25
median143.5
Q3385
95-th percentile1240.2
Maximum2000
Range1991.8
Interquartile range (IQR)345.75

Descriptive statistics

Standard deviation431.44588
Coefficient of variation (CV)1.4585161
Kurtosis6.8374548
Mean295.81154
Median Absolute Deviation (MAD)111.5
Skewness2.5874465
Sum15382.2
Variance186145.55
MonotonicityNot monotonic
2024-04-21T18:56:54.348008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
30.0 3
 
5.8%
40.0 2
 
3.8%
17.0 2
 
3.8%
32.0 2
 
3.8%
8.2 1
 
1.9%
160.0 1
 
1.9%
380.0 1
 
1.9%
470.0 1
 
1.9%
79.0 1
 
1.9%
48.0 1
 
1.9%
Other values (37) 37
71.2%
ValueCountFrequency (%)
8.2 1
 
1.9%
14.0 1
 
1.9%
17.0 2
3.8%
25.0 1
 
1.9%
26.0 1
 
1.9%
30.0 3
5.8%
32.0 2
3.8%
35.0 1
 
1.9%
37.0 1
 
1.9%
40.0 2
3.8%
ValueCountFrequency (%)
2000.0 1
1.9%
1800.0 1
1.9%
1370.0 1
1.9%
1134.0 1
1.9%
1000.0 1
1.9%
604.0 1
1.9%
566.0 1
1.9%
470.0 1
1.9%
460.0 1
1.9%
436.0 1
1.9%

총폭
Real number (ℝ)

HIGH CORRELATION 

Distinct31
Distinct (%)59.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.011538
Minimum3.5
Maximum46
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size596.0 B
2024-04-21T18:56:54.591544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.5
5-th percentile4.22
Q110.75
median15.75
Q320
95-th percentile30
Maximum46
Range42.5
Interquartile range (IQR)9.25

Descriptive statistics

Standard deviation7.9744851
Coefficient of variation (CV)0.49804615
Kurtosis2.7621779
Mean16.011538
Median Absolute Deviation (MAD)4.25
Skewness1.09209
Sum832.6
Variance63.592413
MonotonicityNot monotonic
2024-04-21T18:56:54.790505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
17.0 4
 
7.7%
20.0 4
 
7.7%
15.0 3
 
5.8%
30.0 3
 
5.8%
18.0 3
 
5.8%
10.0 3
 
5.8%
14.0 3
 
5.8%
16.0 3
 
5.8%
4.0 2
 
3.8%
8.0 2
 
3.8%
Other values (21) 22
42.3%
ValueCountFrequency (%)
3.5 1
 
1.9%
4.0 2
3.8%
4.4 1
 
1.9%
5.5 1
 
1.9%
6.0 1
 
1.9%
7.2 1
 
1.9%
8.0 2
3.8%
9.3 1
 
1.9%
10.0 3
5.8%
11.0 1
 
1.9%
ValueCountFrequency (%)
46.0 1
 
1.9%
30.0 3
5.8%
27.0 1
 
1.9%
25.5 1
 
1.9%
25.0 2
3.8%
21.7 1
 
1.9%
20.5 1
 
1.9%
20.0 4
7.7%
18.0 3
5.8%
17.6 1
 
1.9%

높이
Real number (ℝ)

HIGH CORRELATION 

Distinct20
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5313462
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size596.0 B
2024-04-21T18:56:54.984162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2.155
Q14.2
median4.5
Q34.825
95-th percentile6.725
Maximum10
Range8
Interquartile range (IQR)0.625

Descriptive statistics

Standard deviation1.2862455
Coefficient of variation (CV)0.28385506
Kurtosis6.4405111
Mean4.5313462
Median Absolute Deviation (MAD)0.3
Skewness1.2921011
Sum235.63
Variance1.6544276
MonotonicityNot monotonic
2024-04-21T18:56:55.197732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
4.5 12
23.1%
5.0 8
15.4%
4.8 4
 
7.7%
3.5 3
 
5.8%
4.6 3
 
5.8%
4.2 3
 
5.8%
4.7 3
 
5.8%
4.0 2
 
3.8%
2.0 2
 
3.8%
2.2 2
 
3.8%
Other values (10) 10
19.2%
ValueCountFrequency (%)
2.0 2
3.8%
2.1 1
 
1.9%
2.2 2
3.8%
3.5 3
5.8%
4.0 2
3.8%
4.1 1
 
1.9%
4.2 3
5.8%
4.3 1
 
1.9%
4.35 1
 
1.9%
4.38 1
 
1.9%
ValueCountFrequency (%)
10.0 1
 
1.9%
7.4 1
 
1.9%
7.0 1
 
1.9%
6.5 1
 
1.9%
5.0 8
15.4%
4.9 1
 
1.9%
4.8 4
 
7.7%
4.7 3
 
5.8%
4.6 3
 
5.8%
4.5 12
23.1%

Interactions

2024-04-21T18:56:51.055985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:50.173610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:50.644144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:51.225363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:50.339732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:50.790249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:51.372072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:50.475988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T18:56:50.907354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T18:56:55.354266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설명시군구총길이총폭높이
시설명1.0001.0000.0000.8161.000
시군구1.0001.0000.6230.6630.611
총길이0.0000.6231.0000.7620.000
총폭0.8160.6630.7621.0000.707
높이1.0000.6110.0000.7071.000
2024-04-21T18:56:55.517584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
총길이총폭높이시군구
총길이1.0000.2900.4050.298
총폭0.2901.0000.6260.336
높이0.4050.6261.0000.328
시군구0.2980.3360.3281.000

Missing values

2024-04-21T18:56:51.582898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T18:56:51.763827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설명시도시군구총길이총폭높이
0초량제1지하차도부산광역시동구165.015.03.5
1초량제1지하차도부산광역시동구175.015.03.5
2부산진시장 지하차도부산광역시동구178.012.04.0
3범천지하차도부산광역시부산진구423.016.04.5
4문전교차로 지하차도부산광역시부산진구436.08.04.7
5개금지하차도부산광역시부산진구179.011.34.3
6당감지하차도부산광역시부산진구141.012.14.0
7감고개공원복개구조물부산광역시부산진구105.046.06.5
8범일지하도부산광역시부산진구25.08.03.5
9내성 지하차도부산광역시동래구185.017.04.6
시설명시도시군구총길이총폭높이
42동부지하차도부산광역시기장군40.015.04.8
43무곡지하차도부산광역시기장군17.011.04.38
44삼성2지하차도부산광역시기장군14.020.04.35
45센텀시티지하차도부산광역시해운대구1370.030.05.0
46신선대지하차도부산광역시남구1800.014.04.7
47와석지하차도부산광역시북구604.010.04.5
48장전지하차도부산광역시금정구1000.020.54.8
49장전지하차도부산광역시금정구220.09.34.8
50장평지하차도부산광역시사하구2000.021.74.5
51감천지하차도부산광역시사하구1134.010.04.7